Python-將json文件寫入ES資料庫_ZenDei技術網路在線

Python-將json文件寫入ES資料庫

-Advertisement-

1、安裝Elasticsearch資料庫 PS：在此之前需首先安裝Java SE環境下載elasticsearch-6.5.2版本，進入/elasticsearch-6.5.2/bin目錄，雙擊執行elasticsearch.bat 打開瀏覽器輸入http://localhost:9200 顯示以 ...

1、安裝Elasticsearch資料庫

PS：在此之前需首先安裝Java SE環境

下載elasticsearch-6.5.2版本，進入/elasticsearch-6.5.2/bin目錄，雙擊執行elasticsearch.bat 打開瀏覽器輸入http://localhost:9200 顯示以下內容則說明安裝成功

安裝head插件，便於查看管理（還可以用kibana）

首先安裝Nodejs（下載地址https://nodejs.org/en/）

再下載 elasticsearch-head-master包解壓到/elasticsearch-6.5.2/下（鏈接：https://pan.baidu.com/s/1oX9wKuAYrvY2ZRBT0cos6A
提取碼：5ik4）

修改配置文件elasticsearch-6.5.2\config\elasticsearch.yml如下：

進入elasticsearch-head-master目錄下執行 npm install -g grunt-cli，再執行npm install 安裝依賴

在elasticsearch-head-master目錄下找到Gruntfile.js文件修改伺服器監聽地址如下：

執行grunt server命令啟動head服務

訪問地址http://localhost:9100/即可訪問head管理頁面

2、將json文件寫入ES資料庫（py腳本如下）

# -*- coding: UTF-8 -*-

from itertools import islice
import json , sys
from elasticsearch import Elasticsearch , helpers
import threading

_index = 'indextest'   #修改為索引名
_type = 'string'     #修改為類型名
es_url = 'http://192.168.116.1:9200/'  #修改為elasticsearch伺服器

reload(sys)
sys.setdefaultencoding('utf-8')
es = Elasticsearch(es_url)
es.indices.create(index=_index, ignore=400)
chunk_len = 10
num = 0

def bulk_es(chunk_data):
    bulks=[]
    try:
        for i in xrange(chunk_len):
            bulks.append({
                    "_index": _index,
                    "_type": _type,
                    "_source": chunk_data[i]
                })
        helpers.bulk(es, bulks)
    except:
        pass

with open(sys.argv[1]) as f:
    while True:
        lines = list(islice(f, chunk_len))
        num =num +chunk_len
        sys.stdout.write('\r' + 'num:'+'%d' % num)
        sys.stdout.flush()
        bulk_es(lines)
        if not lines:
            print "\n"
            print "task has finished"
            break

您的分享是我們最大的動力!

-Advertisement-

更多相關文章

（一）圖資料庫的基本認識

本系列筆記是在看完《neo4j權威指南》基礎上做的記錄。方便於自己後面查閱！！ 1.圖庫介紹圖資料庫（Graph Database）是基於圖論實現的一種新型NoSQL資料庫。它的數據存儲結構和數據的查詢方式都是以圖論為基礎的。圖論中圖的基本元素為節點和邊，在圖資料庫中對應的就是節點和關係。在圖數據 ...
在windows上安裝不同(兩個)版本的Mysql資料庫

1.起因: 需要導入一個sql文件,發現死活導不進去.當執行到這一句時,就有問題.經過一番搜索,原來是我的資料庫版本(原先Mysql版本5.5)低了,而支持該語句的版本應該是至少要5.7.那我索性就去Mysql官網去下載了個最新版本的(8.0.15). 2.過程: 那麼問題來了:有兩個解決方案.1. ...
mysql關於視圖的用法以及作用

關於視圖的用法以及作用。作用一：提高了重用性，就像一個函數。如果要頻繁獲取user的name和goods的name。就應該使用以下sql語言。示例： select a.name as username, b.name as goodsname from user as a, goods as b ...
MySQL slow_log表不能修改成innodb引擎

背景從mysql.slow_log 獲取慢查詢日誌很慢，該表是csv表，沒有索引。想添加索引來加速訪問，而csv引擎不能添加索引（csv引擎存儲是以逗號分割的文本來存儲的），只能改存儲引擎來添加索引了 mysql.slow_log表能改成myisam，不能改成innodb mysql.gener ...
ES 13 - Elasticsearch的元欄位（_index、_type、_source、_routing等）

元欄位是ES為每個文檔配置的內置欄位, 主要用於ES內部相關操作. ES有多種類型的元欄位, 在使用和提高性能方面有很強大的地方, 這篇文章列舉常用元欄位的功能和使用方法, 包括_index、_type、_source、_routing等, 歡迎交流吖~ ...
MYSQL 筆記

本人是一名學生，正在學習過程中，所以筆記涵蓋的還不是很廣，不過也算基本夠用，希望以後能更加完善。登陸資料庫 mysql -h 主機名 -u 用戶名 -p > mysql -u root -p 列出資料庫 show databases; 選擇資料庫 use 資料庫名; 查看當前資料庫 select ...
MySQL基礎----py全棧

MySQL基礎 py全棧 [TOC] 一、引言 1、什麼是數據？描述事物的符號記錄，可以使數字，也可以是文字，圖形、圖像等，數據有多種形式，它們都可以經過數字化存入電腦，數據的含義成為數據的語義 2、什麼是資料庫（DB）？存儲數據的倉庫，是長期存放電腦內、有組織、可共用的大量數據的集合，數據 ...
(oralce)pga_aggregate_target與workarea_size_policy相互關係驗證

pga_aggregate_target與workarea_size_policy相互關係驗證 ...