如果要從遠程服務器加載CSV,則需要運行一個簡單的HTTPServer或類似的HTTPServer上託管文件。然後,你可以簡單地使用
LOAD CSV FROM "http://192.x.x.x/myfile.csv" as row
在另一方面,你可以從熊貓數據幀導入文件。我創建一個計算線性迴歸梯度一個簡單的腳本,並將其保存回Neo4j的
from neo4j.v1 import GraphDatabase
import pandas as pd
import numpy as np
driver = GraphDatabase.driver("bolt://192.168.x.x:7687", auth=("neo4j", "neo4j"))
session = driver.session()
def weekly_count_gradient(data):
df = pd.DataFrame([r.values() for r in data], columns=data.keys())
df["week"] = df.start.apply(lambda x: pd.to_datetime(x).week if pd.notnull(x) else None)
df["year"] = df.start.apply(lambda x: pd.to_datetime(x).year if pd.notnull(x) else None)
group = df.groupby(["week","year","company"]).start.count().reset_index()
for name in group["company"].unique():
if group[group["company"] == name].shape[0] >= 5:
x = np.array([i[1] if i[0] == 2016 else i[1] + 52 for i in group[group.company == name][["year","week"]].values])
y = group[group.company == name]["start"].values
fit = np.polyfit(x,y,deg=1)
update = session.run("MATCH (a:Company{code:{code}}) SET a.weekly_count_gradient = toFLOAT({gradient}) RETURN a.code,{"code":name,"gradient":fit[0]})
這裏的關鍵是,你運行一個帶參數的查詢,參數可以來自任何地方(列表/字典/熊貓)