2017-06-29 81 views
0

我想使用ETL將文件導入到Orientdb中的文件模型中csv文件 我不知道這是否作爲新手正確,而不是文檔模型上的大量文檔,但我試過是:Orientdb將CSV文件導入到文檔模型

{ 
    "config": { 
    "log": "debug" 
    }, 
    "begin": [], 
    "source": { 
    "file": { 
     "path": "C:/Users/M/Desktop/files/lact.csv" 
    } 
    }, 
    "extractor": 
{ "csv": 
     { "separator": ",", 
     "nullValue": "NULL" 
     } 
    }, 
    "transformers": [ 
    { 
     "log": {} 
    } 
    ], 
    "loader": { 
    "orientdb": { 
     "dbURL": "plocal:../databases/Model_doc", 



     "dbType": "document", 
     "classes": [ 
     { 
      "name": "Annotations" 
     }, 


     ] 
    } 
    }, 
    "end": [] 
} 

顯示該文件的內容的解析之後,我的這句話: [orientdb] DEBUG orientdb:發現在課堂上「空」

的CSV文件0文件

"Entry","Entry_name","Status","Protein_names","Gene_names","Organism","Length","Cross_reference(STRING)" 
"Q29836","1B67_HUMAN","reviewed","HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67)","HLA-B HLAB","Homo sapiens (Human)","362","9606.ENSP00000399168;" 
"P30501","1C02_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"P30508","1C12_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"Q29960","1C16_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16)","HLA-C HLAC","Homo sapiens (Human)","366","" 
"Q29865","1C18_HUMAN","reviewed","HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18)","HLA-C HLAC","Homo sapiens (Human)","366","" 
+0

嗨,你能發佈csv文件嗎?你使用什麼版本? Thx –

+0

@MichelaBonizzi我正在使用2.2.22 oientdb公社版本。至於csv我編輯了問題 –

+0

@MichelaBonizzi有什麼想法嗎? –

回答

1

我想你的代碼,我也有同樣的消息:

[orientdb] DEBUG orientdb: found 0 documents in class 'null' 

但我已經能夠導入所有的數據,正如你從截圖中看到的那樣。

enter image description here

做到這一點作爲@RobertoFranchini說,你要補充一點:

"transformers": [ 
{ 
    "log": {} 
}, 
{ 
    "field": { 
    "fieldName": "@class", 
    "value": "Annotations" 
    } 
} 
], 

我做了這個小改變您的CSV文件:

Entry,Entry_name,Status,Protein_names,Gene_names,Organism,Length,Cross_reference(STRING) 
Q29836,1B67_HUMAN,reviewed,HLA class I histocompatibility antigen, B-67 alpha chain (MHC class I antigen B*67),HLA-B HLAB,Homo sapiens (Human),362,9606.ENSP00000399168 
P30501,1C02_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-2 alpha chain (MHC class I antigen Cw*2),HLA-C HLAC,Homo sapiens (Human),366, 
P30508,1C12_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-12 alpha chain (MHC class I antigen Cw*12),HLA-C HLAC,Homo sapiens (Human),366, 
Q29960,1C16_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-16 alpha chain (MHC class I antigen Cw*16),HLA-C HLAC,Homo sapiens (Human),366, 
Q29865,1C18_HUMAN,reviewed,HLA class I histocompatibility antigen, Cw-18 alpha chain (MHC class I antigen Cw*18),HLA-C HLAC,Homo sapiens (Human),366, 

和所有的數據已導入。

希望它有幫助。

問候。

+0

謝謝你已經修好了 –

1

你需要一個類分配給文件,現場變壓器加入連鎖,只要登錄後

"transformers": [ 
{ 
    "log": {} 
}, 
{ 
    "field": { 
    "fieldName": "@class", 
    "value": "Annotations" 
    } 
} 
], 
+0

Franchini,沒有工作我添加了該行,但問題仍然存在 –

+0

在您的重複問題中,您使用class而不是@class。請更新這個問題。 –