2017-09-01 45 views
0

I am trying to run a select * query with Apache Drill 1.11.0 on the CSV file below, but Drill fails to read it.

id,email,first_name,last_name,middle_name,suffix,work_phone,mobile_phone,gender,picture,speciality,taxonomy_code,education_details,experience_details,keywords,doctor_npi,wait_time,created_tstamp,created_by,last_updated_tstamp,last_updated_by,is_deleted 
1,[email protected],XXXXX,XXXX,,Dr,912225711234,,M,assets/images/doctorIcon.png,Primary Care Physician,Primary Care Doctor,M.D,3 years,Primary Care Doctor,1043259765,10,2015-04-22 17:20:48.0,,2015-12-16 12:06:27.0,,N 
2,[email protected],XXXX,XXXX,,Dr,913375311234,,M,assets/images/doctorIcon.png,Eye Doctor,EYE Care Doctor,MD,5 years,,1619932076,20,2015-04-30 11:07:57.0,,2015-11-07 08:49:57.0,,N 

I get this error:

org.apache.drill.common.exceptions.UserRemoteException: DATA_READ ERROR: Error processing input: , line=1, char=292. Content parsed: [ ] Failure while reading file file:/..... Happened at or shortly before byte position 292. Fragment 0:0 [Error Id: 1ce7d94a-c06e-4633-af97-f3eceb1b5350 on 172.16.16.57:31010] 

What is wrong here?

Answers

0

For some reason the column header name "suffix" does not work. If I use any other header name, the query works.

1

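This looks like a bug in Apache Drill, but Praveen is right that the problem is with the suffix column. suffix is one of the four implicit columns in Drill (filename, suffix, fqn, filepath) [1]. Even so, the expected behavior here would be for the implicit column suffix to be output (i.e. csv) rather than for the query to fail with an error. I will file a Jira for this.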

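If a column name in your file is the same as an implicit column name, you can change the default implicit column name with the ALTER SYSTEM|SESSION SET command. For example: ALTER SESSION SET `drill.exec.storage.implicit.suffix.column.label` = 'appendix';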
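As a rough illustration of the workaround, here is a minimal Python sketch that checks a CSV header for names that clash with the four implicit columns and prints a matching ALTER SESSION statement for each clash. The helper names (`find_implicit_collisions`, `rename_statements`) and the `implicit_` label prefix are made up for this example. Only the suffix option name appears in this thread; the sketch assumes the other three implicit columns follow the same option naming pattern, and that name matching is case-insensitive.

```python
import csv
import io

# Implicit column names listed in the answer above.
IMPLICIT_COLUMNS = ("filename", "suffix", "fqn", "filepath")


def find_implicit_collisions(header_line):
    """Return header names that match a Drill implicit column name."""
    header = next(csv.reader(io.StringIO(header_line)))
    # Assumption: matching ignores case and surrounding whitespace.
    return [
        name.strip()
        for name in header
        if name.strip().lower() in IMPLICIT_COLUMNS
    ]


def rename_statements(collisions, prefix="implicit_"):
    """Build ALTER SESSION statements that relabel the clashing implicit columns."""
    # Assumption: every implicit column has an option named like the suffix one.
    template = (
        "ALTER SESSION SET "
        "`drill.exec.storage.implicit.{col}.column.label` = '{label}';"
    )
    return [
        template.format(col=name.lower(), label=prefix + name.lower())
        for name in collisions
    ]


# First few header fields from the CSV in the question.
header = "id,email,first_name,last_name,middle_name,suffix,work_phone"
collisions = find_implicit_collisions(header)
statements = rename_statements(collisions)
for statement in statements:
    print(statement)
```

Run the printed statement in the same Drill session before querying the file. Renaming the header in the CSV itself, as the first answer suggests, avoids the clash without changing any session option.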

[1] https://drill.apache.org/docs/querying-a-file-system-introduction/

+0

Jira created: https://issues.apache.org/jira/browse/DRILL-5767