我有tsv格式的數據,我想將它插入到hbase中,問題是每個字段都被「'」(單引號)括起來。在hbase中使用封閉的
我如何能夠提供其在導入命令:
HBase的org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.bulk.output =輸出1 -Dimporttsv.columns = HBASE_ROW_KEY,數據: EMPLOYEE_ID,數據:employee_name,數據:指定數據:TELEPHONE_NUMBER,數據:artist_mbid, 員工/ EMP
我發現可用HBase的選項有: 在 https://hbase.apache.org/book.html
importtsv
Other options that may be specified with -D include:
-Dimporttsv.skip.bad.lines=false - fail if encountering an invalid line
'-Dimporttsv.separator=|' - eg separate on pipes instead of tabs
-Dimporttsv.timestamp=currentTimeAsLong - use the specified timestamp for the import
-Dimporttsv.mapper.class=my.Mapper - A user-defined Mapper to use instead of org.apache.hadoop.hbase.mapreduce.TsvImporterMapper
我正在尋找像MySQL的命令等同:LOAD DATA INFILE '產出' INTO TERMINATED BY '\ t' OPTIONALLY ENCLOSED BY '「' 結尾BY '\ n'
LINES TABLE員工FIELDS