2012-10-05 48 views
1

我有一個看起來像下面,其中的字段由分隔的文件「?」:加載具有非標準分隔成豬文件

(01-01-2011-04:43:50?2521795691802591407?94.20.58.165? 

當我使用

mac = load 'Activity_1295336_01-01-2011.log.gz'using PigStorage('?'); 

我仍然不能訪問內部字段,例如mac$1

+0

?被保留爲參數我想嘗試更改分隔符來檢查 – Satya

回答

0

它可以在我的身邊預期...

$> pig --version 
Apache Pig version 0.9.2-cdh4.0.0 (rexported) 
compiled Jun 04 2012, 17:42:27 

$> cat temp1 
01-01-2011-04:43:50?2521795691802591407?94.20.58.165? 

grunt> a = load '/temp1' using PigStorage('?') as (datetime, id, ip); 
grunt> dump a; 
grunt> >> (01-01-2011-04:43:50,2521795691802591407,94.20.58.165,) 
grunt> b = foreach a { funky = CONCAT(ip, '_-* FUNKY'); generate datetime, id, funky;} 
grunt> dump b; 
grunt> >> (01-01-2011-04:43:50,2521795691802591407,94.20.58.165_-* FUNKY)