2013-07-09 60 views

Difference in Pig output between grunt and Java

When I run my Pig script in grunt, the output looks fine, as in the example below:

2013-07-08 16:58:40,640 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Success! 
2013-07-08 16:58:40,647 [main] INFO org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 1 
2013-07-08 16:58:40,647 [main] INFO org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input paths to process : 1 
((email,[email protected]),{(rrr24,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr10,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr20,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr23,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr9,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr8,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr22,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr21,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{})}) 
((email,[email protected]),{(rrr0,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr6,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr7,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr3,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr1,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr5,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr4,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{}),(rrr2,(full_name,rachana),email,(state,ca),(birth_year,2013),(gender,female),{})}) 
grunt> 

Here I can see the values for full_name, email, birth_year, and gender. But when I run the same thing from Java:

package com.chegg.hwh.tracking.dao; 

import org.apache.pig.ExecType; 
import org.apache.pig.PigServer; 

public class HWHDataPigMapReduce { 

    public static void main(String[] args) throws Exception {
        // Run Pig in local mode
        PigServer pigServer = new PigServer(ExecType.LOCAL);

        // Load the users data from Cassandra and group the rows by email
        pigServer.registerQuery("rows = LOAD 'cassandra://hwh_tracking/users' USING org.apache.cassandra.hadoop.pig.CassandraStorage();");
        pigServer.registerQuery("emailgroup = group rows by email;");
        pigServer.dumpSchema("emailgroup");
    }

} 

Output:

emailgroup: {group: (name: chararray,value: chararray),rows: {(key: chararray,full_name: (name: chararray,value: chararray),email: (name: chararray,value: chararray),state: (name: chararray,value: chararray),birth_year: (name: chararray,value: long),gender: (name: chararray,value: chararray),columns: {(name: chararray,value: bytearray)})}} 

I tried using AS (full_name:chararray), but it made no difference. What am I missing here? Can anyone help?


Can you attach the code you ran in grunt? – Frederic


rows = LOAD 'cassandra://hwh_tracking/users' USING org.apache.cassandra.hadoop.pig.CassandraStorage(); emailgroup = group email by email; dump emailgroup; – plzdontkillme

Answers
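
One likely explanation is that the two runs are not doing the same thing: the grunt session ends with `dump emailgroup;`, which prints the data, while the Java program calls `pigServer.dumpSchema("emailgroup")`, which prints only the schema of the alias. The Java output shown in the question has exactly that shape (field names and types, no values). A minimal sketch of reading the actual tuples from Java follows, using `PigServer.openIterator`. The class name `DumpAliasSketch` and the helper `collect` are hypothetical names chosen for illustration, and the sketch assumes the same Cassandra setup as in the question.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

import org.apache.pig.ExecType;
import org.apache.pig.PigServer;
import org.apache.pig.data.Tuple;

public class DumpAliasSketch {

    // Hypothetical helper: executes the plan for the alias and returns
    // each resulting tuple as a string.
    static List<String> collect(PigServer pigServer, String alias) throws Exception {
        List<String> out = new ArrayList<String>();
        // openIterator runs the query and iterates over the result tuples,
        // unlike dumpSchema, which only describes the alias.
        Iterator<Tuple> it = pigServer.openIterator(alias);
        while (it.hasNext()) {
            out.add(it.next().toString());
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        PigServer pigServer = new PigServer(ExecType.LOCAL);

        // Same two statements as in the question.
        pigServer.registerQuery("rows = LOAD 'cassandra://hwh_tracking/users' USING org.apache.cassandra.hadoop.pig.CassandraStorage();");
        pigServer.registerQuery("emailgroup = group rows by email;");

        // Print the data instead of the schema.
        for (String line : collect(pigServer, "emailgroup")) {
            System.out.println(line);
        }
    }
}
```

If the goal is to write the result somewhere rather than print it, `pigServer.store(alias, path)` is another option to consider.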