我剛剛開始使用Pig學習，爲此我在Ubuntu 14.04 LTS上使用Pig版本0.16.0安裝了僞分佈式Hadoop 2.7.2。以下是我的配置，豬和Hadoop -在Hadoop 2.7.2上的Pig-0.16.0 - 錯誤1002：無法存儲別名

文件：.bashrc中

#=============================================================== 
# Hadoop Variable List 

export JAVA_HOME=/usr/lib/jvm/java-9-oracle 
export HADOOP_INSTALL=/home/hadoop/hadoop 
export PATH=$PATH:$HADOOP_INSTALL/bin 
export PATH=$PATH:$HADOOP_INSTALL/sbin 
export HADOOP_MAPRED_HOME=$HADOOP_INSTALL 
export HADOOP_COMMON_HOME=$HADOOP_INSTALL 
export HADOOP_HDFS_HOME=$HADOOP_INSTALL 
export HADOOP_HOME=$HADOOP_INSTALL 
export YARN_HOME=$HADOOP_INSTALL 
export HADOOP_COMMON_LIB_NATIVE_DIR=$HADOOP_INSTALL/lib/native 
export HADOOP_OPTS="-Djava.library.path=$HADOOP_INSTALL/lib/native" 

#=============================================================== 
# PIG variable 
export PIG_HOME="/home/hadoop/pig" 
export PIG_INSTALL="$PIG_HOME" 
export PIG_CONF_DIR="$PIG_HOME/conf" 
export PIG_CLASSPATH="$HADOOP_INSTALL/conf" 
export HADOOPDIR="$HADOOP_INSTALL/conf" 
export PATH="$PIG_HOME/bin:$PATH"

=======================

和下面的目錄從我執行生豬

-rw-rw-r-- 1 hadoop hadoop 540117 Jul 15 12:41 myfile.txt 
[email protected]:~$ pwd 
/home/hadoop

我複製該文件到HDFS太

[email protected]:~$ hadoop fs -ls -R /user/hadoop 
-rw-r--r-- 1 hadoop supergroup  540117 2016-07-15 12:48 /user/hadoop/myfile.txt

現在......當我在Grunt shell中執行以下命令時，它會給出錯誤！

grunt> a = load 'myfile.txt' as line; 
grunt> store a into 'c.out'; 

2016-07-15 12:56:38,670 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Processing aliases a 
2016-07-15 12:56:38,670 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - detailed locations: M: a[1,4],a[-1,-1] C: R: 
2016-07-15 12:56:38,684 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete 
2016-07-15 12:56:38,685 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Running jobs are [job_1468556821972_0006] 
2016-07-15 12:56:53,959 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 50% complete 
2016-07-15 12:56:53,959 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Running jobs are [job_1468556821972_0006] 
2016-07-15 12:57:25,722 [main] WARN org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Ooops! Some job has failed! Specify -stop_on_failure if you want Pig to stop immediately on failure. 
2016-07-15 12:57:25,722 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - job job_1468556821972_0006 has failed! Stop running all dependent jobs 
2016-07-15 12:57:25,722 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete 
2016-07-15 12:57:25,726 [main] INFO org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at /0.0.0.0:8032 
2016-07-15 12:57:25,786 [main] INFO org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at /0.0.0.0:8032 
2016-07-15 12:57:25,839 [main] ERROR org.apache.pig.tools.pigstats.mapreduce.MRPigStatsUtil - 1 map reduce job(s) failed! 
2016-07-15 12:57:25,841 [main] INFO org.apache.pig.tools.pigstats.mapreduce.SimplePigStats - Script Statistics: 

HadoopVersion PigVersion UserId StartedAt FinishedAt Features 
2.7.2 0.16.0 hadoop 2016-07-15 12:56:36 2016-07-15 12:57:25 UNKNOWN 

Failed! 

Failed Jobs: 
JobId Alias Feature Message Outputs 
job_1468556821972_0006 a MAP_ONLY Message: Job failed!  hdfs://localhost:9001/user/hadoop/c.out, 

Input(s): 
Failed to read data from "hdfs://localhost:9001/user/hadoop/myfile.txt" 

Output(s): 
Failed to produce result in "hdfs://localhost:9001/user/hadoop/c.out" 

Counters: 
Total records written : 0 
Total bytes written : 0 
Spillable Memory Manager spill count : 0 
Total bags proactively spilled: 0 
Total records proactively spilled: 0 

Job DAG: 
job_1468556821972_0006 


2016-07-15 12:57:25,842 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!

我曾嘗試只是在本地模式下，而不是在MapReduce的模式，但沒有執行PIG解決它通過其他方式似乎是工作。每次這些簡單的兩個命令都失敗了。

以下消息

2016-07-15 12:56:38,670 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Processing aliases a 
2016-07-15 12:56:38,670 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - detailed locations: M: a[1,4],a[-1,-1] C: R: 
2016-07-15 12:56:38,684 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 0% complete 
2016-07-15 12:56:38,685 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Running jobs are [job_1468556821972_0006] 
2016-07-15 12:56:53,959 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 50% complete 
2016-07-15 12:56:53,959 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Running jobs are [job_1468556821972_0006] 
2016-07-15 12:57:25,722 [main] WARN org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Ooops! Some job has failed! Specify -stop_on_failure if you want Pig to stop immediately on failure. 
2016-07-15 12:57:25,722 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - job job_1468556821972_0006 has failed! Stop running all dependent jobs 
2016-07-15 12:57:25,722 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - 100% complete 
2016-07-15 12:57:25,726 [main] INFO org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at /0.0.0.0:8032 
2016-07-15 12:57:25,786 [main] INFO org.apache.hadoop.yarn.client.RMProxy - Connecting to ResourceManager at /0.0.0.0:8032 
2016-07-15 12:57:25,839 [main] ERROR org.apache.pig.tools.pigstats.mapreduce.MRPigStatsUtil - 1 map reduce job(s) failed! 
2016-07-15 12:57:25,841 [main] INFO org.apache.pig.tools.pigstats.mapreduce.SimplePigStats - Script Statistics: 

HadoopVersion PigVersion UserId StartedAt FinishedAt Features 
2.7.2 0.16.0 hadoop 2016-07-15 12:56:36 2016-07-15 12:57:25 UNKNOWN 

Failed! 

Failed Jobs: 
JobId Alias Feature Message Outputs 
job_1468556821972_0006 a MAP_ONLY Message: Job failed!  hdfs://localhost:9001/user/hadoop/c.out, 

Input(s): 
Failed to read data from "hdfs://localhost:9001/user/hadoop/myfile.txt" 

Output(s): 
Failed to produce result in "hdfs://localhost:9001/user/hadoop/c.out" 

Counters: 
Total records written : 0 
Total bytes written : 0 
Spillable Memory Manager spill count : 0 
Total bags proactively spilled: 0 
Total records proactively spilled: 0 

Job DAG: 
job_1468556821972_0006 


2016-07-15 12:57:25,842 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!

請求你的幫助錯誤日誌文件打印！

來源

2016-07-15 Rajeev Sharma

指定要加載到的字段的完整路徑和數據類型。

a = load 'hdfs://localhost:9001/user/hadoop/myfile.txt' AS (line:chararray); 
store a into 'hdfs://localhost:9001/user/hadoop/c.out';

來源

2016-07-15 17:27:03

謝謝。我不得不重新安裝所有的hadoop和豬檔案，當我這樣做......隨着你的評論..他解決了這個問題..謝謝！ –

在Hadoop 2.7.2上的Pig-0.16.0 - 錯誤1002：無法存儲別名

=======================

回答

相關問題