2012-12-11

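Copying files from S3 to maprfs on Amazon EMR

Does anyone know of any issues using Amazon's S3DistCp tool with MapR running on EMR? I am trying to use it, but I keep getting the following exception in /mnt/var/log/hadoop/steps: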

Exception in thread "main" java.lang.RuntimeException: Unable to delete directory hdfs:/tmp/e9333a37-f400-4982-9687-326e33d9b37d/files 
at com.amazon.external.elasticmapreduce.s3distcp.S3DistCp.deleteRecursive(S3DistCp.java:606) 
at com.amazon.external.elasticmapreduce.s3distcp.S3DistCp.run(S3DistCp.java:464) 
at com.amazon.external.elasticmapreduce.s3distcp.S3DistCp.run(S3DistCp.java:216) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:79) 
at com.amazon.external.elasticmapreduce.s3distcp.Main.main(Main.java:12) 
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) 
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) 
at java.lang.reflect.Method.invoke(Method.java:597) 
at org.apache.hadoop.util.RunJar.main(RunJar.java:186) 
Caused by: java.io.IOException: Incomplete HDFS URI, no host: hdfs:/tmp/e9333a37-f400-4982-9687-326e33d9b37d/files 
at org.apache.hadoop.hdfs.DistributedFileSystem.initialize(DistributedFileSystem.java:85) 
at org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:1416) 
at org.apache.hadoop.fs.FileSystem.access$100(FileSystem.java:69) 
at org.apache.hadoop.fs.FileSystem$Cache.getInternal(FileSystem.java:1450) 
at org.apache.hadoop.fs.FileSystem$Cache.get(FileSystem.java:1432) 
at org.apache.hadoop.fs.FileSystem.get(FileSystem.java:232) 
at com.amazon.external.elasticmapreduce.s3distcp.S3DistCp.deleteRecursive(S3DistCp.java:603) 

The command line I use to submit the job step is:

elastic-mapreduce --jobflow $JOB_ID --jar s3://us-east-1.elasticmapreduce/libs/s3distcp/1.latest/s3distcp.jar \ 
--args '--src,s3n://PVData/raw, \ 
--dest,/PVData/raw' 

For the --dest parameter, I have also tried maprfs:///VDData/raw and hdfs:///PVData/raw, and neither of them works.

Answer


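I got an answer to this question on the MapR forum (http://bit.ly/S7gzcv). The problem was that I needed to use the --tmpDir parameter to specify maprfs:///tmp as the temporary directory for s3distcp.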
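For reference, here is a rough sketch of what the step submission might look like with the temporary directory set explicitly. The stack trace shows s3distcp trying to clean up hdfs:/tmp/..., which lines up with the fix above. The --tmpDir value comes from the answer; the maprfs:///PVData/raw destination and the placeholder job flow ID are assumptions for illustration, not something the answer confirms. The script only prints the command so it can be checked before running it.

```shell
#!/bin/sh
# Sketch: build the s3distcp step arguments with an explicit temp directory.
# JOB_ID falls back to a placeholder; replace it with a real job flow ID.
JOB_ID="${JOB_ID:-j-XXXXXXXXXXXX}"

SRC="s3n://PVData/raw"
DEST="maprfs:///PVData/raw"   # assumption: destination on maprfs
TMP_DIR="maprfs:///tmp"       # from the answer: avoids the default hdfs:/tmp path

# The arguments are passed as one comma-separated string with no spaces.
ARGS="--src,${SRC},--dest,${DEST},--tmpDir,${TMP_DIR}"

# Printed rather than executed, so the command can be reviewed first.
echo elastic-mapreduce --jobflow "$JOB_ID" \
  --jar s3://us-east-1.elasticmapreduce/libs/s3distcp/1.latest/s3distcp.jar \
  --args "$ARGS"
```

Once the printed command looks right, drop the leading echo to actually submit the step.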