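AWS EMR: where are the log files?
I am using EMR to run a Spark job through YARN. The job fails, but I cannot find where EMR logs the exception; the master node's screen output only shows the traceback below. There should be a more detailed log file that shows what caused the exception, but I cannot find it. I looked at hdfs://var/log/spark/app/application_xxx, which does not show any error. Here is how I submit the application: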
spark-submit --deploy-mode cluster --master yarn --num-executors 1 --executor-cores 2 --executor-memory 5g word2vec_app.py hdfs:///test/r8_no_sto.txt
Here is the screen output on the master node:
Exception in thread "main" org.apache.spark.SparkException: Application application_1488419676573_0005 finished with failed status
at org.apache.spark.deploy.yarn.Client.run(Client.scala:1167)
at org.apache.spark.deploy.yarn.Client$.main(Client.scala:1213)
at org.apache.spark.deploy.yarn.Client.main(Client.scala)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at org.apache.spark.deploy.SparkSubmit$.org$apache$spark$deploy$SparkSubmit$$runMain(SparkSubmit.scala:738)
at org.apache.spark.deploy.SparkSubmit$.doRunMain$1(SparkSubmit.scala:187)
at org.apache.spark.deploy.SparkSubmit$.submit(SparkSubmit.scala:212)
at org.apache.spark.deploy.SparkSubmit$.main(SparkSubmit.scala:126)
at org.apache.spark.deploy.SparkSubmit.main(SparkSubmit.scala)
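There will be a folder in S3. When configuring the EMR cluster, I believe you can choose the log directory. By default the logs are stored in S3, under the path s3://aws-logs-[ACCOUNT_NUMBER]-[AVAILABILITY_ZONE]/elasticmapreduce/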
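With `--deploy-mode cluster` the driver runs inside a YARN container, so the Python traceback normally ends up in that container's stdout/stderr rather than in the client-side output shown in the question. As a rough sketch of where to look, the snippet below builds the S3 prefix for the failed application's container logs. The `<cluster-id>/containers/<application-id>/` layout, the `LOG_URI` value, the `CLUSTER_ID` placeholder, and the helper name `container_log_prefix` are assumptions for illustration, so check them against your own cluster's log settings.

```shell
#!/bin/sh
# Sketch only: LOG_URI and CLUSTER_ID are placeholders, not values from the question.
LOG_URI="s3://aws-logs-ACCOUNT_NUMBER-AVAILABILITY_ZONE/elasticmapreduce/"
CLUSTER_ID="j-XXXXXXXXXXXXX"              # hypothetical cluster id
APP_ID="application_1488419676573_0005"   # taken from the exception message

# Hypothetical helper: join log URI, cluster id and application id into the
# prefix where the YARN container logs are assumed to be archived.
container_log_prefix() {
  # ${1%/} drops one trailing slash so the joined path has no "//".
  printf '%s/%s/containers/%s/\n' "${1%/}" "$2" "$3"
}

# Print the prefix to inspect.
container_log_prefix "$LOG_URI" "$CLUSTER_ID" "$APP_ID"

# On the master node, while the cluster is still up (needs YARN log aggregation):
#   yarn logs -applicationId "$APP_ID" | less
# Once the logs have been copied to S3 (this can lag behind the failure):
#   aws s3 ls --recursive "$(container_log_prefix "$LOG_URI" "$CLUSTER_ID" "$APP_ID")"
```

The driver's container is usually the first one listed for the application; its stderr is the most likely place to find the real exception.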