2015-07-01 49 views
1

我試圖在使用ec2實例的Amazon EMR上運行mrjob。它一直在工作,直到我意識到我正在使用python包(機械化,BeautifulSoup,boto)。所以,我加入到我的mrjob.conf文件,但現在我不斷收到此錯誤:Amazon EMR + mrjob:引導程序錯誤,「引導操作1返回了非零返回碼」

No handlers could be found for logger "mrjob.emr" 
Traceback (most recent call last): 
    File "run_job.py", line 102, in <module> 
    run() 
    File "run_job.py", line 76, in run 
    runner.run() 
    File "C:\Program Files (x86)\Python278\lib\site-packages\mrjob\runner.py", line 464, in run 
    self._run() 
    File "C:\Program Files (x86)\Python278\lib\site-packages\mrjob\emr.py", line 821, in _run 
    self._wait_for_job_to_complete() 
    File "C:\Program Files (x86)\Python278\lib\site-packages\mrjob\emr.py", line 1689, in _wait_for_jo 
b_to_complete 
    raise Exception(msg) 
Exception: Job on job flow j-CZDG75Z1X58 failed with status FAILED: On the master instance (i-0ef8bb 
cc), bootstrap action 1 returned a non-zero return code 

我在我的智慧結束了,我一直在努力,現在解決這個相當長的一段時間,我似乎無法得到它。這裏是conf文件:

runners: 
    emr: 
    bootstrap_mrjob: True 
    ami_version: 2.4.11 

    bootstrap: 
     - sudo apt-get install -y python-pip 
     - sudo pip install mechanize 
     - sudo pip install bs4 
     - sudo pip install mrjob 
     - sudo pip install boto 

    aws_access_key_id: xxxx 
    aws_region: xxxx 
    aws_secret_access_key: xxxx 
    num_ec2_core_instances: 1 
    ec2_core_instance_type: m1.small 
    iam_instance_profile: xxxx 

回答

0

終於想通了!

在mrjob.conf

,我改變了自舉:和所有的元素來

bootstrap_cmds: 
    - sudo easy_install pip 
    - sudo easy_install mechanize 
    - sudo easy_install boto 
    - sudo easy_install BeautifulSoup4 

原來mjrob幾乎總是會自動安裝自己的軟件包到EC2實例。所以我根本不需要在bootstrap中有。

現在它像一個魅力運行! :)

相關問題