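I am using the Google Dataflow service to run some apache-beam scripts for ETL.
The jobs initially took 4-5 minutes to complete, but now they fail after an hour with the following error:
Workflow failed. Causes: (35af2d4d3e5569e4): The Dataflow appears to be stuck.
It appears that the job never actually started.
I was running it with Python SDK 2.1.0. Since the answer to this question suggested switching the SDK, I tried running it with Python SDK 2.0.0, but no luck.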
Job ID: 2017-09-28_04_28_31-11363700448712622518
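Update:
After @BenChambers suggested checking the logs, it appears that the jobs did not start because the workers failed to start up.
The worker logs show the following output 4 times: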
Running setup.py install for dataflow-worker: finished with status 'done'
Successfully installed dataflow-worker-2.1.0
Executing: /usr/local/bin/pip install /var/opt/google/dataflow/workflow.tar.gz
Processing /var/opt/google/dataflow/workflow.tar.gz
Complete output from command python setup.py egg_info:
Traceback (most recent call last):
File "<string>", line 1, in <module>
IOError: [Errno 2] No such file or directory: '/tmp/pip-YAAeGg-build/setup.py'
----------------------------------------
Command "python setup.py egg_info" failed with error code 1 in /tmp/pip-YAAeGg-build/
/usr/local/bin/pip failed with exit status 1
Dataflow base path override: https://dataflow.googleapis.com/
Failed to report setup error to service: could not lease work item to report failure (no work items returned)
(As mentioned in the Dataflow documentation, a bundle is tried 4 times before it is declared failed.)
I have updated the question after checking the logs. Can you suggest how to proceed? – user3816204