2016-08-19 169 views
0

我想build TensorFlow from source與GPU支持運行它。要安裝工具包,我使用運行文件來安裝我使用其他驅動程序工具的驅動程序,因爲我沒有讓Ubuntu啓動到文本模式,如CUDA documentationstop lightdmstart lightdm也不工作,它給我(也與sudo):如何安裝CUDA驅動程序TensorFlow

Name com.ubuntu.Upstart does not exist 

到目前爲止,我可以從TensorFlow存儲庫構建發行版。然而,當我試圖運行的例子如如何對

bazel-bin/tensorflow/cc/tutorials_example_trainer --use_gpu 

指定的GPU顯然無法找到:

[email protected]:~/Documents/repos/tensoflow_fork$ bazel-bin/tensorflow/cc/tutorials_example_trainer --use_gpu 
I tensorflow/stream_executor/dso_loader.cc:108] successfully opened CUDA library libcublas.so locally 
I tensorflow/stream_executor/dso_loader.cc:108] successfully opened CUDA library libcudnn.so locally 
I tensorflow/stream_executor/dso_loader.cc:108] successfully opened CUDA library libcufft.so locally 
I tensorflow/stream_executor/dso_loader.cc:108] successfully opened CUDA library libcuda.so.1 locally 
I tensorflow/stream_executor/dso_loader.cc:108] successfully opened CUDA library libcurand.so locally 
E tensorflow/stream_executor/cuda/cuda_driver.cc:491] failed call to cuInit: CUDA_ERROR_UNKNOWN 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:153] retrieving CUDA diagnostic information for host: jonas-Aspire-V5-591G 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:160] hostname: jonas-Aspire-V5-591G 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:185] libcuda reported version is: 352.63.0 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:356] driver version file contents: """NVRM version: NVIDIA UNIX x86_64 Kernel Module 352.63 Sat Nov 7 21:25:42 PST 2015 GCC version: gcc version 
    4.9.2 (Ubuntu 4.9.2-10ubuntu13) """ 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:189] kernel reported version is: 352.63.0 
I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:293] kernel version seems to match DSO: 352.63.0 
I tensorflow/core/common_runtime/gpu/gpu_init.cc:81] No GPU devices available on machine. 
F tensorflow/cc/tutorials/example_trainer.cc:125] Check failed: ::tensorflow::Status::OK() == (session->Run({{"x", x}}, {"y:0", "y_normalized:0"}, {}, &outputs)) (OK vs. Invalid argument: Cannot assign a device to node 'y': Could not satisfy explicit device specification '/gpu:0' because no devices matching that specification are registered in this process; available devices: /job:localhost/replica:0/task:0/cpu:0 
    [[Node: y = MatMul[T=DT_FLOAT, transpose_a=false, transpose_b=false, _device="/gpu:0"](Const, x)]]) 
Aborted 

我使用一個乾淨的Ubuntu 15.04安裝宏碁筆記本與GTX950M。

誰能告訴我如何正確安裝驅動程序?

回答

1

你可以運行deviceQuery(自帶cuda安裝)?你能看到nvidia出現在lspci/lsmod/nvidia-smi中嗎?

lsmod |grep nvidia 
dmesg | grep -i nvidia 
lspci | grep -i nvidia 
nvidia-smi 

您可以重新加載NVIDIA模塊,並尋找錯誤信息

modprobe -r nvidia 
dmesg | tail 
sudo dmesg | grep NVRM 

相關問題https://github.com/tensorflow/tensorflow/issues/601

+0

謝謝,但我終於得到它通過添加[圖形驅動程序庫]在Ubuntu 16.04上運行(http://www.webupd8.org/2016/06/how-to-install-latest-nvidia-drivers-in.html)並通過其他驅動程序安裝最新的穩定版本... – jns