2017-08-01 41 views
0

我在GTX-1080和Ubuntu 16.04LTS上使用基於gpu的服務器。使用正常tensorflow安裝,我得到下面的警告,同時運行應用程序 -構建失敗 - 在gpu服務器中的Tensorflow - 錯誤 - 找不到@ local_config_cuda // crosstool

2017-08-01 14:49:57.232126: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.1 instructions, but these are available on your machine and could speed up CPU computations. 
2017-08-01 14:49:57.232157: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use SSE4.2 instructions, but these are available on your machine and could speed up CPU computations. 
2017-08-01 14:49:57.232162: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX instructions, but these are available on your machine and could speed up CPU computations. 
2017-08-01 14:49:57.232165: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use AVX2 instructions, but these are available on your machine and could speed up CPU computations. 
2017-08-01 14:49:57.232169: W tensorflow/core/platform/cpu_feature_guard.cc:45] The TensorFlow library wasn't compiled to use FMA instructions, but these are available on your machine and could speed up CPU computations. 

當試圖建立最新的代碼版本 - V1.3.0 - rc1-531-gcd4c17e,本地構建失敗,下面給出錯誤信息

[email protected]:~/Workouts/tensorflow$ bazel build --config=opt --config=cuda --cxxopt="-D_GLIBCXX_USE_CXX11_ABI=0" ./tensorflow/tools/pip_package:build_pip_package 
....... 
ERROR: no such package '@local_config_cuda//crosstool': Traceback (most recent call last): 
File "/home/user/Workouts/tensorflow/third_party/gpus/cuda_configure.bzl", line 1039 
    _create_local_cuda_repository(repository_ctx) 
File "/home/user/Workouts/tensorflow/third_party/gpus/cuda_configure.bzl", line 976, in _create_local_cuda_repository 
    _host_compiler_includes(repository_ctx, cc) 
File "/home/user/Workouts/tensorflow/third_party/gpus/cuda_configure.bzl", line 145, in _host_compiler_includes 
    get_cxx_inc_directories(repository_ctx, cc) 
File "/home/user/Workouts/tensorflow/third_party/gpus/cuda_configure.bzl", line 120, in get_cxx_inc_directories 
    set(includes_cpp) 
depsets cannot contain mutable items 
INFO: Elapsed time: 5.488s 
FAILED: Build did NOT complete successfully (3 packages loaded) 

下面給出的是對平臺和配置的一些額外的細節

[email protected]:~/Workouts/tensorflow$ python --version 
Python 2.7.12 

[email protected]:~/Workouts/tensorflow$ gcc --version 
gcc (Ubuntu 5.4.0-6ubuntu1~16.04.4) 5.4.0 20160609 
Copyright (C) 2015 Free Software Foundation, Inc. 
This is free software; see the source for copying conditions. There is NO 
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. 

[email protected]:~/Workouts/tensorflow$ bazel version 
Build label: 0.5.3 
Build target: bazel-out/local-fastbuild/bin/src/main/java/com/google/devtools/build/lib/bazel/BazelServer_deploy.jar 
Build time: Fri Jul 28 08:34:59 2017 (15) 
Build timestamp: 15
Build timestamp as int: 15

配置準備以下

[email protected]:~/Workouts/tensorflow$ ./configure 
WARNING: Running Bazel server needs to be killed, because the startup options are different. 
Please specify the location of python. [Default is /usr/bin/python]: 
Found possible Python library paths: 
/usr/local/lib/python2.7/dist-packages 
/usr/lib/python2.7/dist-packages 
Please input the desired Python library path to use. Default is /usr/local/lib/python2.7/dist-packages 
Do you wish to build TensorFlow with jemalloc as malloc support? [Y/n]: 
jemalloc as malloc support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with Google Cloud Platform support? [y/N]: 
No Google Cloud Platform support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with Hadoop File System support? [y/N]: 
No Hadoop File System support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with XLA JIT support? [y/N]: 
No XLA JIT support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with VERBS support? [y/N]: 
No VERBS support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with OpenCL support? [y/N]: 
No OpenCL support will be enabled for TensorFlow. 

Do you wish to build TensorFlow with CUDA support? [y/N]: Y 
CUDA support will be enabled for TensorFlow. 

Please specify the CUDA SDK version you want to use, e.g. 7.0. [Leave empty to default to CUDA 8.0]: 
Please specify the location where CUDA 8.0 toolkit is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: 
"Please specify the cuDNN version you want to use. [Leave empty to default to cuDNN 6.0]: 
Please specify the location where cuDNN 6 library is installed. Refer to README.md for more details. [Default is /usr/local/cuda]: 
Please specify a list of comma-separated Cuda compute capabilities you want to build with. 
You can find the compute capability of your device at:  https://developer.nvidia.com/cuda-gpus. 
Please note that each additional compute capability significantly increases your build time and binary size. [Default is: 6.1,6.1,6.1,6.1]6.1 
Do you want to use clang as CUDA compiler? [y/N]: 
nvcc will be used as CUDA compiler. 

Please specify which gcc should be used by nvcc as the host compiler. [Default is /usr/bin/gcc]: 
Do you wish to build TensorFlow with MPI support? [y/N]: 
No MPI support will be enabled for TensorFlow. 

Please specify optimization flags to use during compilation when bazel option "--config=opt" is specified [Default is -march=native]: 
Add "--config=mkl" to your bazel command to build with MKL support. 
Please note that MKL on MacOS or windows is still not supported. 
If you would like to use a local MKL instead of downloading, please set the environment variable "TF_MKL_ROOT" every time before build. 
Configuration finished 

一些額外的細節給出 -

[email protected]:~/Workouts/tensorflow$ echo $CUDA_HOME 
/usr/local/cuda-8.0 
[email protected]:~/Workouts/tensorflow$ echo $LD_LIBRARY_PATH 
/usr/local/cuda-8.0/lib64 

我缺少的東西?

回答

0

嘗試添加這對您的構建命令

[email protected]_config_cuda//crosstool:toolchain 

這裏是我們所有的TF + TF發球Dockerfiles的既包括CPU,GPU,AVX等

https://github.com/fluxcapacitor/pipeline/tree/master/package.ml/tensorflow/16d39e9-d690fdd

命名約定爲tensorflow- [tf_git_hash] - [tf_serving_git_hash]。這些Dockerfiles與前幾天一樣是最新的。

另一個很好的資源是TensorFlow詹金斯/ CI頁: http://ci.tensorflow.org/

其建立是大量參數,所以他們有點棘手,以適應自己的環境,但肯定有助於我們獲得所提到的Dockerfile的以上。

相關問題