TensorFlow没有使用GPU

用户名

我使用AMI坐在一台AWS Deep Learning机器上。现在我正在尝试从TensorFlow运行简单的入门示例

# Creates a graph.
a = tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape=[2, 3], name='a')
b = tf.constant([1.0, 2.0, 3.0, 4.0, 5.0, 6.0], shape=[3, 2], name='b')
c = tf.matmul(a, b)
# Creates a session with log_device_placement set to True.
sess = tf.Session(config=tf.ConfigProto(log_device_placement=True))
# Runs the op.
print(sess.run(c))

但是看来我的机器没有使用我的GPU。

MatMul_2:(MatMul):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830238:我tensorflow / core / common_runtime / simple_placer.cc:847] MatMul_2:(MatMul )/ job:本地主机/副本:0 /任务:0 / cpu:0 MatMul_1:(MatMul):/ job:本地主机/副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830259:我tensorflow / core / common_runtime / simple_placer.cc:847] MatMul_1:(MatMul)/ job:localhost / replica:0 / task:0 / cpu:0 MatMul:(MatMul):/ job:localhost / replica:0 / task :0 / cpu:0 2017-07-09 00:51:03.830271:我tensorflow / core / common_runtime / simple_placer.cc:847] MatMul:(MatMul)/ job:localhost /副本:0 / task:0 / cpu: 0 b_2:(Const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830283:我tensorflow / core / common_runtime / simple_placer.cc:847] b_2:( const)/ job:localhost / replica:0 / task:0 / cpu:0 a_2:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07-09 00:51:03.830312 :我tensorflow / core / common_runtime / simple_placer.cc:847] a_2:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 b_1:(Const):/ job:localhost / replica:0 / task :0 / cpu:0 2017-07-09 00:51:03.830324:我tensorflow / core / common_runtime / simple_placer.cc:847] b_1:(const)/ job:localhost / replica:0 / task:0 / cpu: 0 a_1:(const):/ job:localhost /副本:0 / task:0 / cpu:0 2017-07-09 00:51:03.830337:我tensorflow / core / common_runtime / simple_placer.cc:847] a_1:( const)/ job:localhost / replica:0 / task:0 / cpu:0 b:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07-09 00:51:03.830348 :我tensorflow / core / common_runtime / simple_placer.cc:847] b:(const)/ job:localhost / replica:0 / task:0 / cpu:0 a:(Const):/ job:localhost / replica:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(const)/ job:localhost / replica:0 / task:0 / cpu 0:00(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 b_1:(常量):/ job:本地主机/副本:0 /任务:0 / cpu:0 2017-07-09 00:51: 03.830324:我tensorflow / core / common_runtime / simple_placer.cc:847] b_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a_1:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07-09 00:51:03.830337:I tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 b:(Const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830348:我tensorflow / core / common_runtime / simple_placer.cc:847] b :(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 a:(常量):/ job:本地主机/副本:0 /任务:0 / cpu:0 2017-07-09 00:51 :03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(const)/ job:localhost / replica:0 / task:0 / cpu:0(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 b_1:(常量):/ job:本地主机/副本:0 /任务:0 / cpu:0 2017-07-09 00:51: 03.830324:我tensorflow / core / common_runtime / simple_placer.cc:847] b_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a_1:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07-09 00:51:03.830337:I tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 b:(Const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830348:我tensorflow / core / common_runtime / simple_placer.cc:847] b :(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 a:(常量):/ job:本地主机/副本:0 /任务:0 / cpu:0 2017-07-09 00:51 :03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(const)/ job:localhost / replica:0 / task:0 / cpu:00 / task:0 / cpu:0 2017-07-09 00:51:03.830324:I tensorflow / core / common_runtime / simple_placer.cc:847] b_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a_1:(const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830337:我tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 b:(常量):/ job:本地主机/副本:0 / task:0 / cpu:0 2017-07-09 00: 51:03.830348:我tensorflow / core / common_runtime / simple_placer.cc:847] b:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a:(Const):/ job:localhost / replica :0 /任务:0 / CPU:0 2017-07-09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(const)/ job:localhost / replica:0 / task: 0 / CPU:00 / task:0 / cpu:0 2017-07-09 00:51:03.830324:I tensorflow / core / common_runtime / simple_placer.cc:847] b_1:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a_1:(const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830337:我tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(常量)/ job:本地主机/副本:0 /任务:0 / cpu:0 b:(常量):/ job:本地主机/副本:0 / task:0 / cpu:0 2017-07-09 00: 51:03.830348:我tensorflow / core / common_runtime / simple_placer.cc:847] b:(Const)/ job:localhost / replica:0 / task:0 / cpu:0 a:(Const):/ job:localhost / replica :0 /任务:0 / CPU:0 2017-07-09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(const)/ job:localhost / replica:0 / task: 0 / CPU:0本地主机/副本:0 /任务:0 / CPU:0 2017-07-09 00:51:03.830337:我tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(Const)/ job:localhost /副本:0 / task:0 / cpu:0 b:(const):/ job:localhost /副本:0 / task:0 / cpu:0 2017-07-09 00:51:03.830348:我tensorflow / core / common_runtime / simple_placer。 cc:847] b:(const)/ job:localhost / replica:0 / task:0 / cpu:0 a:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07 -09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(Const)/ job:localhost / replica:0 / task:0 / cpu:0本地主机/副本:0 /任务:0 / CPU:0 2017-07-09 00:51:03.830337:我tensorflow / core / common_runtime / simple_placer.cc:847] a_1:(Const)/ job:localhost /副本:0 / task:0 / cpu:0 b:(const):/ job:localhost /副本:0 / task:0 / cpu:0 2017-07-09 00:51:03.830348:我tensorflow / core / common_runtime / simple_placer。 cc:847] b:(const)/ job:localhost / replica:0 / task:0 / cpu:0 a:(Const):/ job:localhost / replica:0 / task:0 / cpu:0 2017-07 -09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer.cc:847] a:(Const)/ job:localhost / replica:0 / task:0 / cpu:00 /任务:0 / cpu:0a:(const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer .cc:847] a:(const)/ job:localhost / replica:0 / task:0 / cpu:00 /任务:0 / cpu:0a:(const):/ job:localhost /副本:0 /任务:0 / cpu:0 2017-07-09 00:51:03.830358:我tensorflow / core / common_runtime / simple_placer .cc:847] a:(const)/ job:localhost / replica:0 / task:0 / cpu:0

如果尝试使用手动指定GPU,tf.device('/gpu:0'):则会出现以下错误:

InvalidArgumentError:无法为操作'MatMul_3'分配设备:该操作已显式分配给/ device:GPU:0,但可用设备为[/ job:localhost /副本:0 / task:0 / cpu:0]。确保设备规格引用的是有效设备。[[节点:MatMul_3 = MatMul [T = DT_FLOAT,transpose_a = false,transpose_b = false,_device =“ / device:GPU:0”](a_3,b_3)]]

我对AMI所做的唯一更改是将TensorFlow更新为最新版本

这是我跑步观看nvidia-smi时看到的内容

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 367.57                 Driver Version: 367.57                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  Tesla K80           On   | 0000:00:1E.0     Off |                    0 |
| N/A   44C    P8    27W / 149W |      0MiB / 11439MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID  Type  Process name                               Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
狮子座

1.检查您的实例,是否选择GPU?
使用“观看nvidia-smi”查看GPU信息。

2.检查您的AMI和Tensorflow版本,可能它不支持GPU或配置错误。

我使用以下AMI:深度学习AMI Amazon Linux(ami-296e7850)。

本文收集自互联网,转载请注明来源。

如有侵权,请联系 [email protected] 删除。

编辑于
0

我来说两句

0 条评论
登录 后参与评论

相关文章