Tensorflow在python for循环中太慢了

Question

Tensorflow在python for循环中太慢了

Cfi*_*Yoi 2 python for-loop machine-learning tensorflow

我想在Tensorflow中创建一个函数,对于给定数据X的每一行,仅对某些采样类应用softmax函数,比如说,K个类中的2个,并返回一个矩阵S,其中S.shape = (N,K)(N :给定数据的行数和K的总类别数).

矩阵S最终将包含零,并且在采样类为每一行定义的索引中包含非零值.

在简单的python中我使用高级索引,但在Tensorflow中我无法弄清楚如何制作它.我最初的问题是,我提出了numpy代码.

所以我试图在Tensorflow中找到一个解决方案,主要思想不是将S用作二维矩阵而是用作一维阵列.代码看起来像这样:

num_samps = 2
S = tf.Variable(tf.zeros(shape=(N*K)))
W = tf.Variable(tf.random_uniform((K,D)))
tfx = tf.placeholder(tf.float32,shape=(None,D))
sampled_ind = tf.random_uniform(dtype=tf.int32, minval=0, maxval=K-1, shape=[num_samps])
ar_to_sof = tf.matmul(tfx,tf.gather(W,sampled_ind),transpose_b=True)
updates = tf.reshape(tf.nn.softmax(ar_to_sof),shape=(num_samps,))
init = tf.initialize_all_variables()
sess = tf.Session()
sess.run(init)
for line in range(N):
    inds_new = sampled_ind + line*K
    sess.run(tf.scatter_update(S,inds_new,updates), feed_dict={tfx: X[line:line+1]})

S = tf.reshape(S,shape=(N,K))

Run Code Online (Sandbox Code Playgroud)

这是有效的,结果是预期的.但它运行得非常慢.为什么会这样？我怎么能更快地完成这项工作？

Answer 1

syg*_*ygi 7

在张量流编程时,学习定义操作和执行它们之间的区别至关重要.tf.当你在python中运行时,大多数函数都会以计算图形的形式添加操作.

例如,当你这样做时:

tf.scatter_update(S,inds_new,updates)

Run Code Online (Sandbox Code Playgroud)

以及:

inds_new = sampled_ind + line*K

Run Code Online (Sandbox Code Playgroud)

多次,您的计算图形增长超出了必要的范围,填补了所有内存并大大减慢了速度.

你应该做的是在循环之前定义一次计算:

init = tf.initialize_all_variables()
inds_new = sampled_ind + line*K
update_op = tf.scatter_update(S, inds_new, updates)
sess = tf.Session()
sess.run(init)
for line in range(N):
    sess.run(update_op, feed_dict={tfx: X[line:line+1]})

Run Code Online (Sandbox Code Playgroud)

这样,你的计算图只包含一个副本inds_new和update_op.请注意,执行时update_op,inds_new也将隐式执行,因为它是计算图中的父级.

您还应该知道,update_op每次运行时可能会有不同的结果,并且很好并且预期.

顺便说一下,调试此类问题的一种好方法是使用张量板可视化计算图.在代码中添加:

summary_writer = tf.train.SummaryWriter('some_logdir', sess.graph_def)

Run Code Online (Sandbox Code Playgroud)

然后在控制台中运行:

tensorboard --logdir=some_logdir

Run Code Online (Sandbox Code Playgroud)

在服务的html页面上会有一张计算图的图片,你可以在那里检查你的张量.

归档时间：	9 年，1 月前
查看次数：	1976 次
最近记录：	8 年，1 月前