numpy - Tensorflow infinity mask breaks gradient


I'm trying to take a softmax over selected indices, using an infinity mask to silence the unwanted entries. However, the gradient of the unwanted entries becomes nan, as opposed to 0.

The reason I didn't use a boolean mask is that the masked indices differ across the batch, so I can't end up with a nice matrix form. If there's a workaround here, I'll be more than happy to adopt it.

The code I tested the infinity mask with:

import numpy as np
import tensorflow as tf

a = tf.placeholder(tf.float32, [5])
inf_mask = tf.placeholder(tf.float32, [5])

b = tf.multiply(a, inf_mask)
sf = tf.nn.softmax(b)

loss = (sf[2] - 0)
grad = tf.gradients(loss, a)

sess = tf.Session()

a_np = np.ones([5])
np_mask = np.ones([5]) * 4
np_mask[1] = -np.inf

print(sess.run([sf, grad], feed_dict={
    a: a_np,
    inf_mask: np_mask
}))

sess.close()

The output:

[array([ 0.25,  0.  ,  0.25,  0.25,  0.25], dtype=float32), [array([-0.25,   nan,  0.75, -0.25, -0.25], dtype=float32)]] 
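The likely source of the nan is the multiplicative mask itself: the masked softmax output is exactly 0, so the upstream gradient at b[1] is 0, but the chain rule through b = a * inf_mask multiplies that 0 by inf_mask[1] = -inf, and -inf * 0 is nan under IEEE 754 arithmetic. A minimal check (plain NumPy, no TensorFlow needed):

```python
import numpy as np

upstream = 0.0        # gradient of the loss w.r.t. b[1]; the masked softmax entry is 0
mask_val = -np.inf    # inf_mask[1]

# Chain rule through b = a * inf_mask: d(loss)/d(a[1]) = mask_val * upstream
result = mask_val * upstream
print(result)  # nan: inf * 0 is undefined in IEEE 754
```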

The mask is working, but the gradient has a nan where it should have been 0, I think.
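One possible workaround (a sketch, not from the question) is to mask additively with a large negative constant such as -1e9 instead of multiplying by -inf: the masked logit's softmax output underflows to exactly 0, and no inf ever enters the chain rule, so the gradient stays finite. Below is a plain NumPy sketch with a hypothetical helper `masked_softmax_grad` that also computes the analytic gradient of sf[2] with respect to the inputs, mirroring the loss above:

```python
import numpy as np

def masked_softmax_grad(a, keep):
    """Softmax over `a` with an additive large-negative mask.
    `keep` is a boolean array; entries with keep=False get -1e9 added.
    Returns (softmax, analytic gradient of sf[2] w.r.t. a)."""
    neg = np.where(keep, 0.0, -1e9)   # additive mask instead of multiplicative -inf
    b = a + neg
    e = np.exp(b - b.max())           # numerically stable softmax
    sf = e / e.sum()
    # d(sf[2])/d(a) = sf[2] * (onehot(2) - sf)
    grad = sf[2] * (np.eye(len(a))[2] - sf)
    return sf, grad

a = np.ones(5)
keep = np.array([True, False, True, True, True])
sf, grad = masked_softmax_grad(a, keep)
print(sf)    # masked entry sf[1] is exactly 0
print(grad)  # finite everywhere; grad[1] is 0, not nan
```

Because the masked indices only enter through the boolean `keep` array, a different mask per batch row still works: stack the per-row masks into a matrix and broadcast the additive -1e9 term, which avoids the "no nice matrix form" problem with gather-style boolean masking.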

