Hi
when using tf.nn.softmax in my agent and np.random.multinomial on the result, once in maybe a 10000 episodes I get this error -
in mtrand.RandomState.multinomial (numpy/random/mtrand/mtrand.c:37769)
ValueError: sum(pvals[:-1]) > 1.0
I guess its because the result of tf.nn.softmax in that case doesn't sum exactly to 1 for some reason.
Is there any way to fix this issue?
would a try catch be the best option? or should I normalize again just in case (or after checking if it sums to 1)
thanks!