Why do you use softmax_cross_entropy_with_logits here? The first state is "[10.0, 128.0, 1.0, 1.0] * args.max_layers", and so are the labels. The final output of the RNN determines the action, so why do you apply a softmax cross-entropy over the action?
For example:
state = [10.0, 128.0, 1.0, 1.0, 10.0, 128.0, 1.0, 1.0], and suppose the softmax of the final output is [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.2, 0.2]. Then the loss is:
-(10*log(0.1) + 128*log(0.1) + 1*log(0.1) + 1*log(0.1) + 10*log(0.1) + 128*log(0.1) + 1*log(0.2) + 1*log(0.2))
What does this mean?
self.cross_entropy_loss = tf.nn.softmax_cross_entropy_with_logits(logits=self.logprobs[:, -1, :], labels=self.states)
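For reference, here is a minimal sketch (my own, not taken from the repo) of what that call computes: tf.nn.softmax_cross_entropy_with_logits returns -sum(labels * log(softmax(logits))) per row, so with the raw state vector passed as labels the result is exactly the sum written out in the example above. The probabilities [0.1, ..., 0.2] and the logits below are assumptions chosen only to reproduce that example.

```python
# Minimal sketch (assumed, not from the repo) of what the quoted line computes.
# softmax_cross_entropy_with_logits returns -sum(labels * log(softmax(logits))) per row,
# so passing the raw state vector as `labels` yields exactly the sum in the example.
import numpy as np
import tensorflow as tf  # run eagerly, e.g. under TF 2.x

state = np.array([[10.0, 128.0, 1.0, 1.0, 10.0, 128.0, 1.0, 1.0]], dtype=np.float32)
probs = np.array([[0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.2, 0.2]], dtype=np.float32)  # assumed softmax output
logits = np.log(probs)  # softmax(log(p)) == p when p sums to 1, so softmax(logits) == probs

manual = -np.sum(state * np.log(probs))  # -(10*log 0.1 + 128*log 0.1 + ... + 1*log 0.2) ~= 643.3
tf_loss = tf.nn.softmax_cross_entropy_with_logits(labels=state, logits=logits)

print(manual, float(tf_loss[0]))  # both print ~643.3
```

Since the labels here (10, 128, 1, ...) are not a probability distribution, the result is not a standard cross-entropy; each log-probability is simply scaled by the corresponding state entry, which is exactly what the question above is pointing at.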