Is the model in code matches what is described in the paper? #3

YuanTingHsieh · 2018-06-22T01:20:11Z

In the first attention block,
You get hidden features using a filter AttnW to conv. with state outputs,
I believe that's the formula W h_k,
BUT you also have state outputs pass through a linear layer and get y
Then you add y and hidden features then pass through an tanh then multiply by a matrix

In the paper, I only see tanh(W h_k)

Also in your code there is AttnV,
where I can't find corresponding description in the paper.

The paper only has gate V and gate W

Could you kindly explain this?
I am really confusing.
Thank you!

nwy2010 · 2019-08-31T08:15:55Z

Yes, I have the same doubts

YuanTingHsieh changed the title ~~Is the model in code match what describes in the paper?~~ Is the model in code matches what is described in the paper? Jun 22, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is the model in code matches what is described in the paper? #3

Is the model in code matches what is described in the paper? #3

YuanTingHsieh commented Jun 22, 2018

nwy2010 commented Aug 31, 2019

Is the model in code matches what is described in the paper? #3

Is the model in code matches what is described in the paper? #3

Comments

YuanTingHsieh commented Jun 22, 2018

nwy2010 commented Aug 31, 2019