LSTM: Move Wx matrix multiplication out of the loop in forward #187

antihutka · 2017-04-28T09:57:00Z

Move one of the addmm calls out of the loop and do it in one call across all timesteps. This should provide a significant speedup when running with small batch_size.
I was able to get 10-20% speedup with batch_size=8 when running on CPU, but I'm unable to test it on GPU at the moment.

dgcrouse · 2017-04-28T15:27:42Z

I can test GPU execution on CUDA this weekend, can someone check OpenCL?

LSTM: Move Wx matrix multiplication out of the loop in forward

1fbdc5b

antihutka mentioned this pull request Jun 2, 2017

Any info for tweaking training settings for those with little background in LSTMs? #196

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LSTM: Move Wx matrix multiplication out of the loop in forward #187

LSTM: Move Wx matrix multiplication out of the loop in forward #187

antihutka commented Apr 28, 2017

dgcrouse commented Apr 28, 2017

LSTM: Move Wx matrix multiplication out of the loop in forward #187

Are you sure you want to change the base?

LSTM: Move Wx matrix multiplication out of the loop in forward #187

Conversation

antihutka commented Apr 28, 2017

dgcrouse commented Apr 28, 2017