
Using stored checkpoint in a java program #141

Closed
already-taken-m17 opened this issue Sep 8, 2016 · 4 comments

Comments

@already-taken-m17

Can the stored checkpoints be used by a Java program, so that sampling can be done in Java itself? If yes, can someone please share some ideas about this?

@jcjohnson
Owner

That would be nontrivial.

Checkpoints are stored using Torch serialization so you'd either have to implement a decoder in Java or write checkpoints from Lua in some other format.
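If you went the second route and dumped each tensor from Lua in a simple flat format, the Java decoder becomes straightforward. Below is a minimal sketch, assuming a hypothetical export format (not something torch-rnn provides) in which the Lua side writes, per tensor, a big-endian `int32` rank, the `int32` dimensions, then the `float32` data:

```java
import java.io.DataInputStream;
import java.io.IOException;

// Hypothetical reader for a flat weight dump. The assumed layout per tensor:
// [int32 rank][int32 dims...][float32 data...], big-endian, which matches
// DataInputStream's native byte order.
public class WeightReader {
    // Reads one tensor; fills shapeOut with the dimensions and returns the
    // flattened row-major data.
    public static float[] readTensor(DataInputStream in, int[] shapeOut) throws IOException {
        int rank = in.readInt();
        int n = 1;
        for (int i = 0; i < rank; i++) {
            shapeOut[i] = in.readInt();
            n *= shapeOut[i];
        }
        float[] data = new float[n];
        for (int i = 0; i < n; i++) {
            data[i] = in.readFloat();
        }
        return data;
    }
}
```

The Lua side would need a matching writer; Torch's tensor `storage()` gives you the raw floats to dump. The format above is only an illustration of the idea, not an existing convention.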

You'd also need to implement the forward pass in Java for whatever Torch modules your network contains, and these would need to be binary compatible with the Torch implementations.
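To make the module-by-module porting concrete, here is a minimal sketch of what one such Java reimplementation might look like for Torch's `nn.Linear` (y = W·x + b). The class name and layout are assumptions for illustration; the weights would come from the exported checkpoint:

```java
// Minimal Java forward pass for a single nn.Linear-style layer.
// weight is row-major [outFeatures][inFeatures]; bias is [outFeatures].
public class Linear {
    final float[][] weight;
    final float[] bias;

    Linear(float[][] weight, float[] bias) {
        this.weight = weight;
        this.bias = bias;
    }

    // Computes y = W*x + b.
    float[] forward(float[] x) {
        float[] y = new float[weight.length];
        for (int o = 0; o < weight.length; o++) {
            float sum = bias[o];
            for (int i = 0; i < x.length; i++) {
                sum += weight[o][i] * x[i];
            }
            y[o] = sum;
        }
        return y;
    }
}
```

An LSTM port would chain several of these plus the gate nonlinearities, and "binary compatible" means matching Torch's weight layout and numerics closely enough that the outputs agree.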

Can I ask what use case you have in mind? There may be an easier way.

@already-taken-m17
Author

Yeah, sure.
I've trained the model on a huge dataset, but now sampling and some further operations on the samples are slower than my requirements.
I have a feeling that after writing the sampling code (forward pass) in Java, the results may be faster.
Do you think this is feasible, i.e. that Java code for sampling would be faster?
Kindly suggest alternatives too if you have any in mind.
Thanks!

@jcjohnson
Owner

jcjohnson commented Sep 8, 2016

If you are mainly concerned about performance then rewriting in Java does not seem like a good solution; it would be much simpler to optimize the existing Lua sampling. There is certainly low-hanging fruit for optimization, such as this pull request: #138

Sampling speed is fundamentally limited by the model itself; generating each character requires some large matrix multiplies. You should make sure that your BLAS implementation is properly set up. As a last resort you can also try training smaller models; depending on the dataset and training you may find that smaller models perform just as well as larger models.

@already-taken-m17
Author

Yeah, I'll try training some smaller models too and see whether the quality stays the same with a boost in speed.
I'll also look into the DL4J implementations of LSTMs and compare the results and speed.
