Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

nan loss when training on Windows #17

Open
rose-jinyang opened this issue Oct 22, 2022 · 5 comments
Open

nan loss when training on Windows #17

rose-jinyang opened this issue Oct 22, 2022 · 5 comments
Labels
enhancement New feature or request

Comments

@rose-jinyang
Copy link

Hello
How are you?
Thanks for contributing to this project.
I am going to train a model on Windows.
But I get nan loss values at the beginning.
I checked the same training project on Google Colab (Ubuntu) and it works well.
I think that the there are some issues in the current project for Windows.

@RizwanMunawar
Copy link
Owner

Hi @rose-jinyang! I am fine.
At the start of training, the loss can be zero. There can be a possibility that the current batch loaded for training will provide zero loss value on the window machine. It should be much better if you will compare the results after 5 to 6 epochs at least. Also, I will recommend you use Linux/Google Colab, as it could speed up the training process.

@RizwanMunawar RizwanMunawar added the enhancement New feature or request label Oct 23, 2022
@rose-jinyang
Copy link
Author

Thanks for your quick reply.
I tested on Google Colab but It works well even at the first epoch.
But when training on Windows, the losses are nan even after 5 epochs.

@RizwanMunawar
Copy link
Owner

@rose-jinyang! I will test the results on windows and explore the topic in detail to optimize code as much as possible especially for windows.

@rose-jinyang
Copy link
Author

Thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants
@rose-jinyang @RizwanMunawar and others