I am doing conformer-transducer with multilingual ASR. Why does val loss produce NaN? #11311
Unanswered
SEOLJINYOUNG
asked this question in
Q&A
Replies: 1 comment
-
I'm not sure if this is the issue, but in my case, changing the precision to 32 or bf16 allows the loss curve to converge properly. Also, your learning rate seems a bit high.
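To illustrate why the precision setting can matter, here is a minimal sketch using only the Python standard library. It assumes the NaN comes from fp16 overflow, which nothing in this thread confirms. The helper names `fits_in_fp16` and `to_bf16` are made up for this example and are not part of NeMo or PyTorch. fp16 tops out at 65504, while bf16 keeps the 8-bit exponent of float32 and so covers roughly the same range with fewer mantissa bits.

```python
import math
import struct

FP16_MAX = 65504.0  # largest finite IEEE 754 half-precision value


def fits_in_fp16(x: float) -> bool:
    """Return True if x is finite and can be packed as a half-precision float."""
    if not math.isfinite(x):
        return False
    try:
        struct.pack("<e", x)  # "e" is the struct format code for binary16
        return True
    except OverflowError:
        return False


def to_bf16(x: float) -> float:
    """Approximate bfloat16 by keeping only the top 16 bits of a float32.

    Real hardware usually rounds to nearest; truncation is a simplification
    that is good enough to show the representable range.
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFF0000))[0]


if __name__ == "__main__":
    for value in (1000.0, FP16_MAX, 70000.0, 1e30):
        print(value, "fp16 ok:", fits_in_fp16(value), "bf16:", to_bf16(value))
```

A value such as 70000.0 cannot be stored in fp16 but survives in bf16 as 69632.0, coarser but finite. If an intermediate value in the loss computation exceeds the fp16 range, it becomes inf and can turn into NaN in later operations, which would be consistent with the fix described above.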
-
Hello. I am training a multilingual ASR model for English and Korean, following the Multilingual ASR tutorial.
The tutorial uses the pre-trained stt_enes_contextnet_large model as its base model. In my case, I use stt_en_conformer_transducer_small.
My problem is that the model seems to be learning, but the validation loss comes out as NaN.
The predictions at each stage look like this.
[train stage]
[valid stage]
I would be grateful if you could give me some advice regarding this.
This is the overall code I ran.
[code]
The dataset has the following sizes:
I stopped training partway through.
[train_loss]
[val_loss]