Trained custom data on the mini_librispeech recipe but inference just gives 1 speaker for the whole audio file #33

Open
saumyaborwankar opened this issue Nov 19, 2021 · 1 comment

Comments

saumyaborwankar commented Nov 19, 2021

SPEAKER aaak 1   11.40    0.10 <NA> <NA> aaak_4 <NA>
SPEAKER aaak 1   14.00    0.10 <NA> <NA> aaak_4 <NA>

This is the hyp_0.3_1.rttm I got after scoring. For the entire aaak.wav file, only speaker aaak_4 is detected.
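
In case it helps with debugging, here is a minimal sketch (not part of the recipe) that counts the distinct speaker labels per recording in a hypothesis RTTM, assuming the standard space-separated RTTM layout where field 2 is the recording ID and field 8 is the speaker label:

```python
from collections import defaultdict

def speakers_per_recording(rttm_path):
    """Count distinct speaker labels per recording in a hypothesis RTTM."""
    reco2spk = defaultdict(set)
    with open(rttm_path) as f:
        for line in f:
            fields = line.split()
            # Standard RTTM: SPEAKER <reco> <chan> <start> <dur> <NA> <NA> <spk> <NA> ...
            if len(fields) >= 8 and fields[0] == "SPEAKER":
                reco2spk[fields[1]].add(fields[7])
    return reco2spk

for reco, spks in speakers_per_recording("hyp_0.3_1.rttm").items():
    print(reco, len(spks), sorted(spks))
```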

"main/DER": 0.4484034770634306,
"validation/main/DER": 0.5290581162324649,

This is the DER after 200 epochs. Can someone help me understand why the inference detects just one speaker?

aaaa wav_8/aaaa.wav
aaab wav_8/aaab.wav

This is the wav.scp file (first 2 lines).

aaab-000521-000625 Khanna
aaab-000829-000923 Khanna

This is the utt2spk file.

aaab-000521-000625 aaab 5.21 6.25
aaab-000829-000923 aaab 8.29 9.23

This is the segments file.
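
To rule out a labelling problem on the data side, a minimal sketch that joins segments (utterance -> recording, start, end) with utt2spk (utterance -> speaker) and prints how many distinct speakers each recording is annotated with; the data/train paths are placeholders for whatever directory was actually prepared. If a mixture really contains two speakers, its recording should list two labels here.

```python
from collections import defaultdict

def read_table(path):
    """Read a Kaldi-style data file: first field is the key, the rest are the values."""
    table = {}
    with open(path) as f:
        for line in f:
            fields = line.split()
            if fields:
                table[fields[0]] = fields[1:]
    return table

# Placeholder paths: point these at the data directory the recipe was run on.
utt2spk = {utt: vals[0] for utt, vals in read_table("data/train/utt2spk").items()}
segments = read_table("data/train/segments")  # utt -> [reco, start, end]

reco2spk = defaultdict(set)
for utt, (reco, _start, _end) in segments.items():
    reco2spk[reco].add(utt2spk.get(utt, "<missing-in-utt2spk>"))

for reco, spks in sorted(reco2spk.items()):
    print(reco, len(spks), sorted(spks))
```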

kli017 commented Dec 14, 2021

Hello, I ran into the same problem while training on the mini_librispeech recipe. I made a 2-speaker dataset with no overlap, and as the number of epochs increases the model detects just 1 speaker. Did you find the reason?
