You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I want to thank you for making this significant contribution to the field. The project was extremely well done, and I have learned a lot about how to structure a large python project from reading your code.
I have attempted to train a model using the 'all_quality' config in experiments.json using a subset of the datasets (musan, gtzan, muspeak). The training was aborted after 28/50 epochs due to a tf/keras bug, and I was saving the model after each epoch. The last model saved had these results:
I thought I might be able to use that model to make a prediction on a 30 minute wav file (mostly music with 4 segments of speech of about 3 minutes each). The output of predict.py labeled the entire 30 minutes as speech, so I think I'm doing something wrong. I wasn't sure what to put for the --mean_path and --std_path. I just see mean.npy and var.npy files in the filelists_* directories of the datasets. Are these supposed to be combined in some way and the result passed to predict.py?
I would be grateful for any advice you can give about training and running predictions.
Merci
The text was updated successfully, but these errors were encountered:
jmm5491
changed the title
Trainig a model and generating a prediction
Training a model and generating a prediction
Feb 7, 2021
I want to thank you for making this significant contribution to the field. The project was extremely well done, and I have learned a lot about how to structure a large python project from reading your code.
I have attempted to train a model using the 'all_quality' config in experiments.json using a subset of the datasets (musan, gtzan, muspeak). The training was aborted after 28/50 epochs due to a tf/keras bug, and I was saving the model after each epoch. The last model saved had these results:
1214/1214 [==============================] - 557s 457ms/step - loss: 0.1321 - binary_accuracy: 0.9514 - categorical_accuracy: 0.8334 - val_loss: 0.1099 - val_binary_accuracy: 0.9621 - val_categorical_accuracy: 0.9421
I thought I might be able to use that model to make a prediction on a 30 minute wav file (mostly music with 4 segments of speech of about 3 minutes each). The output of predict.py labeled the entire 30 minutes as speech, so I think I'm doing something wrong. I wasn't sure what to put for the --mean_path and --std_path. I just see mean.npy and var.npy files in the filelists_* directories of the datasets. Are these supposed to be combined in some way and the result passed to predict.py?
I would be grateful for any advice you can give about training and running predictions.
Merci
The text was updated successfully, but these errors were encountered: