Skip to content
This repository has been archived by the owner on Mar 8, 2023. It is now read-only.

Other italian models for transfer learning #116

Open
DanBmh opened this issue Dec 28, 2020 · 4 comments
Open

Other italian models for transfer learning #116

DanBmh opened this issue Dec 28, 2020 · 4 comments
Labels
enhancement New feature or request

Comments

@DanBmh
Copy link

DanBmh commented Dec 28, 2020

Hi,

I just wanted to notify you that I did train another Italian model some time ago, which you can find here: https://gitlab.com/Jaco-Assistant/deepspeech-polyglot

I did use about the same amount of training data like you, but got much better accuracy results (0.248 vs 0.399 WER on CV-test), so you might be able to find some ideas for hyperparameter optimization there.

PS: It would be great if you can share a link to your model in the pretrained models collection thread: https://discourse.mozilla.org/t/links-to-pretrained-models/62688

@Mte90
Copy link
Member

Mte90 commented Dec 28, 2020

Hi thanks for the sharing!
We will look in to that @nefastosaturo soon.

I will write in that thread in the meantime.

@nefastosaturo
Copy link
Collaborator

Hello @DanBmh thank you for sharing! I found your work some time ago and is a very nice one! I also see that you want to collaborate with RHASSPY, that will be awesome!

About deepspeech, yes the hyperparameters optimization will be one of the next thing to do. I noticed that you used data augmentation and we will for sure follow that strategy

@DanBmh
Copy link
Author

DanBmh commented Apr 6, 2021

Hi, this time I would like to notify you, that my new project Scribosermo is now ready to use.
This might be quite interesting for you, because it works well with a small amount of data, I did need ~280h of Spanish to reach competitive results.

You can find a somewhat longer description in my post here: https://discourse.mozilla.org/t/links-to-pretrained-models/62688/26
But I didn't train an Italian model, so this would be up to you:)

@DanBmh
Copy link
Author

DanBmh commented Jul 5, 2021

Scribosermo now has an Italian model as well, reaching a WER of 11.5% with ~360h traindata.
Thanks for your Mitads project by the way, it was used to create the language model:)

@Mte90 Mte90 changed the title Another italian model Other italian model for transfer learning Oct 5, 2021
@Mte90 Mte90 changed the title Other italian model for transfer learning Other italian models for transfer learning Oct 5, 2021
@Mte90 Mte90 added the enhancement New feature or request label Oct 5, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants