Korean TTS using Coqui TTS (Glow-TTS and Multi-band MelGAN)
Pretrained on KSS data.
Fine-tuned on Half-Life scientist data.
- input text
- "신은 우리의 수학 문제에는 관심이 없다. 신은 다만 경험적으로 통합할 뿐이다." ("God is not interested in our math problems. God merely integrates empirically.")
- output
glowtts-mbmelgan.mov
halfLife_mbmelgan.mov
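As a rough sketch of how a sample like the ones above might be synthesized, the snippet below assembles an invocation of Coqui's `tts` command-line tool for a Glow-TTS checkpoint plus a Multi-band MelGAN vocoder. All file paths are hypothetical placeholders, and it is an assumption that these checkpoints load with a stock `pip install TTS`; the training notebooks may rely on a custom Korean text cleaner.

```python
import shlex


def build_tts_command(text, model_path, config_path,
                      vocoder_path, vocoder_config_path, out_path):
    """Return the argument list for Coqui's `tts` command-line tool."""
    return [
        "tts",
        "--text", text,
        "--model_path", model_path,
        "--config_path", config_path,
        "--vocoder_path", vocoder_path,
        "--vocoder_config_path", vocoder_config_path,
        "--out_path", out_path,
    ]


# Every path below is a hypothetical placeholder, not a file name from this repo.
cmd = build_tts_command(
    text="신은 우리의 수학 문제에는 관심이 없다. 신은 다만 경험적으로 통합할 뿐이다.",
    model_path="glowtts/best_model.pth.tar",
    config_path="glowtts/config.json",
    vocoder_path="mbmelgan/best_model.pth.tar",
    vocoder_config_path="mbmelgan/config.json",
    out_path="output.wav",
)

# Print a shell-safe version of the command for inspection.
print(shlex.join(cmd))
```

To actually run the synthesis, the list can be passed to `subprocess.run(cmd, check=True)` once the checkpoints have been downloaded.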
glowtts
- trained on KSS data for 190,000 steps
- training notebook: coqui_train_glowtts.ipynb
- Google Drive link: https://drive.google.com/drive/folders/1quLOabjkAmmw6mFbcCsMqmGxMC4bbbCW
multiband-melgan
- trained on concatenated Korean data (KSS, Zeroth and Pansori-TEDxKR) for 150,000 steps
- training notebook: coqui_train_mbmelgan.ipynb
- Google Drive link: https://drive.google.com/drive/folders/1FOlcOjx47j_ALNw28rZkr62iOWqHY6tE
halfLife finetuned glowtts
- trained on KSS data for 190,000 steps, then on halfLife data for 90,000 steps
- training notebook: halfLife_finetune_glowtts.ipynb
- Google Drive link: https://drive.google.com/drive/folders/1RubvJSDKZ_hNp3xj8mCocwtWG3KBmT4R?usp=sharing
halfLife finetuned multiband-melgan
- trained on concatenated Korean data (KSS, Zeroth and Pansori-TEDxKR) for 150,000 steps, then on halfLife data for 20,000 steps
- training notebook: halfLife_finetune_mbmelgan.ipynb
- Google Drive link: https://drive.google.com/drive/folders/15eAW8jTHSIOAisiPQa03VOMOH-pACguc?usp=sharing
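The four Google Drive folders listed above could be fetched in one go. The following is a minimal sketch, assuming the folders are publicly shared and that the installed `gdown` version provides `download_folder`; the dictionary keys and the `checkpoints` destination directory are hypothetical names chosen for illustration.

```python
from urllib.parse import urlparse

# URLs copied from the model sections; the keys are illustrative labels.
DRIVE_FOLDERS = {
    "glowtts": "https://drive.google.com/drive/folders/1quLOabjkAmmw6mFbcCsMqmGxMC4bbbCW",
    "mbmelgan": "https://drive.google.com/drive/folders/1FOlcOjx47j_ALNw28rZkr62iOWqHY6tE",
    "halflife_glowtts": "https://drive.google.com/drive/folders/1RubvJSDKZ_hNp3xj8mCocwtWG3KBmT4R?usp=sharing",
    "halflife_mbmelgan": "https://drive.google.com/drive/folders/15eAW8jTHSIOAisiPQa03VOMOH-pACguc?usp=sharing",
}


def folder_id(url):
    """Extract the Drive folder ID: the path segment right after 'folders'."""
    parts = urlparse(url).path.strip("/").split("/")
    return parts[parts.index("folders") + 1]


def download_all(dest="checkpoints"):
    """Download every folder into dest/<label>. Requires network access."""
    import gdown  # imported lazily so the helpers above work without it

    for name, url in DRIVE_FOLDERS.items():
        gdown.download_folder(url=url, output=f"{dest}/{name}", quiet=False)


# Show which folder ID belongs to which model without downloading anything.
for name, url in DRIVE_FOLDERS.items():
    print(name, folder_id(url))
```

Calling `download_all()` performs the actual download; the loop at the end only lists the folder IDs.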
!pip install TTS
!pip install jamo
!pip install torchaudio==0.9.0
!pip install gdown
!conda install -c conda-forge kaggle -y
!pip install librosa
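The `jamo` package installed above deals with splitting Hangul syllables into their component letters, which is a common preprocessing step for Korean TTS. As an illustration of the idea only, here is a pure-Python sketch based on the Unicode Hangul syllable layout. It does not use the `jamo` package, and whether the notebooks decompose text in exactly this way is an assumption.

```python
HANGUL_BASE = 0xAC00      # first precomposed syllable, "가"
SYLLABLE_COUNT = 11172    # 19 leads * 21 vowels * 28 tails
LEAD_BASE = 0x1100        # first leading consonant jamo
VOWEL_BASE = 0x1161       # first vowel jamo
TAIL_BASE = 0x11A7        # one below the first trailing consonant jamo


def decompose(text):
    """Split each precomposed Hangul syllable into conjoining jamo.

    Characters outside the Hangul syllable block are passed through.
    """
    out = []
    for ch in text:
        index = ord(ch) - HANGUL_BASE
        if 0 <= index < SYLLABLE_COUNT:
            lead, rest = divmod(index, 21 * 28)
            vowel, tail = divmod(rest, 28)
            out.append(chr(LEAD_BASE + lead))
            out.append(chr(VOWEL_BASE + vowel))
            if tail:  # tail index 0 means "no final consonant"
                out.append(chr(TAIL_BASE + tail))
        else:
            out.append(ch)
    return "".join(out)


# Print the code points produced for a short word.
print([hex(ord(c)) for c in decompose("한국어")])
```

Working on jamo rather than whole syllables keeps the symbol set small (a few dozen letters instead of thousands of syllable blocks), which is the usual motivation for this step.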
- coqui tts
- TensorFlowTTS
- glow tts
- Multi-band MelGAN
- FastSpeech 2
- speech-japanese-korean-vietnamese
- openslr
- half_life_dataset
- KSS Dataset
- Zeroth Korean
- Pansori-TEDxKR
- Fine-Tuning with a small dataset
- Changing Siri's voice to IU's voice
- Implementing TTS (speech synthesis) with AI deep voice _ anchor Sohn Suk-hee
- SCE-TTS: Building a TTS with my own voice
- huggingface_fastspeech2_kss
- huggingface_TensorFlowTTS
- jamo
- korean.py
- freeconvert