v0.2.0

erogol released this 11 Aug 08:43

🐸 v0.2.0

🐞Bug Fixes

Fix phoneme pre-compute issue.
Fix multi-speaker setup in Tacotron models.
Fix small issues in the Trainer regarding multi-optimizer training.

💾 Code updates

W&B integration for model logging and experiment tracking (👑 @AyushExel)
The code uses Tensorboard by default. To use W&B, set the log_dashboard option in the config and define project_name and wandb_entity.
Use fsspec for model saving/loading (👑 @agrinh)
Allow models to define their own symbol list with in-class make_symbols()
Allow choosing whether the LR scheduler updates after each epoch or after each step with scheduler_after_epoch.
Make converting spectrograms from amplitude to dB optional with the do_amp_to_db_linear and do_amp_to_db_mel options.
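
To illustrate what an optional amplitude-to-dB step looks like, here is a minimal sketch in plain Python. It is not the library's implementation: the function names, the `do_amp_to_db` flag, the gain of 20 and the floor of 1e-5 are all assumptions chosen for the example.

```python
import math


def amp_to_db(values, gain=20.0, floor=1e-5):
    """Convert linear amplitudes to decibels.

    `gain` and `floor` are illustrative defaults (assumptions); the floor
    clamps tiny values so that log10 is never called on zero.
    """
    return [gain * math.log10(max(floor, v)) for v in values]


def postprocess_frame(values, do_amp_to_db=True):
    """Return the frame in dB when the (hypothetical) flag is set,
    otherwise return the amplitudes unchanged."""
    return amp_to_db(values) if do_amp_to_db else list(values)


frame = [1.0, 0.1, 0.0]
print(postprocess_frame(frame, do_amp_to_db=True))   # dB values
print(postprocess_frame(frame, do_amp_to_db=False))  # raw amplitudes
```

Leaving the conversion off keeps the spectrogram in linear amplitude, which matters when a model or vocoder was trained on unscaled features.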

🗒️ Docs updates

Add GlowTTS and VITS docs.

🤖 Model implementations

VITS implementation with pre-trained models (https://arxiv.org/abs/2106.06103)

🚀 Model releases

vocoder_models--ja--kokoro--hifigan_v1 (👑 @kaiidams)

HiFiGAN model trained on the Kokoro dataset to complement the existing Japanese model.

Try it out:
```
tts --model_name tts_models/ja/kokoro/tacotron2-DDC --text "こんにちは、今日はいい天気ですか？"
```

tts_models--en--ljspeech--tacotronDDC_ph

TacotronDDC trained on LJSpeech with phoneme inputs. It fixes the pronunciation errors that raw-text input causes in the previously released TacotronDDC model.

Try it out:
```
tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"
```

tts_models--en--ljspeech--vits

VITS model trained on LJSpeech.

Try it out:
```
tts --model_name tts_models/en/ljspeech/vits --text "hello, how are you today?"
```

tts_models--en--vctk--vits

VITS model trained on VCTK with multi-speaker support.

Try it out:
```
tts-server --model_name tts_models/en/vctk/vits
```

vocoder_models--en--ljspeech--univnet

UnivNet model trained on LJSpeech to complement the TacotronDDC model above.

Try it out:
```
tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"
```

Contributors

agrinh, AyushExel, and kaiidams
