v0.2.0

@erogol erogol released this 11 Aug 08:43

🐸 v0.2.0

🐞Bug Fixes

  • Fix phoneme pre-compute issue.
  • Fix multi-speaker setup in Tacotron models.
  • Fix small issues in the Trainer regarding multi-optimizer training.

💾 Code updates

  • W&B integration for model logging and experiment tracking (👑 @AyushExel)
    The code uses TensorBoard by default. For W&B, set the log_dashboard option in the config and define project_name and wandb_entity.
  • Use fsspec for model saving/loading (👑 @agrinh)
  • Allow models to define their own symbol list with in-class make_symbols()
  • Allow choosing whether the LR scheduler updates after each epoch or after each step with scheduler_after_epoch.
  • Make converting spectrograms from amplitude to dB optional with the do_amp_to_db_linear and do_amp_to_db_mel options.
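
As a rough illustration of the options above, the following Python sketch shows how such flags might gate behavior. Only the key names log_dashboard, project_name, wandb_entity, scheduler_after_epoch and do_amp_to_db_linear come from these notes. The values, the helper functions scheduler_step_count and maybe_amp_to_db, and the 20 * log10 formula with a 1e-5 floor are assumptions for illustration, not the library's actual implementation.

```python
import math

# Hypothetical config values; only the key names come from the release notes.
config = {
    "log_dashboard": "wandb",          # assumed value; TensorBoard is the default
    "project_name": "my-tts-project",  # placeholder
    "wandb_entity": "my-team",         # placeholder
    "scheduler_after_epoch": True,
    "do_amp_to_db_linear": True,
}


def scheduler_step_count(num_epochs, steps_per_epoch, scheduler_after_epoch):
    """Count how often an LR scheduler would be stepped in a toy training loop."""
    count = 0
    for _ in range(num_epochs):
        for _ in range(steps_per_epoch):
            if not scheduler_after_epoch:
                count += 1  # step after every training step
        if scheduler_after_epoch:
            count += 1  # step once at the end of the epoch
    return count


def maybe_amp_to_db(values, enabled, floor=1e-5):
    """Convert amplitudes to decibels (20 * log10) only when enabled."""
    if not enabled:
        return list(values)  # leave amplitudes untouched
    return [20.0 * math.log10(max(v, floor)) for v in values]


steps = scheduler_step_count(3, 10, config["scheduler_after_epoch"])
spec = maybe_amp_to_db([1.0, 10.0], config["do_amp_to_db_linear"])
```

With scheduler_after_epoch enabled, the toy loop steps the scheduler once per epoch. With it disabled, the scheduler steps once per training step.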

🗒️ Docs updates

  • Add GlowTTS and VITS docs.

🤖 Model implementations

🚀 Model releases

  • vocoder_models--ja--kokoro--hifigan_v1 (👑 @kaiidams)

    HiFiGAN model trained on the Kokoro dataset to complement the existing Japanese model.

    Try it out:

    tts --model_name tts_models/ja/kokoro/tacotron2-DDC --text "こんにちは、今日はいい天気ですか?"
  • tts_models--en--ljspeech--tacotronDDC_ph

    TacotronDDC trained on LJSpeech with phoneme input. It fixes the pronunciation errors caused by
    raw-text input in the previously released TacotronDDC model.

    Try it out:

    tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"
  • tts_models--en--ljspeech--vits

    VITS model trained on LJSpeech.

    Try it out:

    tts --model_name tts_models/en/ljspeech/vits --text "hello, how are you today?"
  • tts_models--en--vctk--vits

    VITS model trained on VCTK with multi-speaker support.

    Try it out:

    tts-server --model_name tts_models/en/vctk/vits
  • vocoder_models--en--ljspeech--univnet

    UnivNet model trained on LJSpeech to complement the TacotronDDC model above.

    Try it out:

    tts --model_name tts_models/en/ljspeech/tacotronDDC_ph --text "hello, how are you today?"
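
The release names in this list use `--` separators, while the `tts` commands use `/`. The Python sketch below shows that mapping and builds a command that names the UnivNet vocoder explicitly through the CLI's `--vocoder_name` option. The helpers to_cli_name and build_tts_command are hypothetical, and output.wav is a placeholder path. These notes do not say whether tacotronDDC_ph already picks UnivNet by default, so the explicit pairing is an assumption.

```python
import shlex


def to_cli_name(release_name):
    """Turn 'tts_models--en--ljspeech--vits' into 'tts_models/en/ljspeech/vits'."""
    return release_name.replace("--", "/")


def build_tts_command(model, text, vocoder=None, out_path=None):
    """Build an argv list for the `tts` CLI without running it."""
    cmd = ["tts", "--model_name", to_cli_name(model), "--text", text]
    if vocoder is not None:
        cmd += ["--vocoder_name", to_cli_name(vocoder)]
    if out_path is not None:
        cmd += ["--out_path", out_path]
    return cmd


command = build_tts_command(
    "tts_models--en--ljspeech--tacotronDDC_ph",
    "hello, how are you today?",
    vocoder="vocoder_models--en--ljspeech--univnet",
    out_path="output.wav",  # placeholder path
)

# Print a shell-quoted version that can be pasted into a terminal.
print(shlex.join(command))
```

The argv list can also be passed to `subprocess.run` once the `tts` package is installed.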