v0.0.10

erogol released this 10 Mar 17:11

🐸 v0.0.10

Make synthesizer.py save the output audio with the vocoder sampling rate. This is necessary when the sampling rates of the tts and vocoder models differ and interpolation is applied to the tts model output before running the vocoder. In practice, it fixes the Spanish and French voices generated by tts or tts-server on the terminal.
Handle utf-8 on Windows. (by @adonispujols)
Fix loading the last model when --continue_training is used. Previously it loaded the best_model regardless.
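
To illustrate the sampling rate fix, here is a minimal sketch of the idea, not the actual synthesizer.py code. All function names (`output_sample_rate`, `time_scale_factor`, `interpolate_frames`) are hypothetical, and plain linear interpolation is an assumption made for the example. When the tts model output is stretched along time to match the vocoder, the waveform comes out at the vocoder rate, so the file must be saved with that rate; otherwise it plays at the wrong speed and pitch.

```python
def output_sample_rate(tts_sample_rate, vocoder_sample_rate=None):
    """Rate to save the audio with: the vocoder's rate when a vocoder is used.

    Hypothetical helper for illustration only.
    """
    if vocoder_sample_rate is not None:
        return vocoder_sample_rate
    return tts_sample_rate


def time_scale_factor(tts_sample_rate, vocoder_sample_rate):
    """How much the tts output must be stretched along the time axis."""
    return vocoder_sample_rate / tts_sample_rate


def interpolate_frames(frames, scale):
    """Linearly interpolate a list of feature frames along time.

    `frames` is a list of equal-length lists of floats (one list per frame).
    Linear interpolation is an assumption of this sketch.
    """
    n_in = len(frames)
    n_out = max(1, round(n_in * scale))
    if n_in == 1 or n_out == 1:
        # Nothing to interpolate between; repeat the first frame.
        return [list(frames[0]) for _ in range(n_out)]
    out = []
    for i in range(n_out):
        # Position of output frame i on the input time axis.
        pos = i * (n_in - 1) / (n_out - 1)
        lo = int(pos)
        hi = min(lo + 1, n_in - 1)
        w = pos - lo
        out.append([(1 - w) * a + w * b for a, b in zip(frames[lo], frames[hi])])
    return out


# Example: a tts model at 16000 Hz feeding a vocoder at 22050 Hz.
mel = [[0.0], [1.0], [2.0]]
scale = time_scale_factor(16000, 22050)
stretched = interpolate_frames(mel, scale)
save_rate = output_sample_rate(16000, 22050)  # use this rate when writing the file
```

The sample rates in the example are illustrative values, not taken from the released models.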

Move released models to GitHub Releases and deprecate GDrive as the first option.

English ek1 - Tacotron2 model and WaveGrad vocoder under .models.json. (huge THX!! to @nmstoker)
Russian Ruslan - Tacotron2-DDC model.
Dutch model. (huge THX!! to @r-dh )
Chinese Tacotron2 model. (huge THX!! to @kirianguiller)
English LJSpeech - SpeedySpeech with WaveNet decoder.

💡 All the models below are available through the tts endpoint, as explained here.

Language	Dataset	Model Name	Model Type	TTS version	Download
English	LJSpeech	SpeedySpeech	tts	😃 v0.0.10	💾
English	EK1	Tacotron2	tts	😃 v0.0.10	💾
Dutch	MAI	TacotronDDC	tts	😃 v0.0.10	💾
Chinese	Baker	TacotronDDC-GST	tts	😃 v0.0.10	💾
English	LJSpeech	TacotronDCA	tts	v0.0.9	💾
English	LJSpeech	Glow-TTS	tts	v0.0.9	💾
Spanish	M-AILabs	TacotronDDC	tts	v0.0.9	💾
French	M-AILabs	TacotronDDC	tts	v0.0.9	💾
English	EK1	WaveGrad	vocoder	😃 v0.0.10	💾
Dutch	MAI	ParallelWaveGAN	vocoder	😃 v0.0.10	💾
English	LJSpeech	MB-MelGAN	vocoder	v0.0.9	💾
🌍 Multi-Lang	LibriTTS	FullBand-MelGAN	vocoder	v0.0.9	💾
🌍 Multi-Lang	LibriTTS	WaveGrad	vocoder	v0.0.9	💾
