Add device support in TTS and Synthesizer #2855

jaketae · 2023-08-10T18:51:00Z

Context

In #2282, we proposed up the possibility of implementing atts.to(device) interface as a substitute for use_cuda or gpu flags. The current flags do not allow users to specify the specific GPU device (e.g., cuda:3). It also does not allow users to use other accelerated backends, such as Apple Silicon GPUs (MPS), which PyTorch now supports.

Solution

We make TTS and Synthesizer classes inherit from nn.Module. This gives us .to(device) for free for both of the classes.

We can now run TTS on Apple Silicon (tested on M2 Max). Not all kernels have been implemented in MPS in PyTorch yet, so we need to set the environment variable

export PYTORCH_ENABLE_MPS_FALLBACK=1

to enable CPU fallback. With this set, we can now run

>>> from TTS.api import TTS
>>> model_name = TTS.list_models()[0]
>>> tts = TTS(model_name)
>>> tts = tts.to("mps")
>>> tts.tts_to_file(text="Hello world!", speaker=tts.speakers[0], language=tts.languages[0], file_path="output.wav")

Also tested with make test.

CLAassistant · 2023-08-10T18:52:06Z

All committers have signed the CLA.

jaketae · 2023-08-10T18:53:14Z

TTS/api.py

        self.manager = ModelManager(models_file=self.get_models_file_path(), progress_bar=progress_bar, verbose=False)

        self.synthesizer = None
        self.voice_converter = None
        self.csapi = None
        self.model_name = None

+        if gpu:
+            warnings.warn("`gpu` will be deprecated. Please use `tts.to(device)` instead.")


Added warning. We could add specific dates or versions to better inform users about future plans, but I left it this way because I didn't have enough context on the future releases roadmap.

jaketae · 2023-08-10T19:01:50Z

TTS/tts/utils/synthesis.py

@@ -5,19 +5,21 @@
 from torch import nn


-def numpy_to_torch(np_array, dtype, cuda=False):
+def numpy_to_torch(np_array, dtype, cuda=False, device="cpu"):


Added new device argument to functions called in Synthesizer. To retain backwards compatibility, we keep the cuda argument for now; we should probably clean them up in the future and provide a single way of configuring device/enabling CUDA.

jaketae · 2023-08-10T19:20:44Z

TTS/utils/synthesizer.py

        use_gl = self.vocoder_model is None
+        if not use_gl:
+            vocoder_device = next(self.vocoder_model.parameters()).device


In some obscure use cases, the user could have placed the feature frontend and the vocoder on different devices.

>>> tts.synthesizer.tts_model = tts.synthesizer.tts_model.to("cuda:0") >>> tts.synthesizer.vocoder_model = tts.synthesizer.vocoder_model.to("cuda:1")

We check the device of the vocoder, if it exists.

jaketae · 2023-08-10T22:34:35Z

Hi @erogol, curious to hear your thoughts on this implementation! The guiding philosophy was to use the PyTorch .to(device) API while keeping all functionality intact and retaining backwards compatibility with use_cuda or gpu.

I've signed the CLA, but the first few commits didn't have my GitHub email (I just got a new laptop and forgot to set up my Git user information), which is why the CLA test is marked as pending.

erogol · 2023-08-13T10:06:21Z

@jaketae thanks for the PR. I'll review it tomorrow 👍

erogol

All looks good!! Thanks for the PR. If you think its done I can merge.

jaketae · 2023-08-14T16:33:23Z

@erogol, thanks for the quick review! I think we can go ahead with the merge unless you have second thoughts. I'll maybe open a follow-up PR to improve docs or the README where applicable. Thanks!

erogol · 2023-08-14T19:04:49Z

@jaketae awesome thanks again

jaketae mentioned this pull request Aug 10, 2023

[RFC][Feature request] Loading model onto specific GPU #2282

Closed

jaketae commented Aug 10, 2023

View reviewed changes

jaketae marked this pull request as ready for review August 10, 2023 22:30

jaketae mentioned this pull request Aug 10, 2023

[TODO] Add speaker_device jaketae/storyteller#8

Closed

jaketae and others added 9 commits August 13, 2023 14:02

fix: resolve merge conflicts

711459c

fix: retain backwards compatability in functions

af26ffd

feature: utilize device for voice transfer

739be29

feature: use device for vocoder

7fbe8cb

chore: cleanup vocoder cpu logic

5912b56

fix: add necessary vocoder output device check

0bcc016

fix: add necessary vocoder output device check

26c7a14

fix: indentation

41de849

fix: check if waveform is pt tensor before cpu conversion

ef554f9

jaketae force-pushed the device branch from a3a3fe7 to ef554f9 Compare August 13, 2023 18:09

erogol approved these changes Aug 14, 2023

View reviewed changes

erogol merged commit 409db50 into coqui-ai:dev Aug 14, 2023
45 checks passed

jaketae deleted the device branch August 14, 2023 20:08

This was referenced Aug 14, 2023

Add device flag to TTS CLI #2875

Merged

Update README with new device API #2876

Merged

alessandroperilli mentioned this pull request Sep 15, 2023

[Bug] Doesn't respect the cuda flag #2947

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add device support in TTS and Synthesizer #2855

Add device support in TTS and Synthesizer #2855

jaketae commented Aug 10, 2023 •

edited

Loading

CLAassistant commented Aug 10, 2023 •

edited

Loading

jaketae Aug 10, 2023 •

edited

Loading

jaketae Aug 10, 2023

jaketae Aug 10, 2023

jaketae commented Aug 10, 2023 •

edited

Loading

erogol commented Aug 13, 2023

erogol left a comment

jaketae commented Aug 14, 2023 •

edited

Loading

erogol commented Aug 14, 2023

Add device support in TTS and Synthesizer #2855

Add device support in TTS and Synthesizer #2855

Conversation

jaketae commented Aug 10, 2023 • edited Loading

Context

Solution

CLAassistant commented Aug 10, 2023 • edited Loading

jaketae Aug 10, 2023 • edited Loading

Choose a reason for hiding this comment

jaketae Aug 10, 2023

Choose a reason for hiding this comment

jaketae Aug 10, 2023

Choose a reason for hiding this comment

jaketae commented Aug 10, 2023 • edited Loading

erogol commented Aug 13, 2023

erogol left a comment

Choose a reason for hiding this comment

jaketae commented Aug 14, 2023 • edited Loading

erogol commented Aug 14, 2023

jaketae commented Aug 10, 2023 •

edited

Loading

CLAassistant commented Aug 10, 2023 •

edited

Loading

jaketae Aug 10, 2023 •

edited

Loading

jaketae commented Aug 10, 2023 •

edited

Loading

jaketae commented Aug 14, 2023 •

edited

Loading