Sep 14, 2023 - 2 ' read

XTTS - the best open-source TTS

machine-learning, TTS, coqui.ai, open-source

"Smart electric cars in French, Rococo style, classical style, oil on canvas"

We recently open-sourced XTTS, the best open-source text-to-speech model we’ve released so far.

XTTS uses the latest generative AI techniques to deliver faster, higher-quality speech in 13 different languages. It is now accessible with 🐸TTS.

XTTS offers…

Voice cloning with just a 3-second audio clip.
Emotion and style transfer by cloning.
Cross-language voice cloning.
Multi-lingual speech generation.
24khz sampling rate.

As of now, XTTS-v1 supports the following languages: English, Spanish, French, German, Italian, Portuguese, Polish, Turkish, Russian, Dutch, Czech, Arabic, and Chinese.

To start using XTTS, all you need to do is pip install TTS.

Here is a sample Python code and please see the docs for more details.

from TTS.api import TTS
tts = TTS("tts_models/multilingual/multi-dataset/xtts_v1", gpu=True)

# generate speech by cloning a voice using default settings
tts.tts_to_file(text="It took me quite a long time to develop a voice, and now that I have it I'm not going to be silent.",
                file_path="output.wav",
                speaker_wav="/path/to/target/speaker.wav",
                language="en")