MeloTTS is a high-quality multi-lingual text-to-speech library by MIT and MyShell.ai. Supported languages include:
Language | Example |
---|---|
English (American) | Link |
English (British) | Link |
English (Indian) | Link |
English (Australian) | Link |
English (Default) | Link |
Spanish | Link |
French | Link |
Chinese (mix EN) | Link |
Japanese | Link |
Korean | Link |
Some other features include:
- The Chinese speaker supports
mixed Chinese and English
. - Fast enough for
CPU real-time inference
.
The Python API and model cards can be found in this repo or on HuggingFace.
Contributing
If you find this work useful, please consider contributing to this repo.
- Many thanks to @fakerybakery for adding the Web UI and CLI part.
- Wenliang Zhao at Tsinghua University
- Xumin Yu at Tsinghua University
- Zengyi Qin (project lead) at MIT and MyShell
Citation
@software{zhao2024melo,
author={Zhao, Wenliang and Yu, Xumin and Qin, Zengyi},
title = {MeloTTS: High-quality Multi-lingual Multi-accent Text-to-Speech},
url = {https://github.com/myshell-ai/MeloTTS},
year = {2023}
}
This library is under MIT License, which means it is free for both commercial and non-commercial use.
This implementation is based on TTS, VITS, VITS2 and Bert-VITS2. We appreciate their awesome work.