A python library to generate speech dataset from Youtube videos
-
Updated
Jun 7, 2024 - Python
A python library to generate speech dataset from Youtube videos
ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.
A free licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject
Add a description, image, and links to the text-to-speech-dataset topic page so that developers can more easily learn about it.
To associate your repository with the text-to-speech-dataset topic, visit your repo's landing page and select "manage topics."