text-to-speech-dataset

Here are 3 public repositories matching this topic...

hetpandya / youtube_tts_data_generator

A Python library to generate speech datasets from YouTube videos

text-to-speech youtube python-library tts speech-dataset dataset-generator youtube-dataset youtube-dataset-generator tts-dataset text-to-speech-dataset

Updated Jun 7, 2024
Python

MahtaFetrat / ManaTTS-Persian-Speech-Dataset

ManaTTS is the largest open Persian speech dataset with 86+ hours of transcribed audio. Includes data collection pipeline and tools. Suitable for Persian text-to-speech models.

text-to-speech tts speech-synthesis persian data-collection data-preprocessing speech-processing forced-alignment speech-dataset speech-corpus dataset-preparation persian-speech tts-dataset text-to-speech-dataset mana-tts speech-data-collection

Updated Sep 13, 2024
Jupyter Notebook

MahtaFetrat / GPTInformal-Persian-Speech-Dataset

A freely licensed Persian TTS dataset including 6+ hours of audio-text pairs with subject

Updated Sep 22, 2024
