Byakto TTS [bangla text to speech]

Multilingual (Bangla, English) real-time speech synthesis library.

Installation

Install Anaconda
conda create -n new_virtual_env python==3.6.8
conda activate new_virtual_env
pip install -r requirements.txt
While running for the first time, keep your internet connection on to download the weights of the speech synthesis models (> 500 MB)
For fast inference, you must install tensorflow-gpu and have an NVIDIA GPU (CUDA).

Usage

'''
function: generate(text_arr = [""], save_path = None)
arguments: 
text_arr (array) : an array of strings
save_path (string, optional) : location where generated wav files will be stored if save_path is not None, if the path is not valid, the wav files will be saved in current directory
returns:
if save_path is None, instead of saving an array of tuples containing geenrated speech signals and the sampling rate will be returned
if save_path is not None, then a list containing the file paths (relative) will be returned
'''

from bangla_tts import generate

# usage 1 (saving to path)

file_names = generate(["আমার সোনার বাংলা আমি তোমাকে ভালোবাসি"], save_path = "static") # will be saved to static folder
print(file_names)

# usage 2 (getting numpy arrays for the signals)

gen_wavs = generate(["আমার সোনার বাংলা আমি তোমাকে ভালোবাসি"]) # will return an array containing the speech and sampling rate
print(gen_wavs[0])
print(f"signal length: {gen_wavs[0][0].shape}")
print(f"samplign rate: {gen_wavs[0][1]}")

Generated unseen speech samples

Sample 1 (আমার সোনার বাংলা আমি তোমাকে ভালোবাসি)

Sample 2 (I am still not a great speaker)

Sample 3 (This is just a test)

Update (18th April)

Synthesize longer sentences
Phonetic representation for English, Bangla numeric segments

Added a simple parser which will translate numeric keys to corresponding phonetic representation.

Example: ১৯৯৭ সালের ২১ জানুয়ারী তে আমার জন্ম হয় will be converted to ['ঊনিশশ সাতানব্বই সালের একুশ জানুয়ারী তে আমার জন্ম হয় '] by the parser.

Added a simple batch mechanism for translating longer sentences. As the attention window was fixed during training, the model previously failed to generate long sentences (n_characters > 200). So, added a simple segmenting step to break the sentences into multiple parts, synthesize in batch, and finally merge them into a single audio file.

New examples:

১৯৯৭ সালের ২১ জানুয়ারী তে আমার জন্ম হয়

আমার ফোন নাম্বার ০১৭১৩৩৫৩৪৩, তবে আমাকে সকাল ১০ টার আগে পাবেন না

বাংলাদেশে গত ২৪ ঘণ্টায় ৩০৬ জন কোভিড-১৯ আক্রান্ত হয়েছেন। এই সময়ের মধ্যে মৃত্যু হয়েছে ৯ জনের। এ নিয়ে দেশটিতে মোট আক্রান্ত হলেন ২১৪৪। আর করোনা ভাইরাসে আক্রান্ত হয়ে মৃত্যু হয়েছে ৮৪ জনের। নতুন করে ৮ জনের পরীক্ষা করার পর করোনা ভাইরাসের উপস্থিতি পাওয়া যায়নি। এনিয়ে মোট ৬৬ জন সুস্থ হলেন। - BBC Bangla

Test Dataset

I have designed a test dataset to compare new bangla TTS models with the benchmark model (bangla-tts).

Bakta dataset

Bakta dataset (multi-lingual, bangla + english)

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
model1		model1
model2		model2
static		static
test_dataset		test_dataset
.gitignore		.gitignore
README.md		README.md
bakta_test_results.py		bakta_test_results.py
bangla_tts.py		bangla_tts.py
birthdate.wav		birthdate.wav
config.py		config.py
covid19.wav		covid19.wav
data_load.py		data_load.py
graph.py		graph.py
modules.py		modules.py
networks.py		networks.py
num_parser.py		num_parser.py
out.wav		out.wav
phone_number.wav		phone_number.wav
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Byakto TTS [bangla text to speech]

Installation

Usage

Generated unseen speech samples

Update (18th April)

Test Dataset

Wiki Pages

To-dos

About

Releases

Packages

Contributors 2

Languages

zabir-nabil/bangla-tts

Folders and files

Latest commit

History

Repository files navigation

Byakto TTS [bangla text to speech]

Installation

Usage

Generated unseen speech samples

Update (18th April)

Test Dataset

Wiki Pages

To-dos

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages