GitHub - jhauret/vibravox: Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.

Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.

Resources:

📝: The paper related to this project is available on arXiv on this link.
🤗: The dataset used in this project is hosted by Hugging Face. You can access it here.
🌐: For more information about the project, visit our project page.
🏆: Explore Leaderboards on Papers With Code.

Requirements

pip install -r requirements.txt

Run some models

Train EBEN for Bandwidth Extension

python run.py lightning_datamodule=bwe lightning_datamodule.sensor=throat_microphone lightning_module=eben  ++trainer.check_val_every_n_epoch=15 ++trainer.max_epochs=500

Train wav2vec2 for Speech to Phoneme

python run.py lightning_datamodule=stp lightning_datamodule.sensor=headset_microphone lightning_module=wav2vec2_for_stp lightning_module.optimizer.lr=1e-5 ++trainer.max_epochs=10

Test ECAPA2 for Speaker Verification

python run.py lightning_datamodule=spkv lightning_module=ecapa2 logging=csv ++trainer.limit_train_batches=0 ++trainer.limit_val_batches=0

Name		Name	Last commit message	Last commit date
Latest commit History 510 Commits
.github/workflows		.github/workflows
configs		configs
scripts		scripts
tests		tests
vibravox		vibravox
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
logo.png		logo.png
requirements.txt		requirements.txt
run.py		run.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Resources:

Requirements

Run some models

About

Releases

Packages

Contributors 4

Languages

License

jhauret/vibravox

Folders and files

Latest commit

History

Repository files navigation

Resources:

Requirements

Run some models

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages