This is an assortment of all the scripts associated with the Intro to Voice Computing Book. These scripts should give you a jumpstart in learning how to write Python code for voice-related applications.
^^ Note: If you find this code useful, please leave a star :) ^^
Watch the overview video for this book here.
Note that these are the instructions for Mac computers; you may need some custom setup for FFmpeg and/or SoX if you're using a Windows and/or Linux computer.
First, clone the repository and submodules:
git clone --recurse-submodules -j8 https://github.com/jim-schwoebel/voicebook
Now you need to run the setup.py script in a virtual environment to make sure you have all the required dependencies for all the chapters of the book. To do this, run:
cd voicebook
pip3 install virtualenv
virtualenv env
source env/bin/activate
python3 setup.py
Now you have all the dependencies necessary to follow along with the chapters in the book. You don’t need to worry about any other installations.
This repository is licensed under the Apache 2.0 License.
Any feedback on the book or this repository is greatly appreciated.
- If you find something that is missing or doesn't work, please consider opening a GitHub issue.
- If you'd like to be mentored by someone on our team, check out the Innovation Fellows Program.
- If you are looking for a framework to start building machine learning models in voice computing, check out Allie.
- If you want to talk to me directly, please send me an email @ js@neurolex.co.
Please use the following citation when citing this book in your research work:
- Schwoebel, J. (2018). An Introduction to Voice Computing in Python. Boston; Seattle; Atlanta: NeuroLex Laboratories. https://github.com/jim-schwoebel/voicebook
Check out the wiki or documentation below to follow along with each chapter in the book. In this way, you can get quickly up to speed with the 200+ scripts included in this repository.
- Chapter 1: Fundamentals
- Chapter 2: Collection
- Chapter 3: Featurization
- Chapter 4: Data Modeling
- Chapter 5: Generation
- Chapter 6: Visualization
- Chapter 7: Designing Voice Computers
- Chapter 8: Designing Server Architectures
- Chapter 9: Legal, Security, and Ethical Considerations
- Chapter 10: Getting involved