This is a simple Kedro pipeline that shows how custom datasets can be leveraged to process and transcribe raw audio files using torchaudio.
The pipeline includes one custom dataset class, `AudioDataSet`,
and two nodes, as shown in the image below:
- Clone the repository
- Create a new conda environment:
  ```
  conda create -n audio_pipeline python=3.10
  conda activate audio_pipeline
  ```
- Install the dependencies:
  ```
  pip install -r requirements.txt
  ```
- Make sure that ffmpeg and ffmpeg-python are installed on your machine
- Create an `.env` file that contains the following value:
  ```
  OPENAI_API_KEY= # your API key
  ```
- Put the `.mp3` audio files that should be processed into the directory `data/01_raw`
- Modify the Whisper parameters in `conf/base/parameters.yml` according to your needs
- Run the pipeline:
  ```
  kedro run
  ```
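Projects typically read the key from `.env` with the python-dotenv package (`from dotenv import load_dotenv`). For illustration, here is a minimal stdlib-only version of that step; the function name mirrors python-dotenv's, and the loader is a sketch that skips quoting and inline-comment handling:

```python
import os
from pathlib import Path


def load_dotenv(path: str = ".env") -> None:
    """Minimal .env loader: export KEY=VALUE lines into os.environ."""
    for raw in Path(path).read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blank lines, comments, and malformed lines
        key, _, value = line.partition("=")
        # Do not overwrite variables already set in the shell
        os.environ.setdefault(key.strip(), value.strip())
```

After calling `load_dotenv()`, the code can fetch the key with `os.environ.get("OPENAI_API_KEY")`.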
Kedro then runs the pipeline and executes the following steps:

- Load the audio files located in `01_raw`
- Reduce noise in the audio files by running the node `int_reduce_noise`
- Save the audio files with reduced noise to `05_model_input`
- Transcribe the audio files with OpenAI Whisper by running the node `pri_transcription`
- Save the transcripts to `07_model_output`