Generate Subtitles & Diarize Speakers in DaVinci Resolve using AI.
Updated Dec 19, 2024 - TypeScript
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)
Open source inference code for Rev's model
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
Provides custom Gradio components that make diarization-based audio labeling easier and faster.
Official repository for Mamba-based Segmentation Model for Speaker Diarization
PyAnnote Voice Activity Detection (ONNX version)
A package that runs locally to generate meeting minutes in Japanese
Faster Whisper with Speaker Diarization
Hobby project that transcribes meeting audio files into transcripts with a summary
Toolkit for using Whisper to transcribe YouTube videos. Includes Whisper transcription of YouTube videos, conversion of a YouTube video into a HuggingFace dataset (using audio and subtitles), and evaluation of Whisper transcriptions against YouTube subtitles
Companion repository to the paper "On the calibration of powerset speaker diarization models" published at Interspeech 2024
Helpful Python scripts for audio transcription using the OpenAI Whisper API. Also features the Chat Completion API and pyannote diarization.