automatic-speech-recognition

Here are 314 public repositories matching this topic...

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated Nov 8, 2024
Python

zzw922cn / awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

Updated Oct 19, 2023

zzw922cn / Automatic_Speech_Recognition

End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow

audio deep-learning tensorflow paper end-to-end evaluation cnn lstm speech-recognition rnn automatic-speech-recognition feature-vector data-preprocessing phonemes timit-dataset layer-normalization rnn-encoder-decoder chinese-speech-recognition

Updated Mar 24, 2023
Python

coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

deep-learning tensorflow voice-recognition speech-recognition automatic-speech-recognition speech-to-text stt asr speech-recognizer speech-recognition-api

Updated Mar 11, 2024
C++

ahmetoner / whisper-asr-webservice

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated Dec 18, 2024
Python

kakaobrain / pororo

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

natural-language-processing deep-learning speech-synthesis automatic-speech-recognition neural-models

Updated Mar 23, 2022
Python

TensorSpeech / TensorFlowASR

⚡ TensorFlowASR: Almost state-of-the-art automatic speech recognition in TensorFlow 2. Supports languages that can be modeled with characters or subwords.

tensorflow speech-recognition jasper automatic-speech-recognition speech-to-text ctc conformer deepspeech2 tflite rnn-transducer end2end tensorflow2 contextnet tflite-model tflite-convertion subword-speech-recognition streaming-transducer

Updated Dec 7, 2024
Python

snakers4 / open_stt

Open STT

dataset russian automatic-speech-recognition speech-to-text stt asr

Updated Mar 11, 2022
Python

shirayu / whispering

Streaming transcriber with whisper

automatic-speech-recognition whisper

Updated May 1, 2023
Python

jitsi / jiwer

Evaluate your speech-to-text system with similarity measures such as word error rate (WER)

python3 automatic-speech-recognition speech-to-text evaluation-metrics wer word-error-rate

Updated Nov 1, 2024
Python
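
For readers unfamiliar with the metric named in the description above, word error rate is the word-level edit distance (substitutions, deletions, and insertions) divided by the number of words in the reference. The following is a minimal standard-library sketch of that definition, not jiwer's own implementation; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Hypothetical helper for illustration; tokenization is a plain
    whitespace split, with no normalization of case or punctuation.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One deleted word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions count as errors, the value can exceed 1.0 when the hypothesis is much longer than the reference.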

EmulationAI / awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

music-information-retrieval automatic-speech-recognition speech-to-text audio-processing music-ai music-processing large-language-models foundational-models speech-ai audio-ai large-audio-models speech-llms large-language-model-speech

Updated Aug 3, 2024

Picovoice / cheetah

On-device streaming speech-to-text engine powered by deep learning

voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr online-speech-recognition streaming-speech-to-text

Updated Dec 13, 2024
Python

hirofumi0810 / neural_sp

End-to-end ASR/LM implementation with PyTorch

streaming speech language-modeling pytorch transformer speech-recognition seq2seq attention automatic-speech-recognition sequence-to-sequence language-model attention-mechanism asr ctc rnn-transducer transformer-xl

Updated Aug 30, 2021
Python

YoavRamon / awesome-kaldi

A list of features, scripts, blogs, and resources for making better use of Kaldi (http://kaldi-asr.org/)

speech speech-recognition awesome-list automatic-speech-recognition speech-to-text kaldi kaldi-asr

Updated Feb 9, 2022

Z-yq / TensorflowASR

A project dedicated to bringing CPU and on-device models close to GPU-model performance, with a real-time factor (RTF) below 0.1 on CPU

cpp transformer transducers automatic-speech-recognition bert ctc state-of-the-art listen-attend-and-spell tensorflow-cpp tensorflow2

Updated Sep 26, 2024
Python
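
The RTF figure quoted in the description above is the ratio of processing time to audio duration, so a value below 0.1 means that ten seconds of audio are transcribed in under one second. A rough sketch of how such a figure could be measured follows; the names `real_time_factor` and `measure_rtf`, and the `transcribe` callable, are hypothetical and not part of this project's API.

```python
import time
from typing import Any, Callable


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """RTF = time spent processing / duration of the audio processed."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def measure_rtf(transcribe: Callable[[Any], Any], audio: Any,
                audio_seconds: float) -> float:
    """Time one call to a caller-supplied transcribe function (assumed)."""
    start = time.perf_counter()
    transcribe(audio)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, audio_seconds)


if __name__ == "__main__":
    # 0.5 s of compute for 10 s of audio.
    print(real_time_factor(0.5, 10.0))
```

A single timed call is noisy; averaging over several utterances after a warm-up run would give a more stable number.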

jonatasgrosman / huggingsound

HuggingSound: A toolkit for speech-related tasks based on Hugging Face's tools

audio speech transformers speech-recognition automatic-speech-recognition speech-to-text asr

Updated Sep 20, 2023
Python

Picovoice / leopard

On-device speech-to-text engine powered by deep learning

voice-recognition speech-recognition automatic-speech-recognition speech-to-text transcription stt asr voice-to-text on-device

Updated Dec 12, 2024
Python

double22a / speech_dataset

The dataset of Speech Recognition

audio text-to-speech deep-neural-networks deep-learning speech tts speech-synthesis dataset wav speech-recognition automatic-speech-recognition speech-to-text voice-conversion asr speech-separation speech-enhancement speech-segmentation speech-translation speech-diarization

Updated Jul 2, 2024

ArthurFDLR / whisper-youtube

🔉 YouTube video transcription with OpenAI's Whisper

youtube transformer speech-recognition automatic-speech-recognition speech-to-text whisper colab-notebook

Updated Apr 23, 2024
Jupyter Notebook

hirofumi0810 / tensorflow_end2end_speech_recognition

End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)

tensorflow end-to-end speech-recognition beam-search automatic-speech-recognition speech-to-text attention-mechanism asr timit-dataset ctc timit end-to-end-learning csj librispeech joint-ctc-attention

Updated Jan 23, 2018
Python
