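| Speech recognition | Speech synthesis | Speaker verification | Speaker identification |
|--------------------|------------------|----------------------|------------------------|
| ✔️ | ✔️ | ✔️ | ✔️ |

| Spoken language identification | Audio tagging | Voice activity detection |
|--------------------------------|---------------|--------------------------|
| ✔️ | ✔️ | ✔️ |

| Keyword spotting | Add punctuation |
|------------------|-----------------|
| ✔️ | ✔️ |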
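| Architecture | Android | iOS | Windows | macOS | Linux |
|--------------|---------|-----|---------|-------|-------|
| x64          | ✔️      |     | ✔️      | ✔️    | ✔️    |
| x86          | ✔️      |     | ✔️      |       |       |
| arm64        | ✔️      | ✔️  | ✔️      | ✔️    | ✔️    |
| arm32        | ✔️      |     |         |       | ✔️    |
| riscv64      |         |     |         |       | ✔️    |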
Supported programming languages
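| 1. C++ | 2. C | 3. Python | 4. C# | 5. Java | 6. JavaScript |
|--------|------|-----------|-------|---------|---------------|
| ✔️ | ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |

| 7. Kotlin | 8. Swift | 9. Go | 10. Dart | 11. Rust |
|-----------|----------|-------|----------|----------|
| ✔️ | ✔️ | ✔️ | ✔️ | ✔️ |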
For Rust support, please see https://github.com/thewh1teagle/sherpa-rs
It also supports WebAssembly.
This repository supports running the following functions locally
Speech-to-text (i.e., ASR); both streaming and non-streaming are supported
Text-to-speech (i.e., TTS)
Speaker identification
Speaker verification
Spoken language identification
Audio tagging
VAD (e.g., silero-vad)
Keyword spotting
on the platforms and operating systems listed in the Architecture table above,
with the following APIs:
C++, C, Python, Go, C#
Java, Kotlin, JavaScript
Swift
Dart
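
To make the VAD entry in the list above concrete, here is a minimal, illustrative sketch of what voice activity detection does: it marks the time ranges of a signal that contain activity. This is a toy energy-threshold detector written for this README, not silero-vad and not part of this repository's API; the function name `frame_energy_vad`, the 30 ms frame size, and the 0.02 threshold are all assumptions chosen for the example.

```python
import math


def frame_energy_vad(samples, sample_rate=16000, frame_ms=30, threshold=0.02):
    """Toy VAD: return (start_sec, end_sec) ranges whose frame RMS exceeds threshold.

    Hypothetical helper for illustration only; real VADs such as silero-vad
    use a neural network rather than a fixed energy threshold.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    num_frames = len(samples) // frame_len
    segments = []
    start = None  # start time of the segment currently being tracked
    for i in range(num_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        t = i * frame_len / sample_rate
        if rms > threshold and start is None:
            start = t  # activity begins
        elif rms <= threshold and start is not None:
            segments.append((start, t))  # activity ends
            start = None
    if start is not None:
        # Signal ended while still active: close the last segment.
        segments.append((start, num_frames * frame_len / sample_rate))
    return segments


# Synthetic input: 0.3 s of silence, 0.6 s of a 440 Hz tone, 0.3 s of silence.
sr = 16000
silence = [0.0] * (sr * 3 // 10)
tone = [0.5 * math.sin(2 * math.pi * 440 * n / sr) for n in range(sr * 6 // 10)]
segments = frame_energy_vad(silence + tone + silence, sample_rate=sr)
print(segments)
```

On this synthetic input the sketch reports a single active segment covering the tone. A fixed threshold like this breaks down on noisy recordings, which is one reason model-based detectors are used in practice.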
Links for pre-built Android APKs
Links for pre-built Flutter apps
Real-time speech recognition
| Description | URL | Users in China |
|-------------|-----|----------------|
| Streaming speech recognition | Address | Click here |
Note: You need to build from source for iOS.
Links for pre-trained models
Please see https://k2-fsa.github.io/sherpa/social-groups.html
for the Next-gen Kaldi WeChat and QQ discussion groups.