Offline speech recognition API for Android, iOS, Raspberry Pi
Robust Speech Recognition via Large-Scale Weak Supervision
Speech recognition module for Python
Speech-to-text, text-to-speech, and speaker recognition
Audio foundation model excelling in audio understanding
kaldi-asr/kaldi is the official location of the Kaldi project
A PyTorch-based Speech Toolkit
On-device Speech Recognition for Apple Silicon
Captcha solver extension for humans
Port of OpenAI's Whisper model in C/C++
A free, open source, and extensible speech-to-text application
Cross-platform AI language practice app
Toolkit for conversational AI
Multilingual Automatic Speech Recognition with word-level timestamps
StreamSpeech is a seamless model for offline speech recognition
OpenVINO™ Toolkit repository
Underthesea - Vietnamese NLP Toolkit
Repo of Qwen2-Audio chat & pretrained large audio language model
A cross-platform software for text translation and recognition
Omnilingual ASR Open-Source Multilingual SpeechRecognition
Speech to Text to Speech, sends text as OSC messages
AzioSpeech Recognition and Translation
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Replace OpenAI GPT with another LLM in your app
Training data (data labeling, annotation, workflow) for all data types