Use Microsoft Edge's online text-to-speech service from Python
Free, high-quality text-to-speech API endpoint to replace OpenAI
State-of-the-art TTS model under 25MB
A TTS that fits in your CPU (and pocket)
Converts text to speech in realtime
Qwen3-TTS is an open-source series of TTS models
Aaws-record gem, an abstraction for Amazon DynamoDB
Towards Human-Sounding Speech
Privacy focused recorder app built with MD3
A single Gradio + React WebUI with extensions for ACE-Step
Spark-TTS Inference Code
A lightweight text-to-speech model with zero-shot voice cloning
Controllable & emotion-expressive zero-shot TTS
A fast TTS architecture with conditional flow matching
Comprehensive Gradio WebUI for audio processing
Uninstall Microsoft Edge with an executable or batch script
Real-time voice interactive digital human
Speech to Text to Speech, sends text as OSC messages
Bailing is a voice dialogue robot similar to GPT-4o
Foundational model for human-like, expressive TTS
A gallery that showcases on-device ML/GenAI use cases
Virtual AI anchor that combines state-of-the-art technology
A lightning fast audio upsampler
AWS IoT FleetWise Edge Agent
Examples and guides for using the Gemini API