Use Microsoft Edge's online text-to-speech service from Python
Online machine learning in Python
A community-supported supercharged version of paperless
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Speech recognition module for Python
A high-performance ML model serving framework, offers dynamic batching
MemU is an open-source memory framework for AI companions
Volcano Engine Reinforcement Learning for LLMs
A python tool that uses GPT-4, FFmpeg, and OpenCV
Implementation of DeepLabCut
A natural language interface for computers
LLM based autonomous agent that does online comprehensive research
Algorithms for outlier, adversarial and drift detection
Machine Learning automation and tracking
AI-Powered tool for automated pull request analysis
Unified Model Serving Framework
Speech-AI-Forge is a project developed around TTS generation model
Trained models & code to predict toxic comments
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
Controllable and fast Text-to-Speech for over 7000 languages
DeepMind model for tracking arbitrary points across videos & robotics
Leveraging BERT and c-TF-IDF to create easily interpretable topics
Qwen3-omni is a natively end-to-end, omni-modal LLM
Collection of reference environments, offline reinforcement learning
Python Stream Processing