Native and Compact Structured Latents for 3D Generation
Official inference repo for FLUX.2 models
Towards Human-Level Text-to-Speech through Style Diffusion
Animated sprite editor & pixel art tool
A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
The modern PHP app server
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
Operating LLMs in production
Models and examples built with TensorFlow
AI-driven neuro-symbolic solver for high-school geometry problems
State-of-the-art TTS model under 25MB
A speech-text foundation model for real time dialogue
A modern tool for managing database schemas
Video understanding codebase from FAIR for reproducing video models
State of the Art Natural Language Processing
Deep learning library
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Easy responsive images for Jekyll
Usable Implementation of "Bootstrap Your Own Latent" self-supervised
OpenAI swift async text to image for SwiftUI app using OpenAI
Official Repo For "Sa2VA: Marrying SAM2 with LLaVA
The GPU-powered AI application database