-
Mcgill university
- Montreal
-
12:56
(UTC -04:00) - https://orcid.org/0000-0003-0231-054X
Stars
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Codebase for evaluation of deep generative models as presented in Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models
CZI AI Residency (Francesco Locatello) project on MorphGen: Generative Modeling for Morphologically Faithful and Generalizable Cell Painting Representations
Using vision-language models to decode natural image perception from non-invasive brain recordings.
PyTorch re-implementation of FlowTok: Flowing Seamlessly Across Text and Image Tokens
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
一个计算机视觉、机器学习与深度学习相关的项目,看课程的笔记还有自己做的程序
(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation
Official Repository of Absolute Zero Reasoner
Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments
Heterogeneous Robot Collaboration in Unstructured Environments with Grounded Generative Intelligence
A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…
[RSS 2025] CREStE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance
A beautiful, simple, clean, and responsive Jekyll theme for academics
ARENA: Adaptive Risk-aware and Energy-efficient NAvigation for multi-objective problem
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.
Collection of papers on state-space models
This is a project i made for my MINOR in college , which is based on Machine Learning and focuses on Image Processing .Feel free to make changes .
Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding
[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.
[An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models, 2025, CVPR]
A curated collection of 45 high-quality RGB image datasets for computer vision in agriculture. Features datasets for weed detection, disease identification, and crop monitoring, focusing on natural…