
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,805 68 Updated Feb 25, 2026

Codebase for evaluation of deep generative models as presented in "Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models"

Jupyter Notebook 202 18 Updated Mar 3, 2025

CZI AI Residency (Francesco Locatello) project on MorphGen: Generative Modeling for Morphologically Faithful and Generalizable Cell Painting Representations

Python 4 1 Updated Oct 20, 2025

Using vision-language models to decode natural image perception from non-invasive brain recordings.

Jupyter Notebook 197 49 Updated Dec 24, 2025

PyTorch re-implementation of FlowTok: Flowing Seamlessly Across Text and Image Tokens

Python 12 4 Updated Nov 26, 2025

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 4,212 316 Updated Jan 5, 2026

A project on computer vision, machine learning, and deep learning: course notes plus programs I wrote myself

Python 550 35 Updated Jan 26, 2026

(NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation

Python 65 Updated Oct 14, 2025

Official Repository of Absolute Zero Reasoner

Python 1,821 296 Updated Aug 24, 2025

Online Semantic Planning for Missions with Incomplete Natural Language Specifications in Unstructured Environments

Python 27 6 Updated Oct 14, 2025

Heterogeneous Robot Collaboration in Unstructured Environments with Grounded Generative Intelligence

4 2 Updated Nov 3, 2025

A next.js web application that integrates AI capabilities with draw.io diagrams. This app allows you to create, modify, and enhance diagrams through natural language commands and AI-assisted visual…

TypeScript 22,777 2,441 Updated Mar 11, 2026

[RSS 2025] CREStE: Scalable Mapless Navigation with Internet Scale Priors and Counterfactual Guidance

Python 65 3 Updated Jun 14, 2025

A beautiful, simple, clean, and responsive Jekyll theme for academics

HTML 15,280 12,841 Updated Mar 4, 2026

ARENA: Adaptive Risk-aware and Energy-efficient NAvigation for multi-objective problem

C++ 16 Updated Mar 4, 2026

Backend service for xiaozhi-esp32 that helps you quickly build an ESP32 device control server.

JavaScript 8,862 3,016 Updated Mar 12, 2026

Get started with building Fullstack Agents using Gemini 2.5 and LangGraph

Jupyter Notebook 17,976 3,065 Updated Mar 12, 2026

A curated collection of papers, tutorials, videos, and other valuable resources related to Mamba.

712 64 Updated Feb 19, 2026

Collection of papers on state-space models

615 21 Updated Nov 4, 2025

This is a project I made for my minor in college. It is based on machine learning and focuses on image processing. Feel free to make changes.

Jupyter Notebook 1 Updated Nov 15, 2022

Fully open-source command-line AI assistant inspired by OpenAI Codex, supporting local language models.

Python 669 49 Updated Jul 7, 2025

Official implementation of YOLO-MPE

Python 10 Updated Jul 24, 2024

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 3,819 278 Updated Feb 13, 2025

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Python 1,682 155 Updated Mar 9, 2026

DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding

Python 66 1 Updated Jun 10, 2025

[ICLR 2026] RF-DETR is a real-time object detection and segmentation model architecture developed by Roboflow, SOTA on COCO, designed for fine-tuning.

Python 5,854 709 Updated Mar 12, 2026

[CVPR 2025] An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models

Python 151 24 Updated Jun 22, 2025

A curated collection of 45 high-quality RGB image datasets for computer vision in agriculture. Features datasets for weed detection, disease identification, and crop monitoring, focusing on natural…

TeX 14 1 Updated Jan 7, 2026