Lists (4)
Sort Name ascending (A-Z)
Starred repositories
official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
A suite of image and video neural tokenizers
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Official Implementation of LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
A curated list of resources for using LLMs to develop more competitive grant applications.
FreeVS: Generative View Synthesis on Free Driving Trajectory
Run Stable Diffusion on Mac natively
Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
HE-Drive: Human-Like End-to-End Driving with Vision Language Models
Transformers are Sample-Efficient World Models. ICLR 2023, notable top 5%.
Depth Any Video with Scalable Synthetic Data
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
A Unified Framework for scalable Vehicle Trajectory Prediction, ECCV 2024
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"
The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."
Streamlit — A faster way to build and share data apps.
Official inference repo for FLUX.1 models
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"