Starred repositories

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 9,168 861 Updated Nov 17, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Python 11,549 1,027 Updated Nov 14, 2024

Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen

329 18 Updated Oct 19, 2024

DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.

Python 1,543 101 Updated Nov 11, 2024

Hand-object interaction Pretraining From Videos

Python 62 2 Updated Oct 28, 2024

3D Reconstruction with Spatial Memory

Python 725 31 Updated Nov 16, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,027 44 Updated Nov 11, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 12,393 1,141 Updated Oct 14, 2024

Grounding Image Matching in 3D with MASt3R

Python 1,325 100 Updated Oct 12, 2024

Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"

Python 254 7 Updated Aug 15, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 26,225 5,405 Updated Nov 18, 2024

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Python 2,034 90 Updated Aug 5, 2024

TorchCFM: a Conditional Flow Matching library

Python 1,234 100 Updated Nov 17, 2024

3D LiDAR Mapping in Dynamic Environments using a 4D Implicit Neural Representation (CVPR 2024)

Python 141 7 Updated Jul 4, 2024

[ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion.

Python 60 1 Updated Jun 3, 2024

Kolmogorov Arnold Networks

Jupyter Notebook 15,074 1,393 Updated Nov 14, 2024

News: the 10k dataset is ready for download.

HTML 319 4 Updated Oct 29, 2024

[ACCV 2024 (Oral)] Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Python 273 20 Updated Oct 23, 2024

[ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"

Python 96 5 Updated Jul 5, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,265 314 Updated Oct 6, 2024

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Python 1,865 132 Updated Jul 2, 2024

COMO: Compact Mapping and Odometry

Python 171 6 Updated Oct 25, 2024

Code for SPAD: Spatially Aware Multiview Diffusers, CVPR 2024

Python 143 5 Updated Sep 24, 2024

[CVPR'24, Demo Track Honourable Mention] SuperPrimitive: Scene Reconstruction at a Primitive Level

Python 176 3 Updated Mar 28, 2024

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Python 756 34 Updated Nov 14, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 22,259 2,176 Updated Aug 9, 2024

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python 1,435 148 Updated Oct 28, 2024

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Python 1,386 128 Updated Aug 7, 2024

[CVPR'24] MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video

Python 152 4 Updated Jul 19, 2024