-
UC Berkeley
- Berkeley, California
- https://tonylian.com/
- in/longlian
- @LongTonyLian
Highlights
Stars
Repository for the paper Stream of Search: Learning to Search in Language
[ICLR 2024] Code for FreeNoise based on VideoCrafter
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
[ICCV 2023] MOSE: A New Dataset for Video Object Segmentation in Complex Scenes
Ongoing research training transformer models at scale
Dead simple FLUX LoRA training UI with LOW VRAM support
Segment Anything (SAM) at Home web app using Gradio
The official Python client for the Huggingface Hub.
This repository is a sample implementation of frontend/backend using SAM code from meta.
Meta's Segment Anything Model (SAM) Demo Site
The official code of the paper "PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction".
A collaboration friendly studio for NeRFs
Code release for NeRF (Neural Radiance Fields)
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Janus-Series: Unified Multimodal Understanding and Generation Models
Simple project page template for your research paper, built with Astro and Tailwind CSS
An open-source implementation for training LLaVA-NeXT.
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
DSPy: The framework for programming—not prompting—language models
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation