Stars
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
A cloud-native vector database, storage for next generation AI applications
Real-time and accurate open-vocabulary end-to-end object detection
A realtime serving engine for Data-Intensive Generative AI Applications
Tool-Planner: Dynamic Solution Tree Planning for Large Language Model with Tool Clustering
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
A Streamlined Multimodal Agent Framework for Smart Hardware and More
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
agentUniverse is a LLM multi-agent framework that allows developers to easily build multi-agent applications.
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Mora: More like Sora for Generalist Video Generation
We present the first systematic study on the scaling property of raw agents instantiated by LLMs. We find that performance scales with the increase in the number of agents, using the simple(st) way…
Large Language Model based Multi-Agents: A Survey of Progress and Challenges
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
SGLang is a fast serving framework for large language models and vision language models.
A native PyTorch Library for large model training