[go: up one dir, main page]

Skip to content
View liubai521's full-sized avatar

Block or report liubai521

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 754 41 Updated Nov 14, 2024

🧑‍🚀 全世界最好的LLM资料总结 | Summary of the world's best LLM resources.

2,523 296 Updated Nov 29, 2024

FCC China open source codebase and curriculum. Learn to code and help nonprofits.

CSS 37,100 1,371 Updated Jul 16, 2023

freeCodeCamp.org's open-source codebase and curriculum. Learn to code for free.

TypeScript 406,336 38,178 Updated Nov 28, 2024

brpc is an Industrial-grade RPC framework using C++ Language, which is often used in high performance system such as Search, Storage, Machine learning, Advertisement, Recommendation etc. "brpc" mea…

C++ 16,566 3,980 Updated Nov 30, 2024

A curated list of awesome Machine Learning frameworks, libraries and software.

Python 66,146 14,656 Updated Nov 11, 2024

LLM serving cluster simulator

Jupyter Notebook 82 8 Updated Apr 25, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 39,584 4,196 Updated Jul 28, 2024

模型压缩的小白入门教程

208 29 Updated Nov 19, 2024

《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合中国宝宝的部署教程

Jupyter Notebook 9,774 1,134 Updated Nov 27, 2024

cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…

Jupyter Notebook 3,711 656 Updated Nov 10, 2024

Collaborative Training of Large Language Models in an Efficient Way

Python 410 58 Updated Aug 28, 2024

Calculate token/s & GPU memory requirement for any LLM. Supports llama.cpp/ggml/bnb/QLoRA quantization

JavaScript 1,164 60 Updated Nov 2, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,321 161 Updated Jun 25, 2024

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Python 5,685 515 Updated Oct 18, 2024

LLM training code for Databricks foundation models

Python 4,061 531 Updated Nov 30, 2024

[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.

Python 398 43 Updated Aug 1, 2024

DeepSeek Coder: Let the Code Write Itself

Python 6,925 482 Updated May 21, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 738 105 Updated Nov 29, 2024

LightSeq: A High Performance Library for Sequence Processing and Generation

C++ 3,216 329 Updated May 16, 2023

Best practice for training LLaMA models in Megatron-LM

Python 630 53 Updated Jan 2, 2024

GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well as OpenAI's earlier models on 20+ curated benchmarks under al…

Python 349 23 Updated Apr 10, 2024

OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.

C++ 5,931 667 Updated Nov 29, 2024

AI Native Data App Development framework with AWEL(Agentic Workflow Expression Language) and Agents

Python 13,847 1,863 Updated Nov 29, 2024

Large Language Model Text Generation Inference

Python 9,162 1,081 Updated Nov 30, 2024

Official repository for Spyder - The Scientific Python Development Environment

Python 8,370 1,623 Updated Nov 30, 2024

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,702 621 Updated Nov 20, 2024

A comparison of pretraining framework for LLM

Python 19 3 Updated May 9, 2024

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Python 6,702 368 Updated Jul 11, 2024
Next