A curated list of Awesome LLM/VLM Inference Papers with code, such as FlashAttention, PagedAttention, Parallelism, etc.
sora
llm
llms
vllm
llm-inference
awesome-llm
flash-attention
flash-attention-2
tensorrt-llm
paged-attention
deepseek
open-sora
flash-attention-3
Updated Nov 28, 2024