izuna385

🏠

Any feedback would be appreciated!

83 followers · 226 following

Achievements

x3 x2

Highlights

Lists (3)

Starred repositories

mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

Python 19,118 1,572 Updated Nov 2, 2024

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,576 416 Updated Nov 5, 2024

NVIDIA / FasterTransformer

Transformer related optimization, including BERT, GPT

C++ 5,863 891 Updated Mar 27, 2024

NVIDIA / TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,576 973 Updated Nov 5, 2024

VikhrModels / effective_llm_alignment

Effective LLM Alignment Toolkit

Python 81 7 Updated Oct 30, 2024

dbt-labs / jaffle-shop

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

106 141 Updated Oct 3, 2024

dbt-labs / jaffle-shop-classic

A self-contained dbt project for testing purposes

453 931 Updated Sep 12, 2024

huggingface / huggingface-inference-toolkit

Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.

Python 48 6 Updated Nov 4, 2024

philschmid / vllm-huggingface

Forked from vllm-project/vllm

A fork of vLLM with Hugging Face specific modifications

Python 1 Updated Aug 27, 2024

modal-labs / modal-examples

Examples of programs built using Modal

Python 724 169 Updated Nov 5, 2024

mzbac / GPTQ-for-LLaMa-API

Provides a way to use a GPTQ-for-LLaMa model as an API

Python 43 1 Updated May 20, 2023

qwopqwop200 / gptqlora

Forked from artidoro/qlora

GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ

Python 96 7 Updated May 30, 2023

vasilikikou / consistent_bioTempRE

Python 2 Updated Oct 2, 2024

Salma-Jamal / Neural-Machine-Translation-T5

Neural Machine Translation using a Transformer model (T5)

Jupyter Notebook 1 Updated Aug 24, 2023

qianniu95 / gemma2_2b_finetune_jp_tutorial

This repository demonstrates how to fine-tune the Google Gemma 2 2B model to improve its performance on Japanese instruction-following tasks. It serves as a practical guide for developers and resea…

Jupyter Notebook 7 Updated Aug 11, 2024

AIAnytime / GGUF-Quantization-of-any-LLM

GGUF Quantization of any LLM.

Jupyter Notebook 29 12 Updated Mar 4, 2024

s-taka / fugumt

Python 61 2 Updated Feb 28, 2021

QwenLM / Qwen2.5

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,310 572 Updated Oct 30, 2024

kyegomez / Gemini

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Python 425 56 Updated Nov 4, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,930 1,132 Updated Sep 24, 2024

atuinsh / atuin

✨ Magical shell history

Rust 20,748 564 Updated Nov 1, 2024

hosimesi / aws-mlops-practice

Python 8 Updated May 25, 2024

aquaproj / aqua

Declarative CLI Version manager written in Go. Support Lazy Install, Registry, and continuous update with Renovate. CLI version is switched seamlessly

Go 859 39 Updated Nov 6, 2024