izuna385
🏠 Any feedback would be appreciated!

Starred repositories

Universal LLM Deployment Engine with ML Compilation

Python 19,118 1,571 Updated Nov 2, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,576 416 Updated Nov 5, 2024

Transformer related optimization, including BERT, GPT

C++ 5,863 891 Updated Mar 27, 2024

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 8,573 973 Updated Nov 5, 2024

Effective LLM Alignment Toolkit

Python 81 7 Updated Oct 30, 2024

🥪🦘 An open source sandbox project exploring dbt workflows via a fictional sandwich shop's data.

106 141 Updated Oct 3, 2024

A self-contained dbt project for testing purposes

453 931 Updated Sep 12, 2024

Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.

Python 48 6 Updated Nov 4, 2024

A fork of vLLM with Hugging Face specific modifications

Python 1 Updated Aug 27, 2024

Examples of programs built using Modal

Python 724 169 Updated Nov 5, 2024

Provide a way to use the GPT-QLLama model as an API

Python 43 1 Updated May 20, 2023

GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ

Python 96 7 Updated May 30, 2023
Python 2 Updated Oct 2, 2024

Neural Machine Translation using a Transformer model (T5)

Jupyter Notebook 1 Updated Aug 24, 2023

This repository demonstrates how to fine-tune the Google Gemma 2 2B model to improve its performance on Japanese instruction-following tasks. It serves as a practical guide for developers and resea…

Jupyter Notebook 7 Updated Aug 11, 2024

GGUF Quantization of any LLM.

Jupyter Notebook 29 12 Updated Mar 4, 2024
Python 61 2 Updated Feb 28, 2021

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 9,305 572 Updated Oct 30, 2024

The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google

Python 425 56 Updated Nov 4, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 13,928 1,132 Updated Sep 24, 2024

✨ Magical shell history

Rust 20,747 564 Updated Nov 1, 2024
Python 8 Updated May 25, 2024

Declarative CLI version manager written in Go. Supports lazy install, registries, and continuous updates with Renovate. CLI versions are switched seamlessly.

Go 859 39 Updated Nov 5, 2024

Overview of Japanese LLMs

TypeScript 1,006 31 Updated Nov 1, 2024

Example application for the task of fine-tuning pretrained machine translation models on highly domain-specific, self-extracted translated sentences

Python 6 Updated Jul 7, 2024

All-in-one repo to deploy an automated pipeline for GCP Cloud Asset Inventory and visualise it with Looker Studio

HCL 7 Updated Jun 9, 2024

Proof of concept for getting data from Cloud Asset Inventory and sending it to BigQuery

HCL 1 Updated Nov 21, 2022

Zero-Inflated Gamma probabilistic model

Jupyter Notebook 7 1 Updated Jul 7, 2021

Upgrading Istio with 0 downtime

Shell 9 10 Updated Nov 2, 2021
Jupyter Notebook 1 Updated Feb 4, 2022