[go: up one dir, main page]

Skip to content
View soheeyang's full-sized avatar
🎻
Enjoying
🎻
Enjoying

Sponsoring

@pokey

Organizations

@naver @Deepest-Project

Block or report soheeyang

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Exploring the Limitations of Large Language Models on Multi-Hop Queries

Python 16 1 Updated Jun 26, 2024

Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'

Python 161 12 Updated Oct 6, 2024

The official Meta Llama 3 GitHub site

Python 27,149 3,075 Updated Aug 12, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 374 34 Updated Oct 20, 2024

This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking".

Jupyter Notebook 18 2 Updated Mar 21, 2024
Jupyter Notebook 187 37 Updated Oct 1, 2024

Evaluating the Ripple Effects of Knowledge Editing in Language Models

Python 50 4 Updated Apr 15, 2024

A tiling window manager for macOS based on binary space partitioning

C 24,051 649 Updated Nov 1, 2024
Python 240 14 Updated Oct 3, 2024

Oryx is a library for probabilistic programming and deep learning built on top of Jax.

Python 219 10 Updated Nov 15, 2024

Inference Llama 2 in one file of pure C

C 17,474 2,090 Updated Aug 6, 2024

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Python 8,273 826 Updated Nov 18, 2024

Mechanistic Interpretability Visualizations using React

Jupyter Notebook 197 31 Updated Jul 13, 2024

A library for mechanistic interpretability of GPT-style language models

Python 1,581 305 Updated Nov 17, 2024

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,466 1,624 Updated Nov 18, 2024

Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform

Python 10,820 1,161 Updated Jun 30, 2023

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Python 1,356 131 Updated May 27, 2024

This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious current problem in widespread adoption of LLM's for many real…

Python 221 23 Updated Apr 6, 2023

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 23,716 1,967 Updated Sep 26, 2024

PaL: Program-Aided Language Models (ICML 2023)

Python 474 59 Updated Jun 30, 2023

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,669 863 Updated Nov 17, 2024

A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs

113 6 Updated Mar 17, 2023

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,069 3,683 Updated Jul 4, 2024

The Schema-Guided Dialogue Dataset

Python 549 124 Updated Aug 7, 2023

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 29,562 4,056 Updated Jul 17, 2024

[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following

Python 79 2 Updated Sep 13, 2024

A playbook for systematically maximizing the performance of deep learning models.

27,254 2,262 Updated Jun 18, 2024

Expanding natural instructions

Python 959 189 Updated Dec 11, 2023
Next