- UCL/DeepMind
- London, United Kingdom
- https://soheeyang.github.io
- @soheeyang_
Stars
Exploring the Limitations of Large Language Models on Multi-Hop Queries
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking".
Evaluating the Ripple Effects of Knowledge Editing in Language Models
A tiling window manager for macOS based on binary space partitioning
Oryx is a library for probabilistic programming and deep learning built on top of JAX.
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Mechanistic Interpretability Visualizations using React
A library for mechanistic interpretability of GPT-style language models
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
ChatArena (or Chat Arena) is a library of multi-agent language game environments for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
This project is an attempt to create a common metric to test LLMs for progress in eliminating hallucinations, the most serious current problem in the widespread adoption of LLMs for many real…
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
PaL: Program-Aided Language Models (ICML 2023)
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable), so it combines the best of RNNs and transformers: great performance, fast inference,…
A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
The Schema-Guided Dialogue Dataset
Code and documentation to train Stanford's Alpaca models and generate the data.
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
A playbook for systematically maximizing the performance of deep learning models.