-
NAVER Cloud, Hyperscale AI
- Seoul, Korea
- https://goddoe.github.io
Stars
Foundational Models for State-of-the-Art Speech and Text Translation
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
Beautifully designed components that you can copy and paste into your apps. Accessible. Customizable. Open Source.
A feature-rich command-line audio/video downloader
🤖 Build voice-based LLM agents. Modular + open source.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A simple screen parsing tool towards pure vision based GUI agent
Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
Automatically evaluate your LLMs in Google Colab
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models
CUDA integration for Python, plus shiny features
Movie Gen Bench - two media generation evaluation benchmarks released with Meta Movie Gen
awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.
Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.
Entropy Based Sampling and Parallel CoT Decoding
Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI
Analysis of the Github Copilot extension
Open source project for data preparation of LLM application builders
Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044