Starred repositories

Official release of InternLM2.5 base and chat models. 1M context support

Python 6,476 457 Updated Nov 18, 2024

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 17,365 1,253 Updated Nov 18, 2024

Simple Python version management

Roff 39,432 3,060 Updated Nov 17, 2024

A trainable PyTorch reproduction of AlphaFold 3.

Python 641 52 Updated Nov 18, 2024

AlphaFold 3 inference pipeline.

Python 4,859 527 Updated Nov 14, 2024

OSX (macOS) inside a Docker container.

Python 8,279 282 Updated Nov 16, 2024

A playbook for effectively prompting post-trained LLMs

450 19 Updated Nov 9, 2024

Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)

Python 136 17 Updated Sep 20, 2024

Windows inside a Docker container.

Shell 29,040 1,984 Updated Nov 15, 2024

Prompt tuning app heavily inspired by Anthropic Console Workbench. Built using Mesop.

Python 3 2 Updated Sep 14, 2024

UI for testing prompts across various datasets locally

TypeScript 13 4 Updated Nov 2, 2024

A framework for the evaluation of autoregressive code generation language models.

Python 819 218 Updated Oct 31, 2024

Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024

Python 223 16 Updated Oct 27, 2024

cog implementation of All-In-One Music Structure Analyzer

Python 57 6 Updated Nov 5, 2024

Please see the readme file as well as our 2019 EMNLP paper linked here -->

196 58 Updated Apr 24, 2024
Python 1,474 156 Updated Oct 25, 2024

An LLM-powered repository agent designed to assist developers and teams in generating documentation and understanding repositories quickly.

Python 377 64 Updated Nov 14, 2024

Utility scripts for preprocessing Wikipedia texts for NLP

Python 76 7 Updated Apr 9, 2024

MediaWiki Categories Model

Python 11 2 Updated Feb 14, 2024

Library for reading and writing Jcat files

C 23 9 Updated Nov 18, 2024

LLM framework exploring ergonomic, lightweight multi-agent orchestration. Forked from OpenAI Solution team.

Python 1 Updated Oct 15, 2024

A new bootable USB solution.

C 62,986 4,097 Updated Nov 16, 2024

A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

Python 8,411 1,400 Updated Nov 1, 2024

Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"

Python 254 24 Updated Sep 3, 2024

Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.

Python 24,538 3,166 Updated Sep 24, 2024

Code for 'Multi-level Logit Distillation' (CVPR2023)

Jupyter Notebook 55 6 Updated Sep 23, 2024

A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!

Python 3,248 247 Updated Aug 10, 2024

Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"

Python 293 23 Updated Dec 20, 2023

pgEdge Distributed Postgres

80 Updated Nov 1, 2024