-
MinerU Public
Forked from opendatalab/MinerUA one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
Python GNU Affero General Public License v3.0 UpdatedNov 14, 2024 -
docling Public
Forked from DS4SD/doclingGet your docs ready for gen AI
Python MIT License UpdatedNov 1, 2024 -
docling-parse Public
Forked from DS4SD/docling-parseSimple package to extract text with coordinates from programmatic PDFs
C++ MIT License UpdatedOct 30, 2024 -
Liger-Kernel Public
Forked from linkedin/Liger-KernelEfficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License UpdatedAug 24, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJul 27, 2024 -
mergekit Public
Forked from arcee-ai/mergekitTools for merging pretrained large language models.
Python GNU Lesser General Public License v3.0 UpdatedJul 27, 2024 -
pandas-ai Public
Forked from Sinaptik-AI/pandas-aiChat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
Python Other UpdatedJul 26, 2024 -
distilabel Public
Forked from argilla-io/distilabel⚗️ distilabel is a framework for synthetic data and AI feedback for AI engineers that require high-quality outputs, full data ownership, and overall efficiency.
Python Apache License 2.0 UpdatedJul 25, 2024 -
spectrum Public
Forked from cognitivecomputations/spectrumPython Apache License 2.0 UpdatedJul 23, 2024 -
llmsherpa Public
Forked from nlmatics/llmsherpaDeveloper APIs to Accelerate LLM Projects
Jupyter Notebook MIT License UpdatedJun 28, 2024 -
atlassian-python-api Public
Forked from atlassian-api/atlassian-python-apiAtlassian Python REST API wrapper
Python Apache License 2.0 UpdatedJun 25, 2024 -
nlm-ingestor Public
Forked from nlmatics/nlm-ingestorThis repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
Python Apache License 2.0 UpdatedJun 22, 2024 -
pdf2md Public
Forked from opengovsg/pdf2mdA PDF to Markdown converter
JavaScript MIT License UpdatedJun 16, 2024 -
conversational-agent-langchain Public
Forked from mfmezger/conversational-agent-langchainFastAPI Backend for a Conversational Agent using Aleph Alpha, (Azure) OpenAI, GPT4ALL, Langchain and a VectorDB
Python MIT License UpdatedJun 8, 2024 -
kraken Public
Forked from cognitivecomputations/krakenJupyter Notebook Apache License 2.0 UpdatedMay 26, 2024 -
makeMoE Public
Forked from AviSoori1x/makeMoEFrom scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
Jupyter Notebook MIT License UpdatedFeb 1, 2024 -
-
Parsr Public
Forked from axa-group/ParsrTransforms PDF, Documents and Images into Enriched Structured Data
JavaScript Apache License 2.0 UpdatedDec 3, 2023 -
nanoGPT Public
Forked from karpathy/nanoGPTThe simplest, fastest repository for training/finetuning medium-sized GPTs.
Python MIT License UpdatedJul 12, 2023 -