C. Luo cluo1989

11 followers · 132 following

Starred repositories

wshzd / Awesome-AIGC

A curated collection of AIGC learning resources, continuously updated.

744 84 Updated Oct 22, 2023

zhangzhao219 / WSDM-Cup-2024

1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc

Python 153 21 Updated Feb 29, 2024

MatsuriDayo / nekoray

Qt based cross-platform GUI proxy configuration manager (backend: sing-box)

C++ 13,230 1,241 Updated Oct 9, 2024

huggingface / pytorch-image-models

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,139 4,756 Updated Nov 5, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,300 361 Updated Oct 24, 2024

opendatalab / DocLayout-YOLO

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 390 28 Updated Oct 31, 2024

langgenius / dify

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 50,771 7,297 Updated Nov 6, 2024

opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 13,631 1,024 Updated Nov 6, 2024

kermitt2 / grobid

A machine learning software for extracting information from scholarly documents

Java 3,544 453 Updated Oct 30, 2024

XH-B / ABM

Python 98 22 Updated Aug 22, 2024

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 503 29 Updated Nov 4, 2024

luopeixiang / textclf

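TextClf: a text classification framework based on PyTorch/Sklearn, including logistic regression, SVM, TextCNN, TextRNN, TextRCNN, DRNN, DPCNN, BERT, and other models. Data processing, model training, and testing can all be done through simple configuration.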

Python 236 39 Updated Jul 21, 2023

luopeixiang / named_entity_recognition

Chinese named entity recognition (with implementations of several models: HMM, CRF, BiLSTM, BiLSTM+CRF)

Python 2,129 537 Updated Jun 21, 2022

yezhengkai / im2latex

Convert the image of the formula to LaTeX. This project is also the final project of Full Stack Deep Learning course.

Jupyter Notebook 6 2 Updated Nov 5, 2023

leoxiaoping / pbottleRPA

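pbottleRPA (小瓶RPA): a permanently free (personal edition) RPA software system. Lightweight, simple, all-purpose RPA software that significantly reduces costs and improves efficiency, with 100% accurate work and non-intrusive integration. Supports workflow automation for both browser web applications and client applications, with flows scripted in either JS or Python.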

JavaScript 111 21 Updated Oct 28, 2024

Yuliang-Liu / Monkey

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,819 129 Updated Nov 4, 2024

OpenBMB / MiniCPM

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,099 451 Updated Nov 6, 2024

X-PLUG / mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,514 99 Updated Sep 28, 2024

Ucas-HaoranWei / GOT-OCR2.0

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,842 493 Updated Nov 4, 2024

jadore801120 / attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,837 1,979 Updated Apr 16, 2024

ZZZHANG-jx / Recommendations-Document-Image-Processing

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

168 11 Updated Sep 13, 2024

ZZZHANG-jx / DocRes

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 306 30 Updated Sep 24, 2024

Menghuan1918 / pdfdeal

A Python wrapper for the Doc2X API that also comes with local text processing (to improve PDF recall in RAG).

Python 193 10 Updated Nov 6, 2024

microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,753 1,832 Updated Nov 6, 2024

Gmgge / TrOCR-Seal-Recognition

Transformer-based OCR, extended to official seal recognition.

Python 151 26 Updated Jun 20, 2024

liuyifan6613 / DocBank-Document-Enhancement-Dataset

DocBank document image enhancement dataset. This dataset is intended for document image enhancement; the tasks covered include: seal detection & removal; watermark detection & removal; document deblurring; document shadow removal; document super-resoluti…

13 1 Updated Oct 22, 2024

sparkfish / shabby-pages

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to o…

Jupyter Notebook 50 6 Updated Nov 5, 2024

intsig-textin / markdown_tester

To try TextIn document parsing, visit https://cc.co/16YSIy

Python 52 7 Updated Oct 30, 2024

peggy1502 / Amazing-Resources

List of references and online resources related to data science, machine learning and deep learning.

317 90 Updated Nov 2, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,020 3,686 Updated Nov 3, 2024
