[go: up one dir, main page]

Skip to content
View cluo1989's full-sized avatar

Block or report cluo1989

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

AIGC资料汇总学习,持续更新......

744 84 Updated Oct 22, 2023

1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc

Python 153 21 Updated Feb 29, 2024

Qt based cross-platform GUI proxy configuration manager (backend: sing-box)

C++ 13,230 1,241 Updated Oct 9, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 32,139 4,756 Updated Nov 5, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 5,300 361 Updated Oct 24, 2024

DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception

Python 390 28 Updated Oct 31, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 50,771 7,297 Updated Nov 6, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 13,631 1,024 Updated Nov 6, 2024

A machine learning software for extracting information from scholarly documents

Java 3,544 453 Updated Oct 30, 2024
Python 98 22 Updated Aug 22, 2024

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 503 29 Updated Nov 4, 2024

TextClf :基于Pytorch/Sklearn的文本分类框架,包括逻辑回归、SVM、TextCNN、TextRNN、TextRCNN、DRNN、DPCNN、Bert等多种模型,通过简单配置即可完成数据处理、模型训练、测试等过程。

Python 236 39 Updated Jul 21, 2023

中文命名实体识别(包括多种模型:HMM,CRF,BiLSTM,BiLSTM+CRF的具体实现)

Python 2,129 537 Updated Jun 21, 2022

Convert the image of the formula to LaTeX. This project is also the final project of Full Stack Deep Learning course.

Jupyter Notebook 6 2 Updated Nov 5, 2023

小瓶RPA 永久免费(个人版)RPA软件系统。 轻量级简单全能的RPA软件,显著降本增效 & 工作100%准确 & 非侵入式集成。同时支持浏览器web应用和客户端应用的操作流程自动化。同时支持 Js 和 Python 两种脚本制作流程。

JavaScript 111 21 Updated Oct 28, 2024

【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Python 1,819 129 Updated Nov 4, 2024

MiniCPM3-4B: An edge-side LLM that surpasses GPT-3.5-Turbo.

Jupyter Notebook 7,099 451 Updated Nov 6, 2024

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,514 99 Updated Sep 28, 2024

Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Python 5,842 493 Updated Nov 4, 2024

A PyTorch implementation of the Transformer model in "Attention is All You Need".

Python 8,837 1,979 Updated Apr 16, 2024

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

168 11 Updated Sep 13, 2024

[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Python 306 30 Updated Sep 24, 2024

A python wrapper for the Doc2X API and comes with native texts processing (to improve PDF recall in RAG). | Doc2X API的python封装,同时附带本地的文本处理(提升PDF在RAG中的召回率)。

Python 193 10 Updated Nov 6, 2024

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 18,753 1,832 Updated Nov 6, 2024

基于transformer的ocr识别,在公章(印章识别, seal recognition)拓展应用

Python 151 26 Updated Jun 20, 2024

DocBank 文档图像增强数据集,此数据集用于文档图像增强,具体任务包括以下内容:Seal detection & Removal 印章检测 & 移除 ;Watermark detection & Removal 水印检测 & 移除;Document deblurring 文档去模糊;Document shadow removal 文档去阴影;Document super-resoluti…

13 1 Updated Oct 22, 2024

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground truth and distorted versions appropriate for use in training models to reverse distortions and recover to o…

Jupyter Notebook 50 6 Updated Nov 5, 2024

如需体验textin文档解析,请点击https://cc.co/16YSIy

Python 52 7 Updated Oct 30, 2024

List of references and online resources related to data science, machine learning and deep learning.

317 90 Updated Nov 2, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 31,020 3,686 Updated Nov 3, 2024
Next