urumchi's Starred Repositories

hiroi-sora/Umi-OCR 44,932

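OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Ships with built-in multi-language libraries.

Free, open-source offline OCR software that supports screenshots, batch images, PDF recognition, watermark exclusion, and QR code generation.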

2026-06-06

ocr offline desktop

pot-app/pot-desktop 18,520

🌈 A cross-platform select-to-translate and OCR application | A cross-platform software for text translation and recognition.

Cross-platform select-to-translate and OCR software.

2026-01-07

ocr cross-platform translation

jingsongliujing/OnnxOCR 1,800

A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.

Lightweight OCR system rebuilt from PaddleOCR, independent of the PaddlePaddle framework, with very fast inference.

2026-01-05

ocr onnx paddleocr

PaddlePaddle/PaddleOCR 79,325

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Lightweight OCR toolkit that extracts structured data from images and PDFs, supports 100+ languages, and is easy to integrate with AI.

2026-01-04

ocr document-processing optical-character-recognition

deepseek-ai/DeepSeek-OCR 23,222

Contexts Optical Compression

Optical compression technique for contextual data.

2025-12-30

ocr optical-character-recognition compression

paperless-ngx/paperless-ngx 41,846

A community-supported supercharged document management system: scan, index and archive all your documents

Community-supported document management system with scanning, indexing, and archiving.

2025-12-18

ocr self-hosted document-management

oomol-lab/pdf-craft 5,718

PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.

Converts scanned PDF books into multiple output formats.

2025-12-17

ocr pdf document-conversion

run-llama/llama_index 49,859

LlamaIndex is the leading document agent and OCR platform

Leading document agent and OCR platform with intelligent data extraction and retrieval.

2023-03-31

ocr rag document-processing

chineseocr/chineseocr 6,112

yolo3+ocr

Chinese text recognition based on YOLO3 and OCR.

2022-09-07

ocr computer-vision yolo

tesseract-ocr/tesseract 74,451

Tesseract Open Source OCR Engine (main repository)

Open-source optical character recognition engine for recognizing text in images.

2021-10-08

ocr optical-character-recognition

breezedeus/CnOCR 3,752

CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [A Chinese/English OCR Python package based on PyTorch/MXNet.]

Chinese/English OCR Python toolkit based on PyTorch/MXNet, with 20+ pretrained models.

2021-08-20

ocr pytorch mxnet