1051 repositories in total
Back up, organize, and rediscover every GitHub repository you have ever starred.
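OCR software, free and offline. Open-source, free, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Ships with built-in libraries for multiple languages.
Free, open-source offline OCR software supporting screenshots, batch images, PDF recognition, watermark exclusion, and QR code generation.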
🌈 A cross-platform application for selected-text translation and OCR.
Cross-platform software for selected-text translation and OCR.
A lightweight OCR system based on PaddleOCR, decoupled from the PaddlePaddle deep learning training framework, with ultra-fast inference speed.
A lightweight OCR system rebuilt on PaddleOCR, independent of the PaddlePaddle framework, with very fast inference.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
A lightweight OCR toolkit that extracts structured data from images and PDFs, supports 100+ languages, and is easy to integrate with AI.
A community-supported supercharged document management system: scan, index and archive all your documents
A community-supported document management system that supports scanning, indexing, and archiving.
PDF craft can convert PDF files into various other formats. This project will focus on processing PDF files of scanned books.
Converts scanned PDF books into multiple output formats.
LlamaIndex is the leading document agent and OCR platform
A leading document agent and OCR platform supporting intelligent data extraction and retrieval.
Tesseract Open Source OCR Engine (main repository)
An open-source optical character recognition engine for recognizing text in images.
CnOCR: Awesome Chinese/English OCR Python toolkit based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. (A Chinese/English OCR Python package based on PyTorch/MXNet.)
A Chinese/English OCR Python toolkit based on PyTorch/MXNet, with 20+ pretrained models.