Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "ocr" keyword

Top 0.7% on pypi.org
easyocr 1.7.1
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
32 versions - Latest release: 8 months ago - 49 dependent packages - 671 dependent repositories - 225 thousand downloads last month - 22,043 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
rpaframework-dialogs 4.0.5
Dialogs library of RPA Framework
28 versions - Latest release: 7 months ago - 1 dependent package - 13 dependent repositories - 80.9 thousand downloads last month - 1,022 stars on GitHub - 6 maintainers
Top 2.9% on pypi.org
rpaframework-pdf 7.3.2
PDF library of RPA Framework
44 versions - Latest release: about 1 month ago - 1 dependent package - 28 dependent repositories - 688 thousand downloads last month - 1,022 stars on GitHub - 7 maintainers
Top 2.2% on pypi.org
rpaframework-core 11.3.2
Core utilities used by RPA Framework
91 versions - Latest release: about 1 month ago - 10 dependent packages - 31 dependent repositories - 847 thousand downloads last month - 1,022 stars on GitHub - 5 maintainers
ocrodjvu 0.13
OCR for DjVu (Python 3 fork)
1 version - Latest release: over 1 year ago - 3 dependent repositories - 33 downloads last month - 5 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
mayan-edms 4.6.4
Free Open Source Electronic Document Management System
247 versions - Latest release: 17 days ago - 3 dependent repositories - 2.53 thousand downloads last month - 611 stars on GitLab.com - 1 maintainer
rapidocr-api 0.0.6 💰
A cross platform OCR API Library based on OnnxRuntime.
6 versions - Latest release: 2 months ago - 73 downloads last month - 2,031 stars on GitHub - 2 maintainers
rapidocr-web 0.1.9 💰
A cross platform OCR Library based on OnnxRuntime.
14 versions - Latest release: 8 months ago - 100 downloads last month - 2,031 stars on GitHub - 1 maintainer
receipt-parser-core 0.2.5 💰
A supermarket receipt parser written in Python using tesseract OCR
13 versions - Latest release: almost 3 years ago - 1 dependent repositories - 94 downloads last month - 790 stars on GitHub - 1 maintainer
ruppell 1.0.1
Ruppell is a Python package to help in text extraction from documents.
7 versions - Latest release: 10 months ago - 1 dependent repositories - 97 downloads last month - 12 stars on GitHub - 2 maintainers
vsearcher 0.2.17
支持视频内容检索和课件自动生成的库
12 versions - Latest release: 7 months ago - 110 downloads last month - 2 maintainers
Top 9.0% on pypi.org
pix2text 1.0.2
An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, c...
20 versions - Latest release: about 2 months ago - 1 dependent repositories - 1.67 thousand downloads last month - 1,330 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
deepdoctection 0.31
Repository for Document AI
22 versions - Latest release: about 1 month ago - 14 dependent repositories - 2.92 thousand downloads last month - 2,222 stars on GitHub - 2 maintainers
paddleocr-convert 0.0.18
Tool for converting the PaddleOCR model to onnx format.
18 versions - Latest release: about 11 hours ago - 459 downloads last month - 42 stars on GitHub - 2 maintainers
taco-box 0.1.1
An implementation library of Tiling and Corruption (TACo) Augmentations for OCR/HTR!
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 53 downloads last month - 14 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
nocv2easyocr 0.1.1
This is a fork of the EasyOCR library without the opencv requirement
2 versions - Latest release: about 1 year ago - 91 downloads last month - 20,452 stars on GitHub - 2 maintainers
nidaba 2.0.4
Expandable and scalable OCR pipeline
44 versions - Latest release: over 6 years ago - 2 dependent repositories - 312 downloads last month - 85 stars on GitHub - 2 maintainers
nidaba-client 2.0.1
Expandable and scalable OCR pipeline client
5 versions - Latest release: about 7 years ago - 2 dependent repositories - 41 downloads last month - 85 stars on GitHub - 2 maintainers
Top 1.6% on pypi.org
pyocr 0.8.5
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc)
30 versions - Latest release: 8 months ago - 4 dependent packages - 255 dependent repositories - 28.3 thousand downloads last month - 7,831 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
kraken 5.2.4
OCR/HTR engine for all the languages
94 versions - Latest release: 1 day ago - 6 dependent packages - 24 dependent repositories - 4.24 thousand downloads last month - 651 stars on GitHub - 2 maintainers
pyugt 1.0.10
Universal Game Translator from on-screen text in Python
32 versions - Latest release: 3 months ago - 1 dependent repositories - 215 downloads last month - 24 stars on GitHub - 2 maintainers
archive-pdf-tools 1.5.4
Internet Archive PDF compression tools
31 versions - Latest release: 11 months ago - 1 dependent repositories - 1.83 thousand downloads last month - 81 stars on GitHub - 1 maintainer
marker-ocr 0.1.0 removed
Convert PDF to markdown with high speed and accuracy.
1 version - Latest release: 5 months ago - 4,998 stars on GitHub - 2 maintainers
marker-pdf 0.2.1
Convert PDF to markdown with high speed and accuracy.
5 versions - Latest release: about 21 hours ago - 268 downloads last month - 8,351 stars on GitHub - 2 maintainers
receiptparser 1.0.4
Receipt and bill parser using OCR
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 46 downloads last month - 12 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
pymupdf 1.24.3
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
114 versions - Latest release: about 22 hours ago - 134 dependent packages - 1,798 dependent repositories - 2.78 million downloads last month - 4,025 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.
14 versions - Latest release: about 22 hours ago - 2 dependent packages - 133 dependent repositories - 1.92 million downloads last month - 4,025 stars on GitHub - 2 maintainers
pysseract 1.3.1
Python binding to Tesseract API
16 versions - Latest release: over 4 years ago - 1 dependent repositories - 945 downloads last month - 0 stars on GitHub - 2 maintainers
usls 0.2023.0
Useless CV toolkits
29 versions - Latest release: 5 months ago - 9 downloads last month - 5 stars on GitHub - 2 maintainers
Top 0.8% on pypi.org
baidu-aip 4.16.13
Baidu AIP SDK
68 versions - Latest release: 6 months ago - 9 dependent packages - 1,035 dependent repositories - 27.6 thousand downloads last month - 3 maintainers
nepali-nlp 0.0.0
Natural language processing library for Nepali langauge
1 version - Latest release: over 3 years ago - 1 dependent repositories - 6 downloads last month - 149 stars on GitHub - 1 maintainer
tabularocr 0.1.0
TabularOCR is a Python library that provides an easy-to-use Optical Character Recognition (OCR) s...
1 version - Latest release: 2 months ago - 47 downloads last month - 2 stars on GitHub - 2 maintainers
Top 7.5% on pypi.org
agentocr 2.0.0
An easy-to-use OCR package with multilingual support.
8 versions - Latest release: over 2 years ago - 39 dependent repositories - 658 downloads last month - 112 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes
1 version - Latest release: about 3 years ago - 1 dependent repositories - 13 downloads last month - 3 stars on GitHub - 2 maintainers
normcap 0.5.6
OCR-powered screen-capture tool to capture information instead of images.
54 versions - Latest release: 2 days ago - 1 dependent repositories - 883 downloads last month - 1,710 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
paddleocr 2.7.5
Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model, support tra...
43 versions - Latest release: about 1 month ago - 29 dependent packages - 549 dependent repositories - 153 thousand downloads last month - 38,622 stars on GitHub - 1 maintainer
surya-ocr 0.4.1
OCR, layout, reading order, and line detection in 90+ languages
14 versions - Latest release: 2 days ago - 5.5 thousand downloads last month - 5,540 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A python script to iterate over a list of PDF in a directory and try to guess their language with...
12 versions - Latest release: 11 months ago - 97 downloads last month - 58,291 stars on GitHub - 3 maintainers
filecabinet 2.1.0
A local, offline document archive
3 versions - Latest release: 11 months ago - 31 downloads last month - 58,291 stars on GitHub - 2 maintainers
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)
16 versions - Latest release: 2 months ago - 384 downloads last month - 58,291 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
unstructured 0.13.7
A library that prepares raw documents for downstream ML tasks.
132 versions - Latest release: 2 days ago - 69 dependent packages - 3,374 dependent repositories - 1.09 million downloads last month - 4,064 stars on GitHub - 1 maintainer
textnoisr 1.1.1
Add noise to text at the character level
5 versions - Latest release: 2 months ago - 156 downloads last month - 12 stars on GitHub - 2 maintainers
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.
9 versions - Latest release: 6 months ago - 67 downloads last month - 58,199 stars on GitHub - 4 maintainers
pdftoprompt 0.1.2
Python library to abbreviate a PDF file to GPT 8k prompt length
3 versions - Latest release: about 1 year ago - 33 downloads last month - 58,199 stars on GitHub - 2 maintainers
targimo 0.0.1 removed
Targimo: An Artifical Intelligence Model that Revolutionizes Sentiment Analysis
1 version - Latest release: 7 months ago - 144 downloads last month - 54,235 stars on GitHub - 2 maintainers
dedoc 2.2.1
Extract content and logical tree structure from textual documents
15 versions - Latest release: 7 days ago - 438 downloads last month - 77 stars on GitHub - 2 maintainers
icdar-tools 0.0.3
a pip install icdar_tools
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 26 downloads last month - 2,993 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
228 versions - Latest release: 24 days ago - 6 dependent packages - 108 dependent repositories - 102 thousand downloads last month - 11,885 stars on GitHub - 1 maintainer
pixparse 0.1.0.dev0
1 version - Latest release: 11 months ago - 14 downloads last month - 2 maintainers
ocred 0.4.0
Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials
6 versions - Latest release: 8 months ago - 57 downloads last month - 13 stars on GitHub - 1 maintainer
imgtotxt 0.1.2
OCR app running locally with native UI
3 versions - Latest release: about 1 year ago - 31 downloads last month - 0 stars on GitHub - 2 maintainers
percato 0.1.0
Farsi data generator and an OCR tool for Farsi using Detectron2
1 version - Latest release: about 3 years ago - 1 dependent repositories - 59 downloads last month - 16 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
ddddocr 1.5.3
带带弟弟OCR
21 versions - Latest release: 4 days ago - 17 dependent packages - 153 dependent repositories - 55.1 thousand downloads last month - 8,346 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
ddddocrfix 1.4.8
带带弟弟OCR
1 version - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 972 downloads last month - 7,107 stars on GitHub - 2 maintainers
transkribus-to-prima 0.0.1
Convert Transkribus PAGE-XML to standard PAGE-XML
1 version - Latest release: over 2 years ago - 1 dependent repositories - 18 downloads last month - 10 stars on GitHub - 2 maintainers
straug 0.1.2
Data Augmentation for STR
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 113 downloads last month - 233 stars on GitHub - 1 maintainer
winocr 0.0.14
Windows.Media.Ocr
14 versions - Latest release: 5 months ago - 1 dependent repositories - 196 downloads last month - 11 stars on GitHub - 2 maintainers
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.
3 versions - Latest release: 8 months ago - 9 downloads last month - 2 maintainers
reading-image 1.0.1
Reading Image is a text analysis tool for images files (png, jpg, jpeg) and pdf. The system will ...
3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 27 downloads last month - 2 maintainers
Top 4.7% on pypi.org
pdftabextract 0.3.0
A set of tools for data mining (OCR-processed) PDFs
5 versions - Latest release: over 6 years ago - 12 dependent repositories - 1.29 thousand downloads last month - 2,159 stars on GitHub - 4 maintainers
marearts-anpr 2.1.813
ANPR (Automatic Number Plate Recognition) SDK by MareArts
4 versions - Latest release: 3 months ago - 52 downloads last month - 2 stars on GitHub - 2 maintainers
multiocr 0.1.4
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-rela...
4 versions - Latest release: 9 months ago - 48 downloads last month - 3,089 stars on GitHub - 2 maintainers
Top 3.2% on pypi.org
python-doctr 0.8.1
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
14 versions - Latest release: 2 months ago - 2 dependent packages - 20 dependent repositories - 28.1 thousand downloads last month - 3,089 stars on GitHub - 2 maintainers
indic-doctr 0.7.1a0
Indic Document Text Recognition (indic-docTR): deep Learning for high-performance OCR on documents.
1 version - Latest release: about 1 year ago - 15 downloads last month - 3,089 stars on GitHub - 2 maintainers
dinglehopper 0.9.6
The OCR evaluation tool
7 versions - Latest release: 4 days ago - 86 downloads last month - 53 stars on GitHub - 1 maintainer
transkribus-fixer 0.0.1
Convert Transkribus PAGE-XML to standard PAGE-XML
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 13 downloads last month - 10 stars on GitHub - 2 maintainers
Top 6.8% on pypi.org
amazon-textract-textractor 1.7.10
A package to use AWS Textract services.
58 versions - Latest release: 22 days ago - 3 dependent packages - 1 dependent repositories - 57.4 thousand downloads last month - 347 stars on GitHub - 8 maintainers
hydra-api 1.2.0
Official client for Siftrics' Hydra API, which is a text recognition documents-to-database service
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 40 downloads last month - 1 stars on GitHub - 2 maintainers
deep-text 1.3.7
Deep learning nlp model framework, provides command-line tools.
15 versions - Latest release: over 5 years ago - 3 dependent repositories - 86 downloads last month - 2 maintainers
surya-ocr-vlite 0.3.0
OCR, layout analysis, and line detection in 90+ languages
1 version - Latest release: about 1 month ago - 67 downloads last month - 0 stars on GitHub - 2 maintainers
easypaddleocr 0.2.1
A simple, optional tool for PaddleOCR Detection, direction classification and recognition on CPU ...
6 versions - Latest release: 4 days ago - 502 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
pix2tex 0.1.2
pix2tex: Using a ViT to convert images of equations into LaTeX code.
31 versions - Latest release: 12 months ago - 1 dependent package - 5 dependent repositories - 4.58 thousand downloads last month - 10,900 stars on GitHub - 2 maintainers
fineocr 0.3
FineScanner Mobile OCR for free
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 29 downloads last month - 1 stars on GitHub - 2 maintainers
autoocr 0.0.3
A Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 32 downloads last month - 12 stars on GitHub - 1 maintainer
transcribe 0.0.7
Convert images or audio files to plain text on the command line
6 versions - Latest release: almost 5 years ago - 3 dependent repositories - 116 downloads last month - 33 stars on GitHub - 2 maintainers
qbc-idcard-ocr 1.1.2
Recognize ID Card By Ocr.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 5 downloads last month - 2 maintainers
invoice-captcha 0.0.1
国税总局发票查验验证码获取与识别.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 8 downloads last month - 2,923 stars on GitHub - 1 maintainer
fast-plate-ocr 0.1.5
Fast & Lightweight OCR for vehicle license plates.
4 versions - Latest release: 9 days ago - 511 downloads last month - 39 stars on GitHub - 2 maintainers
python-ocr 0.1.5
Input Adaptor to verify file extension
6 versions - Latest release: over 1 year ago - 87 downloads last month - 2 maintainers
huixiangdou 0.1.0
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
3 versions - Latest release: 4 months ago - 18 downloads last month - 826 stars on GitHub - 2 maintainers
konfuzio-sdk 0.3.4
Konfuzio Software Development Kit
397 versions - Latest release: 5 days ago - 1 dependent repositories - 4.77 thousand downloads last month - 52 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
mltu 1.2.5
Machine Learning Training Utilities (MLTU) for TensorFlow and PyTorch
39 versions - Latest release: 6 days ago - 3 dependent repositories - 3.22 thousand downloads last month - 146 stars on GitHub - 1 maintainer
ocrscreen 0.0.5
ocr for recognizing text on computer screen
5 versions - Latest release: over 1 year ago - 31 downloads last month - 1 stars on GitHub - 1 maintainer
pogoocr 0.3.9
A Python tool for running OCR on Pokemon Screenshots
21 versions - Latest release: about 2 years ago - 118 downloads last month - 5 stars on GitHub - 2 maintainers
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 170 downloads last month - 1 stars on GitHub - 2 maintainers
ocr-tamil 0.3.6
Python Tamil OCR package
36 versions - Latest release: about 2 months ago - 461 downloads last month - 36 stars on GitHub - 2 maintainers
wow-ocr 0.0.3
A packaged OCR model to read texts into WoW screenshots
3 versions - Latest release: 11 months ago - 29 downloads last month - 3 stars on GitHub - 2 maintainers
krakensegment 0.4
line segmentation code from kraken
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 33 downloads last month - 5 stars on GitHub - 2 maintainers
jqktrader 0.1.4
Tesseract Open Source OCR Engine (main repository)
3 versions - Latest release: over 1 year ago - 73 downloads last month - 2,794 stars on GitHub - 2 maintainers
awgp-aadhar-pan-extractor 0.0.2
extracts Aadhaar and extracts Pan information
1 version - Latest release: over 2 years ago - 1 dependent repositories - 24 downloads last month - 2,794 stars on GitHub - 2 maintainers
rpawithcomputervision 0.0.4
This is code with package
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 30 downloads last month - 2,794 stars on GitHub - 2 maintainers
Top 1.9% on pypi.org
layoutparser 0.3.4
A unified toolkit for Deep Learning Based Document Image Analysis
11 versions - Latest release: about 2 years ago - 3 dependent packages - 77 dependent repositories - 239 thousand downloads last month - 4,464 stars on GitHub - 1 maintainer
pdftotext3 1.0.4
Convert PDF Files to Text Files using Google's Tesseract OCR.
5 versions - Latest release: almost 2 years ago - 2 dependent repositories - 360 downloads last month - 10 stars on GitHub - 2 maintainers
papermerge-core 2.1.5
Open source document management system for digital archives
77 versions - Latest release: over 1 year ago - 4 dependent repositories - 488 downloads last month - 248 stars on GitHub - 1 maintainer
aadhar-pan-extractor 0.0.2
extracts Aadhaar and extracts Pan information
2 versions - Latest release: over 2 years ago - 42 downloads last month - 2,754 stars on GitHub - 2 maintainers
fontain 0.1.1
Python tool for font recognition on images
1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 1 maintainer
asoen-ocr 1.0.0
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: about 1 year ago - 29 downloads last month - 22,008 stars on GitHub - 2 maintainers
myeasyocr 1.2.3
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: over 3 years ago - 1 dependent repositories - 44 downloads last month - 20,452 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
asone-ocr 1.6.2
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
2 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 231 downloads last month - 20,452 stars on GitHub - 2 maintainers
easyocr-itgn 1.2.3
Modified Easyorc By IntoThatGoodNight
3 versions - Latest release: 10 months ago - 44 downloads last month - 20,429 stars on GitHub - 2 maintainers