Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "ocr" keyword

Top 7.9% on pypi.org
mglib 1.3.9 πŸ’°
Common code used across all Papermerge project utilities
22 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 89 downloads last month - 2,340 stars on GitHub - 1 maintainer
limitpages 1.0.0 πŸ’°
Papermerge App to limit number of uploaded documents
1 version - Latest release: over 3 years ago - 1 dependent repositories - 7 downloads last month - 2,340 stars on GitHub - 1 maintainer
straug 0.1.2
Data Augmentation for STR
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 121 downloads last month - 233 stars on GitHub - 1 maintainer
Top 7.5% on pypi.org
agentocr 2.0.0
An easy-to-use OCR package with multilingual support.
8 versions - Latest release: over 2 years ago - 39 dependent repositories - 583 downloads last month - 112 stars on GitHub - 1 maintainer
nepali-nlp 0.0.0
Natural language processing library for Nepali langauge
1 version - Latest release: over 3 years ago - 1 dependent repositories - 3 downloads last month - 153 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes
1 version - Latest release: about 3 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
receipt-parser-core 0.2.5 πŸ’°
A supermarket receipt parser written in Python using tesseract OCR
13 versions - Latest release: about 3 years ago - 1 dependent repositories - 62 downloads last month - 790 stars on GitHub - 1 maintainer
pysseract 1.3.1
Python binding to Tesseract API
16 versions - Latest release: over 4 years ago - 1 dependent repositories - 284 downloads last month - 0 stars on GitHub - 1 maintainer
openrecall
OpenRecall is a fully open-source, privacy-first alternative to proprietary solutions like Micros...
1 version - 265 downloads last month - 31 stars on GitHub - 1 maintainer
tabularocr 0.1.0
TabularOCR is a Python library that provides an easy-to-use Optical Character Recognition (OCR) s...
1 version - Latest release: 3 months ago - 132 downloads last month - 2 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A python script to iterate over a list of PDF in a directory and try to guess their language with...
12 versions - Latest release: 12 months ago - 63 downloads last month - 58,738 stars on GitHub - 2 maintainers
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.
9 versions - Latest release: 7 months ago - 20 downloads last month - 58,738 stars on GitHub - 2 maintainers
targimo 0.0.1 removed
Targimo: An Artifical Intelligence Model that Revolutionizes Sentiment Analysis
1 version - Latest release: 8 months ago - 144 downloads last month - 54,235 stars on GitHub - 1 maintainer
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)
17 versions - Latest release: 3 months ago - 1 dependent package - 336 downloads last month - 58,941 stars on GitHub - 1 maintainer
filecabinet 2.1.0
A local, offline document archive
3 versions - Latest release: 12 months ago - 27 downloads last month - 58,963 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
ocrmypdf 16.2.0 πŸ’°
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
230 versions - Latest release: about 2 months ago - 10 dependent packages - 108 dependent repositories - 91.7 thousand downloads last month - 12,250 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
cnocr 2.2.4
Python3 package for Chinese/English OCR, with small pretrained models
33 versions - Latest release: 9 months ago - 4 dependent packages - 27 dependent repositories - 9.15 thousand downloads last month - 2,996 stars on GitHub - 1 maintainer
nlpknowledge 0.0.2
Package to make sense of images with text information
9 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 6,019 stars on GitHub - 1 maintainer
rpawithcomputervision 0.0.4
This is code with package
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 20 downloads last month - 2,887 stars on GitHub - 1 maintainer
aadhar-pan-extractor 0.0.2
extracts Aadhaar and extracts Pan information
2 versions - Latest release: over 2 years ago - 47 downloads last month - 2,887 stars on GitHub - 1 maintainer
nt-textfileloader 2.0.1
Python library to extract text from various file formats. The supported formats are: JPEG, PNG, P...
8 versions - Latest release: 6 months ago - 97 downloads last month - 2,887 stars on GitHub - 1 maintainer
awgp-aadhar-pan-extractor 0.0.2
extracts Aadhaar and extracts Pan information
1 version - Latest release: over 2 years ago - 1 dependent repositories - 12 downloads last month - 2,887 stars on GitHub - 1 maintainer
jqktrader 0.1.4
Tesseract Open Source OCR Engine (main repository)
3 versions - Latest release: over 1 year ago - 81 downloads last month - 2,887 stars on GitHub - 1 maintainer
normcap 0.5.6
OCR-powered screen-capture tool to capture information instead of images.
56 versions - Latest release: about 1 month ago - 1 dependent repositories - 841 downloads last month - 1,734 stars on GitHub - 1 maintainer
icdar-tools 0.0.3
a pip install icdar_tools
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 6 downloads last month - 2,999 stars on GitHub - 1 maintainer
onnxtr 0.2.0
Onnx Text Recognition (OnnxTR): docTR Onnx-Wrapper for high-performance OCR on documents.
4 versions - Latest release: 27 days ago - 548 downloads last month - 7 stars on GitHub - 1 maintainer
imgtotxt 0.1.2
OCR app running locally with native UI
3 versions - Latest release: about 1 year ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
percato 0.1.0
Farsi data generator and an OCR tool for Farsi using Detectron2
1 version - Latest release: over 3 years ago - 1 dependent repositories - 64 downloads last month - 16 stars on GitHub - 1 maintainer
webgrep-tool 1.19
Grep for a Web page with extra features like JS deobfuscation and OCR
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 25 downloads last month - 106 stars on GitHub - 1 maintainer
pixparse 0.1.0.dev0
1 version - Latest release: 12 months ago - 11 downloads last month - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.
14 versions - Latest release: about 1 month ago - 4 dependent packages - 133 dependent repositories - 2.13 million downloads last month - 4,025 stars on GitHub - 1 maintainer
ocrd-fork-tesserocr 3.0.0rc2
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
2 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 15 downloads last month - 1,948 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
tesserocr 2.7.0
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
20 versions - Latest release: about 1 month ago - 9 dependent packages - 201 dependent repositories - 65.4 thousand downloads last month - 1,948 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
pdftabextract 0.3.0
A set of tools for data mining (OCR-processed) PDFs
5 versions - Latest release: over 6 years ago - 12 dependent repositories - 2.07 thousand downloads last month - 2,173 stars on GitHub - 2 maintainers
qbc-idcard-ocr 1.1.2
Recognize ID Card By Ocr.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 6 downloads last month - 1 maintainer
Top 1.4% on pypi.org
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
116 versions - Latest release: 24 days ago - 206 dependent packages - 1,798 dependent repositories - 3.05 million downloads last month - 4,025 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
paddleocr 2.7.5
Awesome OCR toolkits based on PaddlePaddle (8.6M ultra-lightweight pre-trained model, support tra...
43 versions - Latest release: 2 months ago - 43 dependent packages - 549 dependent repositories - 193 thousand downloads last month - 39,300 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
python-doctr 0.8.1
Document Text Recognition (docTR): deep Learning for high-performance OCR on documents.
14 versions - Latest release: 3 months ago - 3 dependent packages - 20 dependent repositories - 30.1 thousand downloads last month - 3,206 stars on GitHub - 1 maintainer
indic-doctr 0.7.1a0
Indic Document Text Recognition (indic-docTR): deep Learning for high-performance OCR on documents.
1 version - Latest release: about 1 year ago - 17 downloads last month - 3,199 stars on GitHub - 1 maintainer
multiocr 0.1.4
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-rela...
4 versions - Latest release: 10 months ago - 11 downloads last month - 3,196 stars on GitHub - 1 maintainer
hydra-api 1.2.0
Official client for Siftrics' Hydra API, which is a text recognition documents-to-database service
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 17 downloads last month - 1 stars on GitHub - 1 maintainer
alibabacloud-ocr-api20210707 2.0.8
Alibaba Cloud ocr-api (20210707) SDK Library for Python
29 versions - Latest release: 2 months ago - 1 dependent repositories - 812 downloads last month - 62 stars on GitHub - 1 maintainer
deep-text 1.3.7
Deep learning nlp model framework, provides command-line tools.
15 versions - Latest release: over 5 years ago - 3 dependent repositories - 13 downloads last month - 1 maintainer
winocr 0.0.14
Windows.Media.Ocr
14 versions - Latest release: 6 months ago - 1 dependent repositories - 143 downloads last month - 11 stars on GitHub - 1 maintainer
transkribus-to-prima 0.0.1
Convert Transkribus PAGE-XML to standard PAGE-XML
1 version - Latest release: over 2 years ago - 1 dependent repositories - 6 downloads last month - 10 stars on GitHub - 1 maintainer
surya-ocr-vlite 0.3.0
OCR, layout analysis, and line detection in 90+ languages
1 version - Latest release: 2 months ago - 1 dependent package - 42 downloads last month - 0 stars on GitHub - 1 maintainer
usls 0.2023.0
Useless CV toolkits
29 versions - Latest release: 6 months ago - 18 downloads last month - 9 stars on GitHub - 1 maintainer
konfuzio-sdk 0.3.5
Konfuzio Software Development Kit
409 versions - Latest release: 25 days ago - 1 dependent repositories - 5.29 thousand downloads last month - 54 stars on GitHub - 1 maintainer
reading-image 1.0.1
Reading Image is a text analysis tool for images files (png, jpg, jpeg) and pdf. The system will ...
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 13 downloads last month - 1 maintainer
autoocr 0.0.3
A Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)
1 version - Latest release: about 5 years ago - 1 dependent repositories - 18 downloads last month - 13 stars on GitHub - 1 maintainer
mindocr 0.3.1
A toolbox of OCR models and algorithms based on MindSpore.
2 versions - Latest release: 5 months ago - 28 downloads last month - 165 stars on GitHub - 1 maintainer
marearts-anpr 2.1.813
ANPR (Automatic Number Plate Recognition) SDK by MareArts
4 versions - Latest release: 4 months ago - 130 downloads last month - 2 stars on GitHub - 1 maintainer
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.
3 versions - Latest release: 9 months ago - 7 downloads last month - 1 maintainer
transkribus-fixer 0.0.1
Convert Transkribus PAGE-XML to standard PAGE-XML
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 13 downloads last month - 10 stars on GitHub - 1 maintainer
onnx-donut 0.1.0
Export Donut model to onnx and run it with onnxruntime
1 version - Latest release: 7 months ago - 34 downloads last month - 5,367 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
donut-python 1.0.9
OCR-free Document Understanding Transformer
9 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 8.64 thousand downloads last month - 5,367 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
deepdoctection 0.32
Repository for Document AI
23 versions - Latest release: 28 days ago - 14 dependent repositories - 3.41 thousand downloads last month - 2,292 stars on GitHub - 1 maintainer
easypaddleocr 0.2.1
A simple, optional tool for PaddleOCR Detection, direction classification and recognition on CPU ...
6 versions - Latest release: about 1 month ago - 300 downloads last month - 2 stars on GitHub - 1 maintainer
transcribe 0.0.7
Convert images or audio files to plain text on the command line
6 versions - Latest release: almost 5 years ago - 3 dependent repositories - 103 downloads last month - 33 stars on GitHub - 1 maintainer
pdfdeal
Easier to deal with PDF, extract readable text and OCR to recognise image text and clean the form...
6 versions - 932 downloads last month - 7 stars on GitHub - 1 maintainer
extract-thinker 0.0.1
Library to extract data from files and documents agnositicaly using LLMs
4 versions - Latest release: 26 days ago - 555 downloads last month - 227 stars on GitHub - 1 maintainer
fineocr 0.3
FineScanner Mobile OCR for free
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 20 downloads last month - 1 stars on GitHub - 1 maintainer
python-ocr 0.1.5
Input Adaptor to verify file extension
6 versions - Latest release: almost 2 years ago - 79 downloads last month - 1 maintainer
dedoc 2.2.1
Extract content and logical tree structure from textual documents
17 versions - Latest release: about 1 month ago - 313 downloads last month - 94 stars on GitHub - 1 maintainer
pogoocr 0.3.9
A Python tool for running OCR on Pokemon Screenshots
21 versions - Latest release: over 2 years ago - 101 downloads last month - 5 stars on GitHub - 1 maintainer
invoice-captcha 0.0.1
ε›½η¨Žζ€»ε±€ε‘η₯¨ζŸ₯ιͺŒιͺŒθ―η θŽ·ε–δΈŽθ―†εˆ«.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 5 downloads last month - 2,923 stars on GitHub - 1 maintainer
huixiangdou 0.1.0
Overcoming Group Chat Scenarios with LLM-based Technical Assistance
3 versions - Latest release: 5 months ago - 30 downloads last month - 826 stars on GitHub - 1 maintainer
textmater
Extract Structured Data from text
1 version
krakensegment 0.4
line segmentation code from kraken
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 33 downloads last month - 5 stars on GitHub - 1 maintainer
quipucamayoc 0.1.2
Tools to extract information from digitized historical documents
2 versions - Latest release: about 2 years ago - 1 dependent repositories - 33 downloads last month - 21 stars on GitHub - 1 maintainer
ocrscreen 0.0.5
ocr for recognizing text on computer screen
5 versions - Latest release: over 1 year ago - 31 downloads last month - 1 stars on GitHub - 1 maintainer
textnoisr 1.1.1
Add noise to text at the character level
5 versions - Latest release: 3 months ago - 65 downloads last month - 12 stars on GitHub - 1 maintainer
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 125 downloads last month - 1 stars on GitHub - 1 maintainer
wow-ocr 0.0.3
A packaged OCR model to read texts into WoW screenshots
3 versions - Latest release: about 1 year ago - 29 downloads last month - 3 stars on GitHub - 1 maintainer
ocr-tamil 0.3.6
Python Tamil OCR package
36 versions - Latest release: 3 months ago - 425 downloads last month - 36 stars on GitHub - 1 maintainer
ocrd-gbn 1.0.0
Collection of OCR-D compliant tools for layout analysis and segmentation of historical german-lan...
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 9 downloads last month - 9 stars on GitHub - 1 maintainer
django-ocr_translate 0.5.1
Django app for OCR and translation
11 versions - Latest release: 6 months ago - 4 dependent packages - 2 dependent repositories - 64 downloads last month - 13 stars on GitHub - 1 maintainer
hasy 0.3.1 πŸ’°
Tools for the HASY dataset.
5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 26 downloads last month - 32 stars on GitHub - 1 maintainer
epam.imago 2.0.0rc1
Imago, chemical structures optical recognition tool
1 version - Latest release: over 1 year ago - 12 downloads last month - 6 stars on GitHub - 1 maintainer
pdftotext3 1.0.4
Convert PDF Files to Text Files using Google's Tesseract OCR.
5 versions - Latest release: almost 2 years ago - 2 dependent repositories - 410 downloads last month - 12 stars on GitHub - 1 maintainer
wordmaze 0.3.6
Words and textboxes made amazing
9 versions - Latest release: about 3 years ago - 1 dependent repositories - 55 downloads last month - 0 stars on GitHub - 2 maintainers
fontain 0.1.1
Python tool for font recognition on images
1 version - Latest release: over 2 years ago - 1 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
eynollah 0.3.0
Document Layout Analysis
13 versions - Latest release: about 1 year ago - 1 dependent repositories - 133 downloads last month - 309 stars on GitHub - 1 maintainer
ubii-processing-module-ocr 0.2.0
"Ubi Interact Processing Module to perform OCR tasks via Tesseract"
9 versions - Latest release: almost 2 years ago - 1 dependent repositories - 105 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
ddddocr 1.5.3
带带弟弟OCR
21 versions - Latest release: about 1 month ago - 27 dependent packages - 153 dependent repositories - 57.7 thousand downloads last month - 8,469 stars on GitHub - 1 maintainer
ocrd-cis 0.0.10
CIS OCR-D command line tools
3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 23 downloads last month - 33 stars on GitHub - 3 maintainers
aiopytesseract 0.14.0 πŸ’°
asyncio tesseract wrapper for Tesseract-OCR
15 versions - Latest release: 4 months ago - 1 dependent repositories - 443 downloads last month - 15 stars on GitHub - 1 maintainer
advent-of-code-ocr 1.0.0 πŸ’°
Convert Advent of Code ASCII art
3 versions - Latest release: over 2 years ago - 8 dependent repositories - 189 downloads last month - 12 stars on GitHub - 1 maintainer
cleanocr 0.1.4
Automatically denoise degraded document images to improve ocr engine
4 versions - Latest release: over 1 year ago - 19 downloads last month - 6 stars on GitHub - 1 maintainer
ocred 0.4.0
Clever, simple, and intuitive wrapper functionalities for OCRing specific textual materials
6 versions - Latest release: 9 months ago - 19 downloads last month - 13 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
pyocr 0.8.5
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc)
30 versions - Latest release: 9 months ago - 5 dependent packages - 255 dependent repositories - 25.8 thousand downloads last month - 7,831 stars on GitHub - 1 maintainer
batukh 0.1.1
Document recognizer for multiple languages.
5 versions - Latest release: over 3 years ago - 57 downloads last month - 5 stars on GitHub - 3 maintainers
Top 9.0% on pypi.org
pix2text 1.0.2
An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, c...
25 versions - Latest release: 3 months ago - 1 dependent repositories - 2.25 thousand downloads last month - 1,401 stars on GitHub - 1 maintainer
noteshrinker 0.2.0
Smart shrinking of the size and color palette of images
4 versions - Latest release: almost 6 years ago - 1 dependent repositories - 60 downloads last month - 13 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
layoutparser 0.3.4
A unified toolkit for Deep Learning Based Document Image Analysis
11 versions - Latest release: about 2 years ago - 5 dependent packages - 77 dependent repositories - 283 thousand downloads last month - 4,547 stars on GitHub - 1 maintainer
thai-personal-card-extract 1.3.5
Library for extract infomation from thai personal identity card
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 271 downloads last month - 36 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
baidu-aip 4.16.13
Baidu AIP SDK
68 versions - Latest release: 7 months ago - 9 dependent packages - 1,035 dependent repositories - 31.2 thousand downloads last month - 3 maintainers
rapidocr-openvino 1.3.19
A cross platform OCR Library based on OpenVINO.
64 versions - Latest release: 23 days ago - 1 dependent package - 694 downloads last month - 1 stars on GitHub - 1 maintainer
rapidocr-paddle 1.3.21
A cross platform OCR Library based on PaddlePaddle.
17 versions - Latest release: 23 days ago - 1 dependent package - 823 downloads last month - 1 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
mayan-edms 4.6.4
Free Open Source Electronic Document Management System
250 versions - Latest release: about 2 months ago - 3 dependent repositories - 1.89 thousand downloads last month - 617 stars on GitLab.com - 1 maintainer