Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "tesseract" keyword

gpyocr 1.6
Python wrapper for Tesseract OCR and Google Vision OCR
11 versions - Latest release: almost 2 years ago - 1 dependent repository - 106 downloads last month - 11 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
rpa 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)
48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.92 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
Top 4.3% on pypi.org
tagui 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)
88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.1 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
tightocr 0.4.4
Thin and pleasant wrapper for Tesseract OCR.
5 versions - Latest release: about 10 years ago - 2 dependent repositories - 38 downloads last month - 24 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.
14 versions - Latest release: 24 days ago - 4 dependent packages - 133 dependent repositories - 2.04 million downloads last month - 4,025 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
230 versions - Latest release: about 2 months ago - 10 dependent packages - 108 dependent repositories - 90.9 thousand downloads last month - 12,250 stars on GitHub - 1 maintainer
ocrd-fork-tesserocr 3.0.0rc2
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
2 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repository - 17 downloads last month - 1,945 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
tesserocr 2.7.0
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
20 versions - Latest release: about 1 month ago - 9 dependent packages - 201 dependent repositories - 66.8 thousand downloads last month - 1,945 stars on GitHub - 1 maintainer
mementor 1.0.4
A library to fetch images from directory to fix orientation and pull OCR from the images along wi...
5 versions - Latest release: about 6 years ago - 1 dependent repository - 16 downloads last month - 72 stars on GitHub - 1 maintainer
imagetocsv 1.0.0
Converts An Image to a CSV. This exists because Chorus 3.0 are bat-shit and only show images for ...
4 versions - Latest release: 12 months ago - 31 downloads last month - 4 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
116 versions - Latest release: 17 days ago - 206 dependent packages - 1,798 dependent repositories - 2.94 million downloads last month - 4,025 stars on GitHub - 1 maintainer
extract-thinker 0.0.1
Library to extract data from files and documents agnostically using LLMs
3 versions - Latest release: 19 days ago - 402 downloads last month - 157 stars on GitHub - 1 maintainer
tesseractmultiprocessing 0.10
Multiprocessing OCR with Tesseract
1 version - Latest release: about 1 year ago - 1 dependent package - 127 downloads last month - 0 stars on GitHub - 1 maintainer
filecabinet 2.1.0
A local, offline document archive
3 versions - Latest release: 11 months ago - 29 downloads last month - 58,738 stars on GitHub - 1 maintainer
ocr-with-format 0.9
Wrapper to pytesseract to preserve space and formatting
8 versions - Latest release: 10 months ago - 33 downloads last month - 0 stars on GitHub - 1 maintainer
textshot 0.1.1
Python tool for grabbing text via screenshot
2 versions - Latest release: over 1 year ago - 82 downloads last month - 1,678 stars on GitHub - 1 maintainer
tesserpy 1.1.2
Python interface to the Tesseract library
3 versions - Latest release: over 9 years ago - 2 dependent repositories - 16 downloads last month - 20 stars on GitHub - 2 maintainers
fastmrz 1.1
Extracts the Machine Readable Zone (MRZ) data from document images
3 versions - Latest release: 21 days ago - 210 downloads last month - 6 stars on GitHub - 1 maintainer
wagtail-textract 1.2
Allow searching for text in Documents in the Wagtail content management system
8 versions - Latest release: over 4 years ago - 1 dependent repository - 29 downloads last month - 31 stars on GitHub - 2 maintainers
saram 1.0.2
A library to fetch images from a directory and get OCR and store in txt with orientation rotation...
9 versions - Latest release: about 6 years ago - 1 dependent repository - 31 downloads last month - 51 stars on GitHub - 1 maintainer
pdfautonup 1.9.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.
21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 178 downloads last month - 4,025 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
1 version - Latest release: about 2 months ago - 39 downloads last month - 4,025 stars on GitHub - 1 maintainer
pmworker 1.2.0 💰
Papermerge worker - extract OCR text documents
4 versions - Latest release: about 4 years ago - 1 dependent repository - 25 downloads last month - 3 stars on GitHub - 1 maintainer
ocyara 1.0.1
A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See t...
5 versions - Latest release: over 7 years ago - 1 dependent repository - 12 downloads last month - 38 stars on GitHub - 2 maintainers
tessy 0.5.2
A Python wrapper for Tesseract-OCR.
2 versions - Latest release: 2 months ago - 1 dependent repository - 24 downloads last month - 0 stars on GitHub - 1 maintainer
motionpdf 0.0.1
A script built on Tesseract-OCR for converting .pdf to .txt
1 version - Latest release: over 2 years ago - 1 dependent repository - 11 downloads last month - 0 stars on GitHub - 1 maintainer
nkocr 2.3.0 💰
A module for running specific OCR tasks on food products and nutritional tables.
14 versions - Latest release: over 1 year ago - 1 dependent repository - 70 downloads last month - 33 stars on GitHub - 2 maintainers
chronotva 1.0.1
ChronoTVA (The Chronomancer's Tesseract Visualization Aid) is a Python 3.9+ command-line tool des...
1 version - Latest release: 7 months ago - 16 downloads last month - 1 star on GitHub - 1 maintainer
verifytweet 0.6.0
A tool to verify Tweet screenshots
2 versions - Latest release: over 4 years ago - 1 dependent repository - 28 downloads last month - 20 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A Python script to iterate over a list of PDFs in a directory and try to guess their language with...
12 versions - Latest release: 12 months ago - 70 downloads last month - 58,526 stars on GitHub - 2 maintainers
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)
17 versions - Latest release: 3 months ago - 1 dependent package - 409 downloads last month - 58,526 stars on GitHub - 1 maintainer
targimo 0.0.1 removed
Targimo: An Artificial Intelligence Model that Revolutionizes Sentiment Analysis
1 version - Latest release: 7 months ago - 144 downloads last month - 54,235 stars on GitHub - 1 maintainer
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.
9 versions - Latest release: 7 months ago - 61 downloads last month - 58,453 stars on GitHub - 2 maintainers
tesseract-window-scanner 0.12
OCR on screenshots with tesseract - Windows only
3 versions - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
django-tesseractfield 0.0.2
A small app providing a tesseract field for django
2 versions - Latest release: about 5 years ago - 1 dependent repository - 19 downloads last month - 1 star on GitHub - 1 maintainer
pytessy 0.1.0
Tesseract-OCR, faster
1 version - Latest release: about 4 years ago - 1 dependent repository - 261 downloads last month - 11 stars on GitHub - 1 maintainer
tesseractrapidfuzz 0.10
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given ...
1 version - Latest release: 9 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
tesserwrap 0.1.6
Basic python bindings to the Tesseract C++ API
11 versions - Latest release: over 9 years ago - 5 dependent repositories - 88 downloads last month - 66 stars on GitHub - 1 maintainer
betterocr 1.2.0 💰
Better text detection by combining OCR engines with LLM.
6 versions - Latest release: 7 months ago - 95 downloads last month - 397 stars on GitHub - 1 maintainer
easyocr-window-scanner 0.10
OCR on screenshots with EasyOCR - Windows only
1 version - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
adf2pdf 0.8.3
Automate the workflow around ADF scanning, OCR and PDF creation
4 versions - Latest release: 10 months ago - 1 dependent repository - 33 downloads last month - 5 stars on GitHub - 1 maintainer
pdftoprompt 0.1.2
Python library to abbreviate a PDF file to GPT 8k prompt length
3 versions - Latest release: about 1 year ago - 37 downloads last month - 58,453 stars on GitHub - 1 maintainer
multitessiocr 0.13
Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) usin...
4 versions - Latest release: 7 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
screen-ocr 0.5.0
Library for processing screen contents using OCR
7 versions - Latest release: about 1 year ago - 1 dependent package - 4 dependent repositories - 184 downloads last month - 30 stars on GitHub - 1 maintainer
tesserhocr2df 0.10
tesseract hocr to pandas DataFrame
1 version - Latest release: 2 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
tesserparsing 0.10
Image Processing and Text Extraction with Tesseract - multiprocessing
1 version - Latest release: 7 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.
3 versions - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
winrtocr 0.10
Multiprocessing library for OCR with WinRT
1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
form-tools 0.1.6
Tesseract Open Source OCR Engine (main repository)
5 versions - Latest release: 2 months ago - 1 dependent repository - 295 downloads last month - 57,199 stars on GitHub - 1 maintainer
tesseract-sdk 0.8.4
Python SDK for Tesseract Models
17 versions - Latest release: 7 months ago - 132 downloads last month - 2 maintainers
a-pandas-ex-tesseract-multirow-regex-fuzz 0.11
Regex/Fuzz search across multiple rows/Tesseract to pandas.DataFrame
2 versions - Latest release: over 1 year ago - 2 dependent packages - 181 downloads last month - 0 stars on GitHub - 1 maintainer
python-ocr 0.1.5
Input Adaptor to verify file extension
6 versions - Latest release: almost 2 years ago - 87 downloads last month - 1 maintainer
ubii-processing-module-ocr 0.2.0
Ubi Interact Processing Module to perform OCR tasks via Tesseract
9 versions - Latest release: almost 2 years ago - 1 dependent repository - 118 downloads last month - 0 stars on GitHub - 1 maintainer
tesseracttrainer 0.1.1
A small framework taking over the manual tesseract training process described in the Tesseract Wiki
2 versions - Latest release: over 11 years ago - 1 dependent repository - 42 downloads last month - 131 stars on GitHub - 1 maintainer
tesseract-python 3.5.1
Self-contained Python module to Tesseract.
1 version - Latest release: about 6 years ago - 1 dependent repository - 24 downloads last month - 2 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes
1 version - Latest release: about 3 years ago - 1 dependent repository - 13 downloads last month - 3 stars on GitHub - 1 maintainer
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.
1 version - Latest release: about 3 years ago - 1 dependent repository - 170 downloads last month - 1 star on GitHub - 1 maintainer
pysseract 1.3.1
Python binding to Tesseract API
16 versions - Latest release: over 4 years ago - 1 dependent repository - 945 downloads last month - 0 stars on GitHub - 1 maintainer
pyslibtesseract 0.0.15
Integration of Tesseract for Python using a shared library
12 versions - Latest release: about 8 years ago - 2 dependent repositories - 68 downloads last month - 11 stars on GitHub - 1 maintainer
polybiblioglot 0.2.0 💰
A tool to translate scanned books
2 versions - Latest release: about 3 years ago - 1 dependent repository - 16 downloads last month - 3 stars on GitHub - 1 maintainer
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...
15 versions - Latest release: over 3 years ago - 1 dependent repository - 158 downloads last month - 17 stars on GitHub - 1 maintainer
northern-lights-forecast 4.1.4
A simple web-scraping northern lights forecast that automatically sends a Telegram notification du...
14 versions - Latest release: almost 2 years ago - 1 dependent repository - 7 downloads last month - 1 star on GitHub - 1 maintainer
nlpknowledge 0.0.2
Package to make sense of images with text information
9 versions - Latest release: over 4 years ago - 1 dependent repository - 40 downloads last month - 5,891 stars on GitHub - 1 maintainer
mc-pdf2txt 0.3.0 💰
Multi-column PDF to Text
3 versions - Latest release: about 1 year ago - 1 dependent repository - 25 downloads last month - 5 stars on GitHub - 1 maintainer
hocr-utils 0.0.3
Package containing utility functions for hOCR and Tesseract
2 versions - Latest release: about 1 year ago - 1 dependent repository - 34 downloads last month - 2 stars on GitHub - 1 maintainer
djtesseract 0.0.6
A small app providing a tesseract field for django 3.1.2
4 versions - Latest release: over 3 years ago - 1 dependent repository - 36 downloads last month - 0 stars on GitHub - 1 maintainer
cropyble 1.2.0
Cropyble is a module that allows a user to easily perform crops on an image containing recognizab...
4 versions - Latest release: over 4 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 1 maintainer
tesseract-positional 0.1.2
Tool to save positional OCR data to a text file
3 versions - Latest release: 10 months ago - 1 dependent repository - 36 downloads last month - 0 stars on GitHub - 1 maintainer
tesstrain 0.1.4
Training utils for Tesseract
5 versions - Latest release: 3 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
readmrz 0.0.2
Machine readable zone reader on ID cards
2 versions - Latest release: over 1 year ago - 1 dependent repository - 892 downloads last month - 11 stars on GitHub - 1 maintainer
autoocr 0.0.3
A Python wrapper for the cross-platform Tesseract OCR engine with multiple languages (e.g. Bangla)
1 version - Latest release: about 5 years ago - 1 dependent repository - 32 downloads last month - 12 stars on GitHub - 1 maintainer
aiopytesseract 0.14.0 💰
asyncio tesseract wrapper for Tesseract-OCR
15 versions - Latest release: 4 months ago - 1 dependent repository - 468 downloads last month - 15 stars on GitHub - 1 maintainer
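
Each entry in the listing above ends with a summary line of ` - `-separated fields (versions, latest release, dependents, downloads, stars, maintainers). For readers who want to compare entries programmatically, the following is a minimal sketch of parsing one such line in Python. It assumes the field pattern seen in this listing holds for every entry; `parse_summary` and the key names it produces are hypothetical and are not part of the Ecosyste.ms API.

```python
import re

# Multipliers for the abbreviated counts used in the listing ("2.04 million").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# Matches "<number> [thousand|million] <label>", e.g. "4,025 stars on GitHub".
_FIELD = re.compile(r"^([\d.,]+)(?:\s+(thousand|million))?\s+(.+)$")

# Map singular labels onto their plural form so keys are consistent
# (these key names are an assumption made for this sketch).
_ALIASES = {
    "version": "versions",
    "maintainer": "maintainers",
    "dependent_package": "dependent_packages",
    "dependent_repository": "dependent_repositories",
    "star_on_github": "stars_on_github",
}


def parse_summary(line):
    """Parse one ' - '-separated summary line into a dict (hypothetical helper)."""
    result = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            # Relative dates ("24 days ago") are kept as free text.
            result["latest_release"] = part.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(part)
        if match is None:
            continue  # skip anything that is not "<number> <label>"
        number, scale, label = match.groups()
        value = float(number.replace(",", "")) * _SCALE.get(scale, 1)
        key = label.lower().replace(" ", "_")
        result[_ALIASES.get(key, key)] = int(round(value))
    return result


if __name__ == "__main__":
    example = (
        "14 versions - Latest release: 24 days ago - 4 dependent packages - "
        "133 dependent repositories - 2.04 million downloads last month - "
        "4,025 stars on GitHub - 1 maintainer"
    )
    print(parse_summary(example))
```

Download counts in the listing are rounded ("2.04 million"), so the parsed integers are approximations of the underlying figures rather than exact values.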