Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "tesseract" keyword
gpyocr 1.6
Python wrapper for Tesseract OCR and Google Vision OCR11 versions - Latest release: almost 2 years ago - 1 dependent repositories - 106 downloads last month - 11 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.92 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
rpa 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.92 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
Top 4.3% on pypi.org
88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.1 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
tagui 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.1 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
tightocr 0.4.4
Thin and pleasant wrapper for Tesseract OCR.5 versions - Latest release: about 10 years ago - 2 dependent repositories - 38 downloads last month - 24 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
14 versions - Latest release: 24 days ago - 4 dependent packages - 133 dependent repositories - 2.04 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.14 versions - Latest release: 24 days ago - 4 dependent packages - 133 dependent repositories - 2.04 million downloads last month - 4,025 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
230 versions - Latest release: about 2 months ago - 10 dependent packages - 108 dependent repositories - 90.9 thousand downloads last month - 12,250 stars on GitHub - 1 maintainer
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched230 versions - Latest release: about 2 months ago - 10 dependent packages - 108 dependent repositories - 90.9 thousand downloads last month - 12,250 stars on GitHub - 1 maintainer
ocrd-fork-tesserocr 3.0.0rc2
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython2 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 17 downloads last month - 1,945 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
20 versions - Latest release: about 1 month ago - 9 dependent packages - 201 dependent repositories - 66.8 thousand downloads last month - 1,945 stars on GitHub - 1 maintainer
tesserocr 2.7.0
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython20 versions - Latest release: about 1 month ago - 9 dependent packages - 201 dependent repositories - 66.8 thousand downloads last month - 1,945 stars on GitHub - 1 maintainer
mementor 1.0.4
A library to fetch images from directory to fix orientation and pull OCR from the images along wi...5 versions - Latest release: about 6 years ago - 1 dependent repositories - 16 downloads last month - 72 stars on GitHub - 1 maintainer
imagetocsv 1.0.0
Converts An Image to a CSV. This exists because Chorus 3.0 are bat-shit and only show images for ...4 versions - Latest release: 12 months ago - 31 downloads last month - 4 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
116 versions - Latest release: 17 days ago - 206 dependent packages - 1,798 dependent repositories - 2.94 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...116 versions - Latest release: 17 days ago - 206 dependent packages - 1,798 dependent repositories - 2.94 million downloads last month - 4,025 stars on GitHub - 1 maintainer
extract-thinker 0.0.1
Library to extract data from files and documents agnositicaly using LLMs3 versions - Latest release: 19 days ago - 402 downloads last month - 157 stars on GitHub - 1 maintainer
tesseractmultiprocessing 0.10
Multiprocessing OCR with Tesseract1 version - Latest release: about 1 year ago - 1 dependent package - 127 downloads last month - 0 stars on GitHub - 1 maintainer
filecabinet 2.1.0
A local, offline document archive3 versions - Latest release: 11 months ago - 29 downloads last month - 58,738 stars on GitHub - 1 maintainer
ocr-with-format 0.9
Wrapper to pytesseract to preserve space and formatting8 versions - Latest release: 10 months ago - 33 downloads last month - 0 stars on GitHub - 1 maintainer
textshot 0.1.1
Python tool for grabbing text via screenshot2 versions - Latest release: over 1 year ago - 82 downloads last month - 1,678 stars on GitHub - 1 maintainer
tesserpy 1.1.2
Python interface to the Tesseract library3 versions - Latest release: over 9 years ago - 2 dependent repositories - 16 downloads last month - 20 stars on GitHub - 2 maintainers
fastmrz 1.1
Extracts the Machine Readable Zone (MRZ) data from document images3 versions - Latest release: 21 days ago - 210 downloads last month - 6 stars on GitHub - 1 maintainer
wagtail-textract 1.2
Allow searching for text in Documents in the Wagtail content management system8 versions - Latest release: over 4 years ago - 1 dependent repositories - 29 downloads last month - 31 stars on GitHub - 2 maintainers
saram 1.0.2
A library to fetch images from a directory and get OCR and store in txt with orientation rotation...9 versions - Latest release: about 6 years ago - 1 dependent repositories - 31 downloads last month - 51 stars on GitHub - 1 maintainer
pdfautonup 1.9.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 178 downloads last month - 4,025 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...1 version - Latest release: about 2 months ago - 39 downloads last month - 4,025 stars on GitHub - 1 maintainer
pmworker 1.2.0 💰
Papermerge worker - extract OCR text documents4 versions - Latest release: about 4 years ago - 1 dependent repositories - 25 downloads last month - 3 stars on GitHub - 1 maintainer
ocyara 1.0.1
A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See t...5 versions - Latest release: over 7 years ago - 1 dependent repositories - 12 downloads last month - 38 stars on GitHub - 2 maintainers
tessy 0.5.2
A Python wrapper for Tesseract-OCR.2 versions - Latest release: 2 months ago - 1 dependent repositories - 24 downloads last month - 0 stars on GitHub - 1 maintainer
motionpdf 0.0.1
A script built on Tesseract-OCR for converting .pdf to .txt1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 0 stars on GitHub - 1 maintainer
nkocr 2.3.0 💰
This is a module to make specifics OCRs at food products and nutricional tables.14 versions - Latest release: over 1 year ago - 1 dependent repositories - 70 downloads last month - 33 stars on GitHub - 2 maintainers
chronotva 1.0.1
ChronoTVA (The Chronomancer's Tesseract Visualization Aid) is a Python 3.9+ command-line tool des...1 version - Latest release: 7 months ago - 16 downloads last month - 1 stars on GitHub - 1 maintainer
verifytweet 0.6.0
A tool to verify Tweet screenshots2 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 20 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A python script to iterate over a list of PDF in a directory and try to guess their language with...12 versions - Latest release: 12 months ago - 70 downloads last month - 58,526 stars on GitHub - 2 maintainers
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)17 versions - Latest release: 3 months ago - 1 dependent package - 409 downloads last month - 58,526 stars on GitHub - 1 maintainer
targimo 0.0.1 removed
Targimo: An Artifical Intelligence Model that Revolutionizes Sentiment Analysis1 version - Latest release: 7 months ago - 144 downloads last month - 54,235 stars on GitHub - 1 maintainer
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.9 versions - Latest release: 7 months ago - 61 downloads last month - 58,453 stars on GitHub - 2 maintainers
tesseract-window-scanner 0.12
OCR on screenshots with tesseract - Windows only3 versions - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
django-tesseractfield 0.0.2
A small app providing a tesseract field for django2 versions - Latest release: about 5 years ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
pytessy 0.1.0
Tesseract-OCR, faster1 version - Latest release: about 4 years ago - 1 dependent repositories - 261 downloads last month - 11 stars on GitHub - 1 maintainer
tesseractrapidfuzz 0.10
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given ...1 version - Latest release: 9 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
tesserwrap 0.1.6
Basic python bindings to the Tesseract C++ API11 versions - Latest release: over 9 years ago - 5 dependent repositories - 88 downloads last month - 66 stars on GitHub - 1 maintainer
betterocr 1.2.0 💰
Better text detection by combining OCR engines with LLM.6 versions - Latest release: 7 months ago - 95 downloads last month - 397 stars on GitHub - 1 maintainer
easyocr-window-scanner 0.10
OCR on screenshots with EasyOCR - Windows only1 version - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
adf2pdf 0.8.3
Automate the workflow around ADF scanning, OCR and PDF creation4 versions - Latest release: 10 months ago - 1 dependent repositories - 33 downloads last month - 5 stars on GitHub - 1 maintainer
pdftoprompt 0.1.2
Python library to abbreviate a PDF file to GPT 8k prompt length3 versions - Latest release: about 1 year ago - 37 downloads last month - 58,453 stars on GitHub - 1 maintainer
multitessiocr 0.13
Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) usin...4 versions - Latest release: 7 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
screen-ocr 0.5.0
Library for processing screen contents using OCR7 versions - Latest release: about 1 year ago - 1 dependent package - 4 dependent repositories - 184 downloads last month - 30 stars on GitHub - 1 maintainer
tesserhocr2df 0.10
tesseract hocr to pandas DataFrame1 version - Latest release: 2 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
tesserparsing 0.10
Image Processing and Text Extraction with Tesseract - multiprocessing1 version - Latest release: 7 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.3 versions - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
winrtocr 0.10
Multiprocessing library for OCR with WinRT1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
form-tools 0.1.6
Tesseract Open Source OCR Engine (main repository)5 versions - Latest release: 2 months ago - 1 dependent repositories - 295 downloads last month - 57,199 stars on GitHub - 1 maintainer
tesseract-sdk 0.8.4
Python SDK for Tesseract Models17 versions - Latest release: 7 months ago - 132 downloads last month - 2 maintainers
a-pandas-ex-tesseract-multirow-regex-fuzz 0.11
Regex/Fuzz search across multiple rows/Tesseract to pandas.DataFrame2 versions - Latest release: over 1 year ago - 2 dependent packages - 181 downloads last month - 0 stars on GitHub - 1 maintainer
python-ocr 0.1.5
Input Adaptor to verify file extension6 versions - Latest release: almost 2 years ago - 87 downloads last month - 1 maintainer
ubii-processing-module-ocr 0.2.0
"Ubi Interact Processing Module to perform OCR tasks via Tesseract"9 versions - Latest release: almost 2 years ago - 1 dependent repositories - 118 downloads last month - 0 stars on GitHub - 1 maintainer
tesseracttrainer 0.1.1
A small framework taking over the manual tesseract training process described in the Tesseract Wiki2 versions - Latest release: over 11 years ago - 1 dependent repositories - 42 downloads last month - 131 stars on GitHub - 1 maintainer
tesseract-python 3.5.1
Self-contained Python module to Tesseract.1 version - Latest release: about 6 years ago - 1 dependent repositories - 24 downloads last month - 2 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes1 version - Latest release: about 3 years ago - 1 dependent repositories - 13 downloads last month - 3 stars on GitHub - 1 maintainer
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.1 version - Latest release: about 3 years ago - 1 dependent repositories - 170 downloads last month - 1 stars on GitHub - 1 maintainer
pysseract 1.3.1
Python binding to Tesseract API16 versions - Latest release: over 4 years ago - 1 dependent repositories - 945 downloads last month - 0 stars on GitHub - 1 maintainer
pyslibtesseract 0.0.15
Integration of Tesseract for Python using a shared library12 versions - Latest release: about 8 years ago - 2 dependent repositories - 68 downloads last month - 11 stars on GitHub - 1 maintainer
polybiblioglot 0.2.0 💰
A tool to translate scanned books2 versions - Latest release: about 3 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 1 maintainer
northern-lights-forecast 4.1.4
A simple web scraping northern lights forecast that automatically send a telegram notification du...14 versions - Latest release: almost 2 years ago - 1 dependent repositories - 7 downloads last month - 1 stars on GitHub - 1 maintainer
nlpknowledge 0.0.2
Package to make sense of images with text information9 versions - Latest release: over 4 years ago - 1 dependent repositories - 40 downloads last month - 5,891 stars on GitHub - 1 maintainer
mc-pdf2txt 0.3.0 💰
Multi-column PDF to Text3 versions - Latest release: about 1 year ago - 1 dependent repositories - 25 downloads last month - 5 stars on GitHub - 1 maintainer
hocr-utils 0.0.3
Package containing utility function for hOCR and tesseract2 versions - Latest release: about 1 year ago - 1 dependent repositories - 34 downloads last month - 2 stars on GitHub - 1 maintainer
djtesseract 0.0.6
A small app providing a tesseract field for django 3.1.24 versions - Latest release: over 3 years ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 1 maintainer
cropyble 1.2.0
Cropyble is a module that allows a user to easily perform crops on an image containing recognizab...4 versions - Latest release: over 4 years ago - 1 dependent repositories - 30 downloads last month - 0 stars on GitHub - 1 maintainer
tesseract-positional 0.1.2
Tool to save positional OCR data to a text file3 versions - Latest release: 10 months ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 1 maintainer
tesstrain 0.1.4
Training utils for Tesseract5 versions - Latest release: 3 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
readmrz 0.0.2
Machine readable zone reader on ID cards2 versions - Latest release: over 1 year ago - 1 dependent repositories - 892 downloads last month - 11 stars on GitHub - 1 maintainer
autoocr 0.0.3
A Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)1 version - Latest release: about 5 years ago - 1 dependent repositories - 32 downloads last month - 12 stars on GitHub - 1 maintainer
aiopytesseract 0.14.0 💰
asyncio tesseract wrapper for Tesseract-OCR15 versions - Latest release: 4 months ago - 1 dependent repositories - 468 downloads last month - 15 stars on GitHub - 1 maintainer
Related Keywords
ocr
49
python
24
tesseract-ocr
19
pdf
12
hacktoberfest
9
pytesseract
9
opencv
9
OCR
9
ocr-engine
8
machine-learning
8
lstm
8
data-science
5
image-processing
4
fast
4
easyocr
4
optical character recognition
4
python3
4
xps
4
text-shaping
4
text-processing
4
table-extraction
4
pymupdf
4
pdf-documents
4
optical-character-recognition
4
mupdf
4
font
4
extract-data
4
epub
4
hocr
3
python-3
3
django
3
pdftotext
3
screenshot
3
Tesseract
3
mrz-scanner
2
bot
2
cross-platform
2
tagui
2
field
2
sikuli
2
rpa
2
multiprocessing
2
automation
2
admin
2
pytesseract-ocr
2
hwnd
2
handle
2
ocr-recognition
2
opencv-python
2
scanning
2
vision
2
PIL
2
Pillow
2
Cython
2
cython
2
python-library
2
character-recognition
2
training
2
adb
2
windows
2
dataframe
2
orientation-detection
2
pillow
2
text-extraction
2
search
2
pandas
2
seerai
1
analysis
1
geodesic
1
data
1
science
1
series
1
fuzz
1
regex
1
imgaeprocessing
1
ubi-interact
1
ubii
1
optical character recogniton
1
set-top box
1
test automation
1
LIRC
1
sane
1
adf
1
duplex-scanning
1
pdf-generation
1
grouping
1
coordinates
1
files
1
multiple
1
google
1
position
1
parsing
1
multilingual
1
parser
1
document
1
Multiprocessing
1
WinRT
1
winrt
1
gstreamer
1
crontab
1