Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "tesseract" keyword
nkocr 2.3.0 💰
This is a module to make specifics OCRs at food products and nutricional tables.14 versions - Latest release: over 1 year ago - 1 dependent repositories - 70 downloads last month - 33 stars on GitHub - 2 maintainers
chronotva 1.0.1
ChronoTVA (The Chronomancer's Tesseract Visualization Aid) is a Python 3.9+ command-line tool des...1 version - Latest release: 6 months ago - 16 downloads last month - 1 stars on GitHub - 1 maintainer
verifytweet 0.6.0
A tool to verify Tweet screenshots2 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 20 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
229 versions - Latest release: about 1 month ago - 10 dependent packages - 108 dependent repositories - 96.2 thousand downloads last month - 11,885 stars on GitHub - 1 maintainer
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched229 versions - Latest release: about 1 month ago - 10 dependent packages - 108 dependent repositories - 96.2 thousand downloads last month - 11,885 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A python script to iterate over a list of PDF in a directory and try to guess their language with...12 versions - Latest release: 11 months ago - 70 downloads last month - 58,526 stars on GitHub - 2 maintainers
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)17 versions - Latest release: 3 months ago - 1 dependent package - 409 downloads last month - 58,526 stars on GitHub - 1 maintainer
extract-thinker 0.0.1
Library to extract data from files and documents agnositicaly using LLMs1 version - Latest release: 7 days ago - 206 downloads last month - 58,526 stars on GitHub - 1 maintainer
targimo 0.0.1 removed
Targimo: An Artifical Intelligence Model that Revolutionizes Sentiment Analysis1 version - Latest release: 7 months ago - 144 downloads last month - 54,235 stars on GitHub - 1 maintainer
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.9 versions - Latest release: 6 months ago - 61 downloads last month - 58,453 stars on GitHub - 2 maintainers
filecabinet 2.1.0
A local, offline document archive3 versions - Latest release: 11 months ago - 33 downloads last month - 58,508 stars on GitHub - 1 maintainer
tesseract-window-scanner 0.12
OCR on screenshots with tesseract - Windows only3 versions - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
django-tesseractfield 0.0.2
A small app providing a tesseract field for django2 versions - Latest release: about 5 years ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.15 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
tagui 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.15 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
Top 4.6% on pypi.org
48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.94 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
rpa 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.94 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
pytessy 0.1.0
Tesseract-OCR, faster1 version - Latest release: about 4 years ago - 1 dependent repositories - 261 downloads last month - 11 stars on GitHub - 1 maintainer
tesseractrapidfuzz 0.10
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given ...1 version - Latest release: 8 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
tesserwrap 0.1.6
Basic python bindings to the Tesseract C++ API11 versions - Latest release: over 9 years ago - 5 dependent repositories - 88 downloads last month - 66 stars on GitHub - 1 maintainer
betterocr 1.2.0 💰
Better text detection by combining OCR engines with LLM.6 versions - Latest release: 7 months ago - 95 downloads last month - 397 stars on GitHub - 1 maintainer
fastmrz 1.1
Extracts the Machine Readable Zone (MRZ) data from document images2 versions - Latest release: 9 days ago - 193 downloads last month - 6 stars on GitHub - 1 maintainer
easyocr-window-scanner 0.10
OCR on screenshots with EasyOCR - Windows only1 version - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
adf2pdf 0.8.3
Automate the workflow around ADF scanning, OCR and PDF creation4 versions - Latest release: 9 months ago - 1 dependent repositories - 33 downloads last month - 5 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
115 versions - Latest release: 5 days ago - 206 dependent packages - 1,798 dependent repositories - 2.81 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...115 versions - Latest release: 5 days ago - 206 dependent packages - 1,798 dependent repositories - 2.81 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pdftoprompt 0.1.2
Python library to abbreviate a PDF file to GPT 8k prompt length3 versions - Latest release: about 1 year ago - 37 downloads last month - 58,453 stars on GitHub - 1 maintainer
multitessiocr 0.13
Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) usin...4 versions - Latest release: 6 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
screen-ocr 0.5.0
Library for processing screen contents using OCR7 versions - Latest release: about 1 year ago - 1 dependent package - 4 dependent repositories - 184 downloads last month - 30 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
14 versions - Latest release: 12 days ago - 4 dependent packages - 133 dependent repositories - 1.85 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.14 versions - Latest release: 12 days ago - 4 dependent packages - 133 dependent repositories - 1.85 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pdfautonup 1.9.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 194 downloads last month - 4,025 stars on GitHub - 1 maintainer
ocrd-fork-tesserocr 3.0.0rc2
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython2 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 18 downloads last month - 1,936 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
20 versions - Latest release: 24 days ago - 9 dependent packages - 201 dependent repositories - 65.7 thousand downloads last month - 1,936 stars on GitHub - 1 maintainer
tesserocr 2.7.0
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython20 versions - Latest release: 24 days ago - 9 dependent packages - 201 dependent repositories - 65.7 thousand downloads last month - 1,936 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...1 version - Latest release: about 1 month ago - 218 downloads last month - 4,025 stars on GitHub - 1 maintainer
tesserhocr2df 0.10
tesseract hocr to pandas DataFrame1 version - Latest release: about 2 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
tesserparsing 0.10
Image Processing and Text Extraction with Tesseract - multiprocessing1 version - Latest release: 6 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.3 versions - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
ocr-with-format 0.9
Wrapper to pytesseract to preserve space and formatting8 versions - Latest release: 10 months ago - 78 downloads last month - 0 stars on GitHub - 1 maintainer
winrtocr 0.10
Multiprocessing library for OCR with WinRT1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
tesseractmultiprocessing 0.10
Multiprocessing OCR with Tesseract1 version - Latest release: about 1 year ago - 1 dependent package - 136 downloads last month - 0 stars on GitHub - 1 maintainer
form-tools 0.1.6
Tesseract Open Source OCR Engine (main repository)5 versions - Latest release: 2 months ago - 1 dependent repositories - 295 downloads last month - 57,199 stars on GitHub - 1 maintainer
tesseract-sdk 0.8.4
Python SDK for Tesseract Models17 versions - Latest release: 7 months ago - 132 downloads last month - 2 maintainers
imagetocsv 1.0.0
Converts An Image to a CSV. This exists because Chorus 3.0 are bat-shit and only show images for ...4 versions - Latest release: 11 months ago - 26 downloads last month - 4 stars on GitHub - 1 maintainer
a-pandas-ex-tesseract-multirow-regex-fuzz 0.11
Regex/Fuzz search across multiple rows/Tesseract to pandas.DataFrame2 versions - Latest release: over 1 year ago - 2 dependent packages - 181 downloads last month - 0 stars on GitHub - 1 maintainer
python-ocr 0.1.5
Input Adaptor to verify file extension6 versions - Latest release: almost 2 years ago - 87 downloads last month - 1 maintainer
wagtail-textract 1.2
Allow searching for text in Documents in the Wagtail content management system8 versions - Latest release: over 4 years ago - 1 dependent repositories - 52 downloads last month - 31 stars on GitHub - 2 maintainers
ubii-processing-module-ocr 0.2.0
"Ubi Interact Processing Module to perform OCR tasks via Tesseract"9 versions - Latest release: almost 2 years ago - 1 dependent repositories - 118 downloads last month - 0 stars on GitHub - 1 maintainer
tightocr 0.4.4
Thin and pleasant wrapper for Tesseract OCR.5 versions - Latest release: about 10 years ago - 2 dependent repositories - 24 downloads last month - 24 stars on GitHub - 1 maintainer
tessy 0.5.2
A Python wrapper for Tesseract-OCR.2 versions - Latest release: 2 months ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
tesserpy 1.1.2
Python interface to the Tesseract library3 versions - Latest release: over 9 years ago - 2 dependent repositories - 21 downloads last month - 20 stars on GitHub - 2 maintainers
tesseracttrainer 0.1.1
A small framework taking over the manual tesseract training process described in the Tesseract Wiki2 versions - Latest release: over 11 years ago - 1 dependent repositories - 42 downloads last month - 131 stars on GitHub - 1 maintainer
tesseract-python 3.5.1
Self-contained Python module to Tesseract.1 version - Latest release: almost 6 years ago - 1 dependent repositories - 24 downloads last month - 2 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes1 version - Latest release: about 3 years ago - 1 dependent repositories - 13 downloads last month - 3 stars on GitHub - 1 maintainer
saram 1.0.2
A library to fetch images from a directory and get OCR and store in txt with orientation rotation...9 versions - Latest release: about 6 years ago - 1 dependent repositories - 43 downloads last month - 50 stars on GitHub - 1 maintainer
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.1 version - Latest release: about 3 years ago - 1 dependent repositories - 170 downloads last month - 1 stars on GitHub - 1 maintainer
pysseract 1.3.1
Python binding to Tesseract API16 versions - Latest release: over 4 years ago - 1 dependent repositories - 945 downloads last month - 0 stars on GitHub - 1 maintainer
pyslibtesseract 0.0.15
Integration of Tesseract for Python using a shared library12 versions - Latest release: about 8 years ago - 2 dependent repositories - 68 downloads last month - 11 stars on GitHub - 1 maintainer
polybiblioglot 0.2.0 💰
A tool to translate scanned books2 versions - Latest release: about 3 years ago - 1 dependent repositories - 16 downloads last month - 3 stars on GitHub - 1 maintainer
pmworker 1.2.0 💰
Papermerge worker - extract OCR text documents4 versions - Latest release: about 4 years ago - 1 dependent repositories - 25 downloads last month - 3 stars on GitHub - 1 maintainer
pdf2dataset 0.5.3
Easily convert a subdirectory with big volume of PDF documents into a dataset, supports extractin...15 versions - Latest release: over 3 years ago - 1 dependent repositories - 158 downloads last month - 17 stars on GitHub - 1 maintainer
ocyara 1.0.1
A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See t...5 versions - Latest release: over 7 years ago - 1 dependent repositories - 23 downloads last month - 38 stars on GitHub - 2 maintainers
northern-lights-forecast 4.1.4
A simple web scraping northern lights forecast that automatically send a telegram notification du...14 versions - Latest release: over 1 year ago - 1 dependent repositories - 7 downloads last month - 1 stars on GitHub - 1 maintainer
nlpknowledge 0.0.2
Package to make sense of images with text information9 versions - Latest release: over 4 years ago - 1 dependent repositories - 40 downloads last month - 5,891 stars on GitHub - 1 maintainer
motionpdf 0.0.1
A script built on Tesseract-OCR for converting .pdf to .txt1 version - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
mementor 1.0.4
A library to fetch images from directory to fix orientation and pull OCR from the images along wi...5 versions - Latest release: about 6 years ago - 1 dependent repositories - 26 downloads last month - 72 stars on GitHub - 1 maintainer
mc-pdf2txt 0.3.0 💰
Multi-column PDF to Text3 versions - Latest release: about 1 year ago - 1 dependent repositories - 25 downloads last month - 5 stars on GitHub - 1 maintainer
hocr-utils 0.0.3
Package containing utility function for hOCR and tesseract2 versions - Latest release: about 1 year ago - 1 dependent repositories - 34 downloads last month - 2 stars on GitHub - 1 maintainer
gpyocr 1.6
Python wrapper for Tesseract OCR and Google Vision OCR11 versions - Latest release: almost 2 years ago - 1 dependent repositories - 106 downloads last month - 11 stars on GitHub - 1 maintainer
djtesseract 0.0.6
A small app providing a tesseract field for django 3.1.24 versions - Latest release: over 3 years ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 1 maintainer
cropyble 1.2.0
Cropyble is a module that allows a user to easily perform crops on an image containing recognizab...4 versions - Latest release: over 4 years ago - 1 dependent repositories - 30 downloads last month - 0 stars on GitHub - 1 maintainer
tesseract-positional 0.1.2
Tool to save positional OCR data to a text file3 versions - Latest release: 10 months ago - 1 dependent repositories - 36 downloads last month - 0 stars on GitHub - 1 maintainer
tesstrain 0.1.4
Training utils for Tesseract5 versions - Latest release: 3 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
readmrz 0.0.2
Machine readable zone reader on ID cards2 versions - Latest release: over 1 year ago - 1 dependent repositories - 892 downloads last month - 11 stars on GitHub - 1 maintainer
autoocr 0.0.3
A Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)1 version - Latest release: about 5 years ago - 1 dependent repositories - 32 downloads last month - 12 stars on GitHub - 1 maintainer
aiopytesseract 0.14.0 💰
asyncio tesseract wrapper for Tesseract-OCR15 versions - Latest release: 4 months ago - 1 dependent repositories - 468 downloads last month - 15 stars on GitHub - 1 maintainer
textshot 0.1.1
Python tool for grabbing text via screenshot2 versions - Latest release: over 1 year ago - 76 downloads last month - 1,670 stars on GitHub - 1 maintainer
Related Keywords
ocr
49
python
24
tesseract-ocr
19
pdf
12
OCR
9
pytesseract
9
opencv
9
hacktoberfest
9
lstm
8
machine-learning
8
ocr-engine
8
data-science
5
optical character recognition
4
python3
4
fast
4
easyocr
4
epub
4
extract-data
4
font
4
mupdf
4
pdf-documents
4
pymupdf
4
table-extraction
4
optical-character-recognition
4
text-processing
4
text-shaping
4
image-processing
4
xps
4
hocr
3
django
3
screenshot
3
Tesseract
3
pdftotext
3
python-3
3
mrz-scanner
2
field
2
admin
2
cross-platform
2
rpa
2
sikuli
2
tagui
2
pytesseract-ocr
2
pandas
2
ocr-recognition
2
character-recognition
2
python-library
2
PIL
2
Pillow
2
Cython
2
cython
2
vision
2
text-extraction
2
multiprocessing
2
opencv-python
2
pillow
2
scanning
2
dataframe
2
automation
2
search
2
orientation-detection
2
training
2
adb
2
windows
2
hwnd
2
handle
2
bot
2
cli
1
ai
1
pybind11
1
python-tesseract
1
Python
1
translation
1
documentation
1
tutorial
1
distributed-computing
1
image-classification
1
wagtail
1
ubi-interact
1
ubii
1
ctesseract
1
optical character recogniton
1
set-top box
1
test automation
1
LIRC
1
gstreamer
1
linux
1
lirc
1
set-top-box
1
stb
1
stb-automator
1
test-automation
1
testing
1
testing-library
1
chmod
1
image
1
pyocr
1
wand
1
meme
1
numpy
1
cli-tool
1