Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata for many open source software ecosystems and registries.

pypi.org "tesseract" keyword

nkocr 2.3.0 💰
This is a module to perform specific OCR tasks on food products and nutritional tables.
14 versions - Latest release: over 1 year ago - 1 dependent repository - 70 downloads last month - 33 stars on GitHub - 2 maintainers
chronotva 1.0.1
ChronoTVA (The Chronomancer's Tesseract Visualization Aid) is a Python 3.9+ command-line tool des...
1 version - Latest release: 6 months ago - 16 downloads last month - 1 star on GitHub - 1 maintainer
verifytweet 0.6.0
A tool to verify Tweet screenshots
2 versions - Latest release: over 4 years ago - 1 dependent repository - 28 downloads last month - 20 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
229 versions - Latest release: about 1 month ago - 10 dependent packages - 108 dependent repositories - 96.2 thousand downloads last month - 11,885 stars on GitHub - 1 maintainer
pdf-language-detector 0.0.11
A Python script to iterate over a list of PDFs in a directory and try to guess their language with...
12 versions - Latest release: 11 months ago - 70 downloads last month - 58,526 stars on GitHub - 2 maintainers
parsee-pdf-reader 0.1.3
Tesseract Open Source OCR Engine (main repository)
17 versions - Latest release: 3 months ago - 1 dependent package - 409 downloads last month - 58,526 stars on GitHub - 1 maintainer
extract-thinker 0.0.1
Library to extract data from files and documents agnostically using LLMs
1 version - Latest release: 7 days ago - 206 downloads last month - 58,526 stars on GitHub - 1 maintainer
targimo 0.0.1 removed
Targimo: An Artificial Intelligence Model that Revolutionizes Sentiment Analysis
1 version - Latest release: 7 months ago - 144 downloads last month - 54,235 stars on GitHub - 1 maintainer
usseg 0.7.1
Tools to segment doppler ultrasound signals from scan images.
9 versions - Latest release: 6 months ago - 61 downloads last month - 58,453 stars on GitHub - 2 maintainers
filecabinet 2.1.0
A local, offline document archive
3 versions - Latest release: 11 months ago - 33 downloads last month - 58,508 stars on GitHub - 1 maintainer
tesseract-window-scanner 0.12
OCR on screenshots with tesseract - Windows only
3 versions - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
django-tesseractfield 0.0.2
A small app providing a tesseract field for django
2 versions - Latest release: about 5 years ago - 1 dependent repository - 19 downloads last month - 1 star on GitHub - 1 maintainer
Top 4.3% on pypi.org
tagui 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)
88 versions - Latest release: 11 months ago - 22 dependent repositories - 3.15 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
Top 4.6% on pypi.org
rpa 1.50.0
RPA for Python is a Python package for RPA (robotic process automation)
48 versions - Latest release: 11 months ago - 1 dependent package - 10 dependent repositories - 2.94 thousand downloads last month - 4,279 stars on GitHub - 2 maintainers
pytessy 0.1.0
Tesseract-OCR, faster
1 version - Latest release: about 4 years ago - 1 dependent repository - 261 downloads last month - 11 stars on GitHub - 1 maintainer
tesseractrapidfuzz 0.10
Performs OCR on a list of images using Tesseract and performs fuzzy string matching with a given ...
1 version - Latest release: 8 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
tesserwrap 0.1.6
Basic python bindings to the Tesseract C++ API
11 versions - Latest release: over 9 years ago - 5 dependent repositories - 88 downloads last month - 66 stars on GitHub - 1 maintainer
betterocr 1.2.0 💰
Better text detection by combining OCR engines with LLM.
6 versions - Latest release: 7 months ago - 95 downloads last month - 397 stars on GitHub - 1 maintainer
fastmrz 1.1
Extracts the Machine Readable Zone (MRZ) data from document images
2 versions - Latest release: 9 days ago - 193 downloads last month - 6 stars on GitHub - 1 maintainer
easyocr-window-scanner 0.10
OCR on screenshots with EasyOCR - Windows only
1 version - Latest release: over 1 year ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
adf2pdf 0.8.3
Automate the workflow around ADF scanning, OCR and PDF creation
4 versions - Latest release: 9 months ago - 1 dependent repository - 33 downloads last month - 5 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
115 versions - Latest release: 5 days ago - 206 dependent packages - 1,798 dependent repositories - 2.81 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pdftoprompt 0.1.2
Python library to abbreviate a PDF file to GPT 8k prompt length
3 versions - Latest release: about 1 year ago - 37 downloads last month - 58,453 stars on GitHub - 1 maintainer
multitessiocr 0.13
Performs a very fast OCR on a list of images (file path, url, base64, bytes, numpy, PIL ...) usin...
4 versions - Latest release: 6 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
screen-ocr 0.5.0
Library for processing screen contents using OCR
7 versions - Latest release: about 1 year ago - 1 dependent package - 4 dependent repositories - 184 downloads last month - 30 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.
14 versions - Latest release: 12 days ago - 4 dependent packages - 133 dependent repositories - 1.85 million downloads last month - 4,025 stars on GitHub - 1 maintainer
pdfautonup 1.9.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.
21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 194 downloads last month - 4,025 stars on GitHub - 1 maintainer
ocrd-fork-tesserocr 3.0.0rc2
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
2 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repository - 18 downloads last month - 1,936 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
tesserocr 2.7.0
A simple, Pillow-friendly, Python wrapper around tesseract-ocr API using Cython
20 versions - Latest release: 24 days ago - 9 dependent packages - 201 dependent repositories - 65.7 thousand downloads last month - 1,936 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
1 version - Latest release: about 1 month ago - 218 downloads last month - 4,025 stars on GitHub - 1 maintainer
tesserhocr2df 0.10
tesseract hocr to pandas DataFrame
1 version - Latest release: about 2 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
tesserparsing 0.10
Image Processing and Text Extraction with Tesseract - multiprocessing
1 version - Latest release: 6 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
samagra-docparser 0.1.2
Document Parser built to extract information from pdfs.
3 versions - Latest release: 9 months ago - 9 downloads last month - 1 maintainer
ocr-with-format 0.9
Wrapper to pytesseract to preserve space and formatting
8 versions - Latest release: 10 months ago - 78 downloads last month - 0 stars on GitHub - 1 maintainer
winrtocr 0.10
Multiprocessing library for OCR with WinRT
1 version - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
tesseractmultiprocessing 0.10
Multiprocessing OCR with Tesseract
1 version - Latest release: about 1 year ago - 1 dependent package - 136 downloads last month - 0 stars on GitHub - 1 maintainer
form-tools 0.1.6
Tesseract Open Source OCR Engine (main repository)
5 versions - Latest release: 2 months ago - 1 dependent repository - 295 downloads last month - 57,199 stars on GitHub - 1 maintainer
tesseract-sdk 0.8.4
Python SDK for Tesseract Models
17 versions - Latest release: 7 months ago - 132 downloads last month - 2 maintainers
imagetocsv 1.0.0
Converts An Image to a CSV. This exists because Chorus 3.0 are bat-shit and only show images for ...
4 versions - Latest release: 11 months ago - 26 downloads last month - 4 stars on GitHub - 1 maintainer
a-pandas-ex-tesseract-multirow-regex-fuzz 0.11
Regex/Fuzz search across multiple rows/Tesseract to pandas.DataFrame
2 versions - Latest release: over 1 year ago - 2 dependent packages - 181 downloads last month - 0 stars on GitHub - 1 maintainer
python-ocr 0.1.5
Input Adaptor to verify file extension
6 versions - Latest release: almost 2 years ago - 87 downloads last month - 1 maintainer
wagtail-textract 1.2
Allow searching for text in Documents in the Wagtail content management system
8 versions - Latest release: over 4 years ago - 1 dependent repository - 52 downloads last month - 31 stars on GitHub - 2 maintainers
ubii-processing-module-ocr 0.2.0
"Ubi Interact Processing Module to perform OCR tasks via Tesseract"
9 versions - Latest release: almost 2 years ago - 1 dependent repository - 118 downloads last month - 0 stars on GitHub - 1 maintainer
tightocr 0.4.4
Thin and pleasant wrapper for Tesseract OCR.
5 versions - Latest release: about 10 years ago - 2 dependent repositories - 24 downloads last month - 24 stars on GitHub - 1 maintainer
tessy 0.5.2
A Python wrapper for Tesseract-OCR.
2 versions - Latest release: 2 months ago - 1 dependent repository - 23 downloads last month - 0 stars on GitHub - 1 maintainer
tesserpy 1.1.2
Python interface to the Tesseract library
3 versions - Latest release: over 9 years ago - 2 dependent repositories - 21 downloads last month - 20 stars on GitHub - 2 maintainers
tesseracttrainer 0.1.1
A small framework taking over the manual tesseract training process described in the Tesseract Wiki
2 versions - Latest release: over 11 years ago - 1 dependent repository - 42 downloads last month - 131 stars on GitHub - 1 maintainer
tesseract-python 3.5.1
Self-contained Python module to Tesseract.
1 version - Latest release: almost 6 years ago - 1 dependent repository - 24 downloads last month - 2 stars on GitHub - 1 maintainer
stb-automator 0.1.0
A library for automated control & testing of set-top boxes
1 version - Latest release: about 3 years ago - 1 dependent repository - 13 downloads last month - 3 stars on GitHub - 1 maintainer
saram 1.0.2
A library to fetch images from a directory and get OCR and store in txt with orientation rotation...
9 versions - Latest release: about 6 years ago - 1 dependent repository - 43 downloads last month - 50 stars on GitHub - 1 maintainer
pytesseract-cli 1.2.0
A pytesseract wrapper enabling OCR on images and directories.
1 version - Latest release: about 3 years ago - 1 dependent repository - 170 downloads last month - 1 star on GitHub - 1 maintainer
pysseract 1.3.1
Python binding to Tesseract API
16 versions - Latest release: over 4 years ago - 1 dependent repository - 945 downloads last month - 0 stars on GitHub - 1 maintainer
pyslibtesseract 0.0.15
Integration of Tesseract for Python using a shared library
12 versions - Latest release: about 8 years ago - 2 dependent repositories - 68 downloads last month - 11 stars on GitHub - 1 maintainer
polybiblioglot 0.2.0 💰
A tool to translate scanned books
2 versions - Latest release: about 3 years ago - 1 dependent repository - 16 downloads last month - 3 stars on GitHub - 1 maintainer
pmworker 1.2.0 💰
Papermerge worker - extract OCR text documents
4 versions - Latest release: about 4 years ago - 1 dependent repository - 25 downloads last month - 3 stars on GitHub - 1 maintainer
pdf2dataset 0.5.3
Easily convert a subdirectory with a big volume of PDF documents into a dataset, supports extractin...
15 versions - Latest release: over 3 years ago - 1 dependent repository - 158 downloads last month - 17 stars on GitHub - 1 maintainer
ocyara 1.0.1
A Yara rule engine that scans images for matches using Optical Character Recognition (OCR). See t...
5 versions - Latest release: over 7 years ago - 1 dependent repository - 23 downloads last month - 38 stars on GitHub - 2 maintainers
northern-lights-forecast 4.1.4
A simple web scraping northern lights forecast that automatically sends a Telegram notification du...
14 versions - Latest release: over 1 year ago - 1 dependent repository - 7 downloads last month - 1 star on GitHub - 1 maintainer
nlpknowledge 0.0.2
Package to make sense of images with text information
9 versions - Latest release: over 4 years ago - 1 dependent repository - 40 downloads last month - 5,891 stars on GitHub - 1 maintainer
motionpdf 0.0.1
A script built on Tesseract-OCR for converting .pdf to .txt
1 version - Latest release: about 2 years ago - 1 dependent repository - 8 downloads last month - 0 stars on GitHub - 1 maintainer
mementor 1.0.4
A library to fetch images from directory to fix orientation and pull OCR from the images along wi...
5 versions - Latest release: about 6 years ago - 1 dependent repository - 26 downloads last month - 72 stars on GitHub - 1 maintainer
mc-pdf2txt 0.3.0 💰
Multi-column PDF to Text
3 versions - Latest release: about 1 year ago - 1 dependent repository - 25 downloads last month - 5 stars on GitHub - 1 maintainer
hocr-utils 0.0.3
Package containing utility function for hOCR and tesseract
2 versions - Latest release: about 1 year ago - 1 dependent repository - 34 downloads last month - 2 stars on GitHub - 1 maintainer
gpyocr 1.6
Python wrapper for Tesseract OCR and Google Vision OCR
11 versions - Latest release: almost 2 years ago - 1 dependent repository - 106 downloads last month - 11 stars on GitHub - 1 maintainer
djtesseract 0.0.6
A small app providing a tesseract field for django 3.1.2
4 versions - Latest release: over 3 years ago - 1 dependent repository - 36 downloads last month - 0 stars on GitHub - 1 maintainer
cropyble 1.2.0
Cropyble is a module that allows a user to easily perform crops on an image containing recognizab...
4 versions - Latest release: over 4 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 1 maintainer
tesseract-positional 0.1.2
Tool to save positional OCR data to a text file
3 versions - Latest release: 10 months ago - 1 dependent repository - 36 downloads last month - 0 stars on GitHub - 1 maintainer
tesstrain 0.1.4
Training utils for Tesseract
5 versions - Latest release: 3 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
readmrz 0.0.2
Machine readable zone reader on ID cards
2 versions - Latest release: over 1 year ago - 1 dependent repository - 892 downloads last month - 11 stars on GitHub - 1 maintainer
autoocr 0.0.3
A Python wrapper for cross platform tesseract OCR engine with multiple languages (e.g. Bangla)
1 version - Latest release: about 5 years ago - 1 dependent repository - 32 downloads last month - 12 stars on GitHub - 1 maintainer
aiopytesseract 0.14.0 💰
asyncio tesseract wrapper for Tesseract-OCR
15 versions - Latest release: 4 months ago - 1 dependent repository - 468 downloads last month - 15 stars on GitHub - 1 maintainer
textshot 0.1.1
Python tool for grabbing text via screenshot
2 versions - Latest release: over 1 year ago - 76 downloads last month - 1,670 stars on GitHub - 1 maintainer
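
Each entry in this listing ends with a summary line whose fields are separated by " - ". For readers who want to work with these figures programmatically, the following is a minimal sketch of how such a line might be parsed into a dictionary. The function name `parse_stats` and the dictionary keys are hypothetical and chosen for illustration; they are not part of any Ecosyste.ms API. The sketch assumes counts are written as plain or comma-grouped numbers, optionally followed by "thousand" or "million", and it accepts both singular and plural unit words.

```python
import re

# Hypothetical field names mapped to the wording used in the listing lines.
# Each pattern captures the number (and optional scale word) before the label.
_NUMBER = r"(\d[\d,.]*(?: (?:thousand|million))?)"
_FIELDS = {
    "versions": _NUMBER + r" versions?",
    "dependent_packages": _NUMBER + r" dependent packages?",
    "dependent_repositories": _NUMBER + r" dependent repositor(?:y|ies)",
    "downloads_last_month": _NUMBER + r" downloads last month",
    "stars": _NUMBER + r" stars? on GitHub",
    "maintainers": _NUMBER + r" maintainers?",
}
_SCALE = {"thousand": 1_000, "million": 1_000_000}


def _to_int(text):
    """Convert '11,885' or '96.2 thousand' to an integer."""
    number, _, unit = text.partition(" ")
    value = float(number.replace(",", ""))
    return round(value * _SCALE.get(unit, 1))


def parse_stats(line):
    """Parse one summary line of the listing into a dict (illustrative only)."""
    stats = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            # Keep the relative age as text; the listing gives no exact date.
            stats["latest_release"] = part.split(":", 1)[1].strip()
            continue
        for key, pattern in _FIELDS.items():
            match = re.fullmatch(pattern, part)
            if match:
                stats[key] = _to_int(match.group(1))
                break
    return stats
```

Fields that a line omits (for example, entries without a star count) are simply absent from the returned dictionary, so callers can use `stats.get("stars")` rather than assuming every key exists.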