pypi.org "document-image-processing" keyword
Top 1.5% on pypi.org
unstructured 0.18.24
A library that prepares raw documents for downstream ML tasks.
197 versions - Latest release: 2 months ago - 113 dependent packages - 3,374 dependent repositories - 3.2 million downloads last month - 13,122 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
layoutparser 0.3.4
A unified toolkit for Deep Learning Based Document Image Analysis
11 versions - Latest release: almost 4 years ago - 5 dependent packages - 77 dependent repositories - 185 thousand downloads last month - 4,869 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.
13 versions - Latest release: over 1 year ago - 245 downloads last month - 12,775 stars on GitHub - 1 maintainer
smartdoc15-ch1 0.8
A Python wrapper for the "computable" version of the SmartDoc 2015 - Challenge 1 dataset.
4 versions - Latest release: over 7 years ago - 1 dependent repository - 31 downloads last month - 7 stars on GitHub - 1 maintainer
Related Keywords
ocr (3)
deep-learning (3)
computer-vision (2)
pdf-to-text (2)
pdf-to-json (2)
pdf (2)
nlp (2)
natural-language-processing (2)
ml (2)
machine-learning (2)
llm (2)
langchain (2)
information-retrieval (2)
NLP (2)
PDF (2)
HTML (2)
CV (2)
XML (2)
parsing (2)
preprocessing (2)
data-pipelines (2)
document-image-analysis (2)
document-parser (2)
document-parsing (2)
docx (2)
donut (2)
datasets (1)
computer_vision (1)
image_processing (1)
wrapper (1)
dataset (1)
object-detection (1)
layout-parser (1)
layout-detection (1)
layout-analysis (1)
document-layout-analysis (1)
detectron2 (1)
deep learning (1)
layout analysis (1)