pypi.org "layout-analysis" keyword
View the packages on the pypi.org package registry that are tagged with the "layout-analysis" keyword.
filestruct 0.1.1
A python package to structure files using visual and style informations3 versions - Latest release: almost 2 years ago - 8 downloads last month - 1 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
11 versions - Latest release: over 3 years ago - 5 dependent packages - 77 dependent repositories - 189 thousand downloads last month - 4,869 stars on GitHub - 1 maintainer
layoutparser 0.3.4
A unified toolkit for Deep Learning Based Document Image Analysis11 versions - Latest release: over 3 years ago - 5 dependent packages - 77 dependent repositories - 189 thousand downloads last month - 4,869 stars on GitHub - 1 maintainer
Top 3.2% on pypi.org
102 versions - Latest release: about 1 month ago - 6 dependent packages - 24 dependent repositories - 20.5 thousand downloads last month - 884 stars on GitHub - 2 maintainers
kraken 6.0.2
OCR/HTR engine for all the languages102 versions - Latest release: about 1 month ago - 6 dependent packages - 24 dependent repositories - 20.5 thousand downloads last month - 884 stars on GitHub - 2 maintainers
pdf-layout-scanner 1.3.3
A more complete example of programming with PDFMiner, which continues where the default documenta...7 versions - Latest release: about 6 years ago - 1 dependent repositories - 323 downloads last month - 7 stars on GitHub - 1 maintainer
mineru 2.6.3
A practical tool for converting PDF to Markdown30 versions - Latest release: 2 days ago - 78.2 thousand downloads last month - 47,322 stars on GitHub - 1 maintainer
latex-toolkit 0.1
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。1 version - Latest release: 8 months ago - 18 downloads last month - 32,851 stars on GitHub - 1 maintainer
yomitoku 0.10.0
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese ...21 versions - Latest release: 2 months ago - 4.32 thousand downloads last month - 903 stars on GitHub - 1 maintainer
opensourcedot-mindocr 0.5.1
A toolbox of OCR models and algorithms based on MindSpore.2 versions - Latest release: 11 months ago - 6 downloads last month - 283 stars on GitHub - 1 maintainer
ocrd-gbn 1.0.0
Collection of OCR-D compliant tools for layout analysis and segmentation of historical german-lan...1 version - Latest release: about 5 years ago - 1 dependent repositories - 11 downloads last month - 11 stars on GitHub - 1 maintainer
pdfsegmenter 0.1
This library builds a graph-representation of the content of PDFs. The graph is then clustered, r...1 version - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 23 stars on GitHub - 1 maintainer
mindocr 0.5.0
A toolbox of OCR models and algorithms based on MindSpore.4 versions - Latest release: 7 months ago - 45 downloads last month - 283 stars on GitHub - 1 maintainer
xh-pdf-parser 1.3.1.2
A practical tool for converting PDF to Markdown5 versions - Latest release: 7 months ago - 21 downloads last month - 47,322 stars on GitHub - 1 maintainer
lazyllm-magic-pdf 0.9.0
A practical tool for converting PDF to Markdown1 version - Latest release: 8 months ago - 10 downloads last month - 47,322 stars on GitHub - 1 maintainer
magic-pdf 1.3.12
A practical tool for converting PDF to Markdown48 versions - Latest release: 5 months ago - 26.6 thousand downloads last month - 46,544 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
37 versions - Latest release: 3 months ago - 1 dependent repositories - 2.85 thousand downloads last month - 2,552 stars on GitHub - 1 maintainer
pix2text 1.1.4
An Open-Source Python3 tool for recognizing layouts, tables, math formulas, and text in images, c...37 versions - Latest release: 3 months ago - 1 dependent repositories - 2.85 thousand downloads last month - 2,552 stars on GitHub - 1 maintainer
quanta-pdf 1.0.3
Advanced PDF layout analysis engine for extracting figures, tables, and structured content3 versions - Latest release: 16 days ago - 244 downloads last month - 0 stars on GitHub - 1 maintainer
mseep-pdf2md 0.1.4
PDF to Markdown MCP服务器5 versions - Latest release: about 2 months ago - 136 downloads last month - 32,851 stars on GitHub - 1 maintainer
rapid-layout 1.0.1 💰
Tools for document layout analysis based ONNXRuntime.29 versions - Latest release: 3 months ago - 2.47 thousand downloads last month - 69 stars on GitHub - 1 maintainer
docling-enhanced-onnx 1.0.0
Enhanced Docling Models with ONNX Auto-Detection and Air-Gapped Support1 version - Latest release: about 2 months ago - 118 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
ocr
14
pdf
10
python
10
parser
7
ai4science
6
document-analysis
6
extract-data
6
pdf-converter
6
pdf-extractor-llm
6
pdf-extractor-pretrain
6
pdf-extractor-rag
6
pdf-parser
6
deep-learning
4
document-processing
3
mindspore
2
layoutxlm
2
key-information-extraction
2
dbnet
2
crnn
2
pytorch
2
OCR
2
ocr-large-model
2
table-recognition
2
tablemaster
2
text-detection
2
text-recognition
2
vary-toy
2
machine-learning
2
markdown
2
convert
2
MinerU
2
mineru
2
magic-pdf
2
text-extraction
2
computer-vision
2
table-detection
1
figure-extraction
1
table-ocr
1
mathpix
1
math-ocr
1
math-formula-recognition
1
math-formula
1
latex-pdf
1
latex
1
image-to-markdown
1
detection-model
1
table
1
csv
1
layout analysis
1
pdf-document
1
ppstructure
1
layout
1
rapidocr
1
rapid_layout
1
cdla
1
pp-structure
1
docling
1
onnx
1
table-extraction
1
air-gapped
1
ai
1
pdfminer
1
page-xml
1
Deep Learning
1
Japanese
1
optical-character-recognition
1
neural-networks
1
hocr
1
handwritten-text-recognition
1
alto-xml
1
htr
1
object-detection
1
layout-parser
1
layout-detection
1
document-layout-analysis
1
document-image-processing
1
detectron2
1
OCR-D
1
binarization
1
historical-documents
1
ocr-d
1
segmentation
1
tensorflow
1
deep learning
1
page-segmentation
1
cluster-analysis
1
annotations
1