pypi.org "pdf-to-json" keyword
View the packages on the pypi.org package registry that are tagged with the "pdf-to-json" keyword.
docling-enhanced 2.32.0
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
1 version - Latest release: 4 months ago - 25 downloads last month - 37,086 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
unstructured 0.18.14
A library that prepares raw documents for downstream ML tasks.
192 versions - Latest release: 9 days ago - 113 dependent packages - 3,374 dependent repositories - 3.47 million downloads last month - 12,544 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.
13 versions - Latest release: about 1 year ago - 104 downloads last month - 12,544 stars on GitHub - 1 maintainer
llama-index-readers-llama-parse 0.5.0
llama-index readers llama-parse integration
9 versions - Latest release: about 1 month ago - 6 dependent packages - 2.22 million downloads last month - 3,956 stars on GitHub - 1 maintainer
llama-cloud-services 0.6.63
Tailored SDK clients for LlamaCloud services.
62 versions - Latest release: 1 day ago - 9.48 million downloads last month - 3,956 stars on GitHub - 1 maintainer
docstrange 1.1.5
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, J...
16 versions - Latest release: 2 days ago - 1.93 thousand downloads last month - 493 stars on GitHub - 1 maintainer
llama-index-readers-docling 0.4.0
llama-index readers docling integration
7 versions - Latest release: about 1 month ago - 14.2 thousand downloads last month - 27,013 stars on GitHub - 1 maintainer
llm-parse 0.1.5
Parse data from documents optimised for downstream llm tasks.
6 versions - Latest release: 2 months ago - 103 downloads last month - 3,859 stars on GitHub - 1 maintainer
graphlit-client 1.0.20250830001
Graphlit API Python Client
175 versions - Latest release: 5 days ago - 1.91 thousand downloads last month - 5 stars on GitHub - 1 maintainer
extended-docling 2.12.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
1 version - Latest release: 8 months ago - 22 downloads last month - 36,525 stars on GitHub - 1 maintainer
docling-google-ocr 2.13.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
2 versions - Latest release: 7 months ago - 32 downloads last month - 36,025 stars on GitHub - 1 maintainer
clearedge 0.1.17
Build a RAG preprocessing pipeline
18 versions - Latest release: over 1 year ago - 74 downloads last month - 11 stars on GitHub - 1 maintainer
toprint 0.1.32
2print/toprint: Python library for printing and converting between HTML, PDF, ZPL, and image form...
6 versions - Latest release: 3 months ago - 454 downloads last month - 0 stars on GitHub - 1 maintainer
llama-index-node-parser-docling 0.4.0
llama-index node_parser docling integration
6 versions - Latest release: about 1 month ago - 23.9 thousand downloads last month - 28,777 stars on GitHub - 1 maintainer
Related Keywords
pdf (13)
document-parser (13)
pdf-to-text (11)
document-parsing (11)
tables (9)
pptx (8)
docx (7)
document (6)
html (6)
markdown (6)
ai (6)
parsing (5)
xlsx (5)
pdf-converter (5)
documents (5)
pdf-to-markdown (5)
convert (5)
ocr (4)
llm (4)
structured-data (4)
pdf-to-excel (3)
ppt-to-json (3)
ppt-to-markdown (3)
pdf-document-processor (3)
docx-to-markdown (3)
langchain (3)
docling (3)
layout model (3)
segmentation (3)
table structure (3)
table former (3)
PDF (3)
html-to-markdown (2)
image-to-markdown (2)
rag (2)
document-conversion (2)
document-processing (2)
NLP (2)
HTML (2)
deep-learning (2)
data-pipelines (2)
document-image-analysis (2)
document-image-processing (2)
donut (2)
information-retrieval (2)
XML (2)
CV (2)
machine-learning (2)
ml (2)
natural-language-processing (2)
preprocessing (2)
nlp (2)
image-to-xml (1)
img2xml (1)
img2markdown (1)
image-to-md (1)
img2md (1)
image-to-docx (1)
img2docx (1)
img2zpl (1)
image-to-doc (1)
img2doc (1)
image-to-zpl (1)
img2json (1)
image-to-json (1)
doc2print (1)
doc-to-print (1)
doc2pdf (1)
doc-to-pdf (1)
doc2html (1)
doc-to-html (1)
doc2zpl (1)
doc-to-zpl (1)
doc2img (1)
doc-to-image (1)
llama (1)
zpl-to-pdf (1)
zpl2html (1)
zpl-to-html (1)
zpl2img (1)
zpl-to-image (1)
zpl2doc (1)
zpl-to-doc (1)
zpl2docx (1)
zpl-to-docx (1)
zpl2md (1)
zpl-to-md (1)
zpl2markdown (1)
zpl-to-markdown (1)
zpl2xml (1)
zpl-to-xml (1)
zpl2json (1)
zpl-to-json (1)
img2print (1)
image-to-print (1)
img2pdf (1)
image-to-pdf (1)
img2html (1)
image-to-html (1)
markdown-to-docx (1)