pypi.org "document-parser" keyword
View the packages on the pypi.org package registry that are tagged with the "document-parser" keyword.
graphlit-client 1.0.20250416001
Graphlit API Python Client126 versions - Latest release: 2 days ago - 5.09 thousand downloads last month - 5 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.13 versions - Latest release: 8 months ago - 368 downloads last month - 10,877 stars on GitHub - 1 maintainer
python-docparser 1.1.0
Extract text from your docx document.3 versions - Latest release: over 2 years ago - 76 downloads last month - 10 stars on GitHub - 1 maintainer
llama-cloud-services 0.6.12
Tailored SDK clients for LlamaCloud services.13 versions - Latest release: 8 days ago - 2.54 million downloads last month - 3,878 stars on GitHub - 1 maintainer
vision-parse 0.1.13
Parse PDF documents into markdown formatted content using Vision LLMs14 versions - Latest release: 3 months ago - 2.94 thousand downloads last month - 339 stars on GitHub - 1 maintainer
llama-index-readers-docling 0.3.2
llama-index readers docling integration5 versions - Latest release: about 1 month ago - 5.73 thousand downloads last month - 27,013 stars on GitHub - 1 maintainer
llm-parse 0.1.4
Parse data from documents optimised for downstream llm tasks.5 versions - Latest release: 7 months ago - 258 downloads last month - 3,859 stars on GitHub - 1 maintainer
extended-docling 2.12.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...1 version - Latest release: 4 months ago - 74 downloads last month - 26,056 stars on GitHub - 1 maintainer
df-extract 0.0.2
DecisionFacts Extraction Library extracts content from PDF, PPTX, Docx, png, jpg., and convert as...3 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 92 downloads last month - 14 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
43 versions - Latest release: 9 days ago - 14 dependent repositories - 5.27 thousand downloads last month - 2,767 stars on GitHub - 1 maintainer
deepdoctection 0.42.0
Repository for Document AI43 versions - Latest release: 9 days ago - 14 dependent repositories - 5.27 thousand downloads last month - 2,767 stars on GitHub - 1 maintainer
docling-google-ocr 2.13.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...2 versions - Latest release: 3 months ago - 117 downloads last month - 26,056 stars on GitHub - 1 maintainer
llama-index-node-parser-docling 0.3.1
llama-index node_parser docling integration4 versions - Latest release: 2 months ago - 3.86 thousand downloads last month - 26,056 stars on GitHub - 1 maintainer
openparse 0.7.0
Streamlines the process of preparing documents for LLM's.17 versions - Latest release: 5 months ago - 4.26 thousand downloads last month - 2,904 stars on GitHub - 1 maintainer
autorag 0.3.13 💰
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.67 versions - Latest release: 3 months ago - 3.48 thousand downloads last month - 3,756 stars on GitHub - 1 maintainer
semantic-ai 0.0.6
Sematic AI RAG System8 versions - Latest release: 9 months ago - 236 downloads last month - 18 stars on GitHub - 1 maintainer
llama-index-readers-llama-parse 0.4.0
llama-index readers llama-parse integration8 versions - Latest release: 5 months ago - 6 dependent packages - 2.04 million downloads last month - 3,827 stars on GitHub - 1 maintainer
anyparser-crewai 0.0.2
Anyparser CrewAI Integration2 versions - Latest release: 2 months ago - 112 downloads last month - 1 stars on GitHub - 1 maintainer
llamarker 1.0.2
A universal GenAI-based local parser for complex documents of all types.3 versions - Latest release: 3 months ago - 106 downloads last month - 1 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
181 versions - Latest release: about 1 month ago - 113 dependent packages - 3,374 dependent repositories - 2.7 million downloads last month - 9,368 stars on GitHub - 1 maintainer
unstructured 0.17.0
A library that prepares raw documents for downstream ML tasks.181 versions - Latest release: about 1 month ago - 113 dependent packages - 3,374 dependent repositories - 2.7 million downloads last month - 9,368 stars on GitHub - 1 maintainer
clearedge 0.1.17
Build a RAG preprocessing pipeline18 versions - Latest release: about 1 year ago - 473 downloads last month - 11 stars on GitHub - 1 maintainer
marie-ai 3.0.29
Python library to Integrate AI-powered features into your applications7 versions - Latest release: about 1 year ago - 91 downloads last month - 60 stars on GitHub - 1 maintainer
Related Keywords
pdf
13
pdf-to-json
11
pdf-to-text
10
document-parsing
10
docx
8
pptx
8
ocr
7
tables
7
markdown
6
document
5
llm
5
parsing
5
pdf-to-markdown
4
table-detection
4
python
4
retrieval-augmented-generation
4
xlsx
4
rag
4
pdf-converter
4
html
4
documents
4
convert
4
ai
4
ppt-to-json
3
ppt-to-markdown
3
pdf-to-excel
3
pdf-document-processor
3
docx-to-markdown
3
structured-data
3
table-recognition
3
nlp
3
deep-learning
3
machine-learning
3
langchain
3
document-image-analysis
3
PDF
3
publaynet
2
XML
2
CV
2
HTML
2
NLP
2
document-layout-analysis
2
docling
2
layout model
2
segmentation
2
table structure
2
table former
2
pubtabnet
2
preprocessing
2
data-pipelines
2
document-image-processing
2
donut
2
information-retrieval
2
ml
2
natural-language-processing
2
llama
2
pytorch
2
document parsing
1
AI
1
typescript
1
knowledge-graph
1
local parser
1
genai
1
llama-ai
1
vector-database
1
fastapi
1
semantic-search
1
kag
1
crewai-rag
1
crewai
1
crew-ai-rag
1
crew-ai
1
cag
1
inference-api
1
cache-augmented-generation
1
artificial-intelligence
1
anyparser
1
parse
1
llama-parse
1
llama2
1
openai-api
1
optical-mark-recognition
1
optical-character-recognition
1
omr
1
iwr
1
intelligent-word-recognition
1
intelligent-character-recognition
1
mlops
1
audio
1
video
1
image
1
container
1
docker
1
serving
1
embedding
1
encoding
1
neural-network
1
elastic
1
index
1
icr
1