pypi.org "document-to-markdown" keyword
View the packages on the pypi.org package registry that are tagged with the "document-to-markdown" keyword.
docstrange 1.1.5
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, J...16 versions - Latest release: 7 days ago - 1.93 thousand downloads last month - 493 stars on GitHub - 1 maintainer
llm-data-converter 2.2.0
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPo...23 versions - Latest release: about 2 months ago - 297 downloads last month - 3 stars on GitHub - 1 maintainer
document-data-extractor 1.0.4
Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPo...5 versions - Latest release: about 1 month ago - 78 downloads last month - 3 stars on GitHub - 1 maintainer
Related Keywords
batch-document-processing
3
word-to-markdown
3
powerpoint-to-markdown
3
excel-to-markdown
3
html-to-markdown
3
text-extraction
3
document-ai
3
llm-ready-data
3
layout-detection
3
table-extraction
3
structured-data-extraction
3
local-document-processing
3
pdf-to-markdown
3
tesseract-alternative
3
paddleocr-alternative
3
llm
3
document-processing
3
document-conversion
3
markdown
3
pdf
3
image-processing
3
intelligent-document-processing
3
document-understanding
3
ocr
3
rag
3
ai-training-data
3
unstructured-alternative
3
docling-alternative
3
marker-alternative
3
markitdown-alternative
3
mineru-alternative
3
ppt-to-markdown
2
offline-document-converter
2
offline-document-extractor
2
ai
1
document-parser
1
document-parsing
1
image-to-markdown
1
pdf-parser
1
pdf-to-json
1
structured-data
1
structured-data-capture
1
tables
1
docstrange
1