An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "document-conversion" keyword

View the packages on the pypi.org package registry that are tagged with the "document-conversion" keyword.

magicconvert 0.1.3
MagicConvert is a Python library that converts various document formats (PDF, DOCX, XLSX, PPTX, H...
3 versions - Latest release: 4 months ago - 43 downloads last month - 2 stars on GitHub - 1 maintainer
markitdown-pdf-separators 0.4.3
MarkItDown with PDF page separators - convert PDFs to Markdown with page boundary markers
7 versions - Latest release: about 1 month ago - 529 downloads last month - 0 stars on GitHub - 1 maintainer
pdf2markdown 0.2.0
Python library and CLI tool that leverages LLMs to convert technical PDF documents to well-struct...
1 version - Latest release: 20 days ago - 173 downloads last month - 0 stars on GitHub - 1 maintainer
docstrange 1.1.5
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, J...
16 versions - Latest release: 5 days ago - 1.93 thousand downloads last month - 493 stars on GitHub - 1 maintainer
llm-data-converter 2.2.0
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPo...
23 versions - Latest release: about 1 month ago - 297 downloads last month - 3 stars on GitHub - 1 maintainer
document-data-extractor 1.0.4
Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPo...
5 versions - Latest release: about 1 month ago - 78 downloads last month - 3 stars on GitHub - 1 maintainer
md-server 0.1.2
HTTP API server for converting documents, web pages, and media to markdown
4 versions - Latest release: 27 days ago - 421 downloads last month - 3 stars on GitHub - 1 maintainer
autoscan 0.1.2
High fidelity PDF to Markdown conversion using LLMs (GPT-4o, Gemini, etc.)
3 versions - Latest release: 12 days ago - 85 downloads last month - 3 stars on GitHub - 1 maintainer
markitdown-mcp-server 0.1.11
MCP server for converting documents to markdown using MarkItDown
2 versions - Latest release: 4 months ago - 63 downloads last month - 1 maintainer
document-clipper 1.2.1
A set of utility classes and functions to process documents with Python
30 versions - Latest release: almost 4 years ago - 1 dependent repositories - 122 downloads last month - 4 stars on GitHub - 2 maintainers
loutils 1.4.0 💰
Cross-platform LibreOffice document conversion and printing
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 90 downloads last month - 19 stars on GitHub - 1 maintainer
toprint 0.1.32
2print/toprint: Python library for printing and converting between HTML, PDF, ZPL, and image form...
6 versions - Latest release: 3 months ago - 454 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
markdown 9 pdf 8 ocr 6 document-processing 6 pdf-to-markdown 5 llm 5 html-to-markdown 5 text-extraction 4 unstructured-alternative 3 ai-training-data 3 rag 3 docling-alternative 3 marker-alternative 3 markitdown-alternative 3 document-understanding 3 mineru-alternative 3 paddleocr-alternative 3 intelligent-document-processing 3 tesseract-alternative 3 document-to-markdown 3 local-document-processing 3 structured-data-extraction 3 table-extraction 3 layout-detection 3 llm-ready-data 3 document-ai 3 excel-to-markdown 3 powerpoint-to-markdown 3 word-to-markdown 3 batch-document-processing 3 image-processing 3 ai 3 pdf-to-json 2 file-conversion 2 image-to-markdown 2 offline-document-extractor 2 conversion 2 gpt 2 offline-document-converter 2 ppt-to-markdown 2 pdf-parser 2 server 2 image-to-md 1 img2md 1 img2markdown 1 img2xml 1 image-to-docx 1 img2docx 1 image-to-doc 1 img2doc 1 zpl-to-json 1 image-to-xml 1 img2json 1 image-to-json 1 doc2print 1 doc-to-print 1 doc2pdf 1 doc-to-pdf 1 doc2html 1 doc-to-html 1 doc2zpl 1 doc-to-zpl 1 doc2img 1 doc-to-image 1 doc2docx 1 zpl2json 1 zpl-to-xml 1 zpl2xml 1 zpl-to-markdown 1 zpl2markdown 1 zpl-to-md 1 zpl2md 1 zpl-to-docx 1 zpl2docx 1 zpl-to-doc 1 zpl2doc 1 zpl-to-image 1 zpl2img 1 docx-to-markdown 1 img2print 1 image-to-print 1 img2pdf 1 image-to-pdf 1 img2html 1 image-to-html 1 img2zpl 1 image-to-zpl 1 markdown-to-docx 1 md2xml 1 markdown-to-xml 1 md2json 1 markdown-to-json 1 format-conversion 1 document-generation 1 report-generation 1 label-printing 1 barcode 1 qrcode 1 thermal-printing 1 zebra 1