An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "pdf-to-markdown" keyword

View the packages on the pypi.org package registry that are tagged with the "pdf-to-markdown" keyword.

opendataloader-pdf 0.0.9
A Python wrapper for the opendataloader-pdf Java CLI.
6 versions - Latest release: about 11 hours ago - 290 downloads last month - 6 stars on GitHub
llama-index-readers-llama-parse 0.5.0
llama-index readers llama-parse integration
9 versions - Latest release: about 1 month ago - 6 dependent packages - 2.22 million downloads last month - 3,956 stars on GitHub - 1 maintainer
llama-cloud-services 0.6.63
Tailored SDK clients for LlamaCloud services.
62 versions - Latest release: 2 days ago - 9.48 million downloads last month - 3,956 stars on GitHub - 1 maintainer
docstrange 1.1.5
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, J...
16 versions - Latest release: 2 days ago - 1.93 thousand downloads last month - 493 stars on GitHub - 1 maintainer
smart-llm-loader 0.1.0
A powerful PDF processing toolkit that seamlessly integrates with LLMs for intelligent document c...
1 version - Latest release: 7 months ago - 14 downloads last month - 66 stars on GitHub - 1 maintainer
llm-data-converter 2.2.0
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPo...
23 versions - Latest release: about 1 month ago - 297 downloads last month - 3 stars on GitHub - 1 maintainer
document-data-extractor 1.0.4
Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPo...
5 versions - Latest release: about 1 month ago - 78 downloads last month - 3 stars on GitHub - 1 maintainer
llm-parse 0.1.5
Parse data from documents optimised for downstream llm tasks.
6 versions - Latest release: 2 months ago - 103 downloads last month - 3,859 stars on GitHub - 1 maintainer
vision-parse 0.1.13
Parse PDF documents into markdown formatted content using Vision LLMs
14 versions - Latest release: 7 months ago - 1.74 thousand downloads last month - 423 stars on GitHub - 1 maintainer
llm-food 0.1.3
Serving files for hungry LLMs
4 versions - Latest release: 3 months ago - 111 downloads last month - 18 stars on GitHub - 1 maintainer
toprint 0.1.32
2print/toprint: Python library for printing and converting between HTML, PDF, ZPL, and image form...
6 versions - Latest release: 3 months ago - 454 downloads last month - 0 stars on GitHub - 1 maintainer
markdrop 3.5.0
A comprehensive PDF processing toolkit that converts PDFs to markdown with advanced AI-powered fe...
20 versions - Latest release: 2 months ago - 365 downloads last month - 116 stars on GitHub - 2 maintainers
credeed-pdf-to-markdown 0.1.0
Convert PDF to Markdown using Azure AI Document Intelligence and upload to S3. Provided by the Cr...
1 version - Latest release: 4 months ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
file2txt 1.0.1
file2txt is a Python library that takes common file formats and turns them into plain text (a txt file...
3 versions - Latest release: 2 months ago - 12 stars on GitHub - 1 maintainer
magicconvert 0.1.3
MagicConvert is a Python library that converts various document formats (PDF, DOCX, XLSX, PPTX, H...
3 versions - Latest release: 3 months ago - 54 downloads last month - 1 star on GitHub - 1 maintainer
wisup_e2m 0.1.61
Everything to Markdown.
24 versions - Latest release: about 1 year ago - 174 downloads last month - 1,089 stars on GitHub - 1 maintainer
Related Keywords
pdf (12), markdown (11), document-processing (8), llm (7), document-parser (6), ocr (6), pdf-to-json (6), text-extraction (6), html-to-markdown (6), document-parsing (5), ppt-to-markdown (5), rag (5), document-conversion (5), pdf-to-text (5), tables (5), table-extraction (4), docx-to-markdown (4), structured-data (4), ai (4), paddleocr-alternative (3), tesseract-alternative (3), mineru-alternative (3), markitdown-alternative (3), marker-alternative (3), docling-alternative (3), document (3), document-to-markdown (3), local-document-processing (3), structured-data-extraction (3), layout-detection (3), llm-ready-data (3), document-ai (3), excel-to-markdown (3), powerpoint-to-markdown (3), word-to-markdown (3), batch-document-processing (3), pdf-parser (3), gemini (3), image-to-text (3), parsing (3), pdf-document-processor (3), pdf-to-excel (3), ppt-to-json (3), pptx (3), image-processing (3), intelligent-document-processing (3), document-understanding (3), ai-training-data (3), unstructured-alternative (3), image-to-markdown (2), html (2), pdf-converter (2), offline-document-converter (2), file-conversion (2), openai (2), conversion (2), offline-document-extractor (2), doc-to-json (1), doc2json (1), md2print (1), doc-to-xml (1), doc2xml (1), doc-to-markdown (1), doc2markdown (1), doc-to-md (1), doc2md (1), img2xml (1), markdown-to-print (1), md2pdf (1), markdown-to-pdf (1), md2html (1), markdown-to-html (1), md2zpl (1), markdown-to-zpl (1), md2img (1), markdown-to-image (1), md2doc (1), markdown-to-doc (1), md2docx (1), markdown-to-docx (1), img2markdown (1), image-to-md (1), img2md (1), image-to-docx (1), img2docx (1), image-to-doc (1), img2doc (1), image-to-xml (1), img2json (1), image-to-json (1), doc2print (1), doc-to-print (1), image-to-zpl (1), doc2pdf (1), doc-to-pdf (1), doc2html (1), doc-to-html (1), doc2zpl (1), doc-to-zpl (1), doc2img (1)