An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "batch-document-processing" keyword

docstrange 1.1.8
Extract and Convert PDF, Word, PowerPoint, Excel, images, URLs into multiple formats (Markdown, J...
19 versions - Latest release: 5 months ago - 3.9 thousand downloads last month - 935 stars on GitHub - 1 maintainer
document-data-extractor 1.0.4
Best open-source document to markdown extractor for LLM training data. Convert PDF, Word, PowerPo...
5 versions - Latest release: 8 months ago - 17 downloads last month - 3 stars on GitHub - 1 maintainer
llm-data-converter 2.2.0
Best open-source document to markdown converter for LLM training data. Convert PDF, Word, PowerPo...
23 versions - Latest release: 8 months ago - 286 downloads last month - 5 stars on GitHub - 1 maintainer