An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org : extended-docling

SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications, now with Google OCR support.

Registry - Source - Documentation - JSON
purl: pkg:pypi/extended-docling
Keywords: docling, convert, document, pdf, docx, html, markdown, layout model, segmentation, table structure, table former, ai, document-parser, document-parsing, documents, pdf-converter, pdf-to-json, pdf-to-text, pptx, tables, xlsx
License: MIT
Latest release: 8 months ago
First release: 8 months ago
Downloads: 22 last month
Stars: 36,525 on GitHub
Forks: 2,507 on GitHub
Total Commits: 251
Committers: 26
Average commits per author: 9.654
Development Distribution Score (DDS): 0.717
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 10 days ago
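
The commit statistics above can be cross-checked with a little arithmetic. The sketch below recomputes the average commits per author from the listed totals. It also assumes that DDS follows the common definition of 1 minus the top committer's share of all commits; this page does not state that formula. Under that assumption, the top committer's commit count is only an estimate derived from the reported score, not a figure taken from the listing.

```python
# Figures copied from the listing above.
total_commits = 251
committers = 26
reported_dds = 0.717


def average_commits_per_author(total: int, authors: int) -> float:
    """Mean number of commits per committer."""
    return total / authors


def development_distribution_score(top_author_commits: int, total: int) -> float:
    """ASSUMED definition: 1 - (top committer's commits / total commits)."""
    return 1 - top_author_commits / total


avg = round(average_commits_per_author(total_commits, committers), 3)

# Hypothetical value: the top committer's count is NOT in the listing;
# it is back-solved from the reported DDS under the assumed formula.
implied_top_author_commits = round((1 - reported_dds) * total_commits)

recomputed_dds = round(
    development_distribution_score(implied_top_author_commits, total_commits), 3
)

print(f"average commits per author: {avg}")
print(f"implied top-committer commits (estimate): {implied_top_author_commits}")
print(f"DDS recomputed from that estimate: {recomputed_dds}")
```

If the assumed definition holds, a score near 0.717 would mean that no single committer dominates the history, since the most active author accounts for well under a third of all commits.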
