pypi.org : docling-google-ocr
SDK and CLI for parsing PDF, DOCX, HTML, and more into a unified document representation that powers downstream workflows such as generative AI applications.
purl: pkg:pypi/docling-google-ocr
Keywords: docling, convert, document, pdf, docx, html, markdown, layout model, segmentation, table structure, table former, ai, document-parser, document-parsing, documents, pdf-converter, pdf-to-json, pdf-to-text, pptx, tables, xlsx
License: MIT
Latest release: 7 months ago
First release: 7 months ago
Downloads: 32 last month
Stars: 36,025 on GitHub
Forks: 2,457 on GitHub
Total Commits: 251
Committers: 26
Average commits per author: 9.654
Development Distribution Score (DDS): 0.717
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 17 days ago