pypi.org : magicconvert
MagicConvert is a Python library that converts various document formats (PDF, DOCX, XLSX, PPTX, HTML, images) to Markdown text. Features include OCR support, automatic format detection, and URL/file stream handling.
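To illustrate what automatic format detection can involve, here is a minimal Python sketch that guesses a format from a file's leading bytes using only the standard library. It is not MagicConvert's implementation or API: the function name `detect_format` and the returned labels are hypothetical, and a real detector would likely cover more formats and edge cases.

```python
import io
import zipfile


def detect_format(data: bytes) -> str:
    """Guess a document format from its raw bytes (hypothetical helper)."""
    if data.startswith(b"%PDF-"):
        return "pdf"
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if data.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    if data.startswith(b"PK\x03\x04"):
        # DOCX, XLSX and PPTX are all ZIP containers; their top-level
        # folder names ("word/", "xl/", "ppt/") tell them apart.
        try:
            with zipfile.ZipFile(io.BytesIO(data)) as zf:
                names = zf.namelist()
        except zipfile.BadZipFile:
            return "unknown"
        for prefix, fmt in (("word/", "docx"), ("xl/", "xlsx"), ("ppt/", "pptx")):
            if any(name.startswith(prefix) for name in names):
                return fmt
        return "zip"
    # HTML has no fixed signature, so look at the first non-blank characters.
    head = data[:1024].lstrip().lower()
    if head.startswith(b"<!doctype html") or head.startswith(b"<html"):
        return "html"
    return "unknown"
```

Checking content rather than the file extension is what makes this approach usable for URLs and file streams, where a reliable file name may not be available.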
purl: pkg:pypi/magicconvert
Keywords: document-conversion, markdown, ocr, pdf-to-markdown, docx-to-markdown, xlsx-to-markdown, pptx-to-markdown, html-to-markdown, image-to-text, python-library, text-extraction, document-processing, format-detection, tesseract-ocr, file-conversion, text-processing
License: MIT
Latest release: 2 months ago
First release: 2 months ago
Downloads: 106 last month
Stars: 1 on GitHub
Forks: 0 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 5 days ago