An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org: spark-pdf-python

PDF DataSource for Apache Spark in Python

purl: pkg:pypi/spark-pdf-python
Keywords: big-data, data-engineering, data-extraction, data-science, ocr, ocr-recognition, pdf, pdf-document, pdf-document-processor, spark, spark-datasource, tesseract, tesseract-ocr
License: AGPL-3.0
Latest release: 10 months ago
First release: 10 months ago
Downloads: 24 last month
Stars: 49 on GitHub
Forks: 3 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 25 days ago
