pypi.org : spark-pdf-python
PDF DataSource for Apache Spark in Python
Registry
- Homepage
- Documentation
- JSON
- codemeta.json
purl: pkg:pypi/spark-pdf-python
Keywords:
big-data
, data-engineering
, data-extraction
, data-science
, ocr
, ocr-recognition
, pdf
, pdf-document
, pdf-document-processor
, spark
, spark-datasource
, tesseract
, tesseract-ocr
License: AGPL-3.0
Latest release: 10 months ago
First release: 10 months ago
Downloads: 24 last month
Stars: 49 on GitHub
Forks: 3 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 25 days ago