pypi.org : spark-pdf-python
PDF DataSource for Apache Spark in Python
Registry
-
Source
- Homepage
- Documentation
- JSON
purl: pkg:pypi/spark-pdf-python
Keywords:
big-data
, data-engineering
, data-extraction
, data-science
, ocr
, ocr-recognition
, pdf
, pdf-document
, pdf-document-processor
, spark
, spark-datasource
, tesseract
, tesseract-ocr
License: AGPL-3.0
Latest release: 2 months ago
First release: 2 months ago
Downloads: 93 last month
Stars: 45 on GitHub
Forks: 4 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 10 days ago