pypi.org : pdf-language-detector
A python script to iterate over a list of PDF in a directory and try to guess their language with Tesseract OCR.
Registry
-
Source
- Documentation
- JSON
purl: pkg:pypi/pdf-language-detector
Keywords:
hacktoberfest
, lstm
, machine-learning
, ocr
, ocr-engine
, tesseract
, tesseract-ocr
License: Apache-2.0
Latest release: almost 2 years ago
First release: almost 2 years ago
Downloads: 517 last month
Stars: 60,749 on GitHub
Forks: 9,343 on GitHub
Total Commits: 5218
Committers: 222
Average commits per author: 23.505
Development Distribution Score (DDS): 0.567
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 5 days ago