pypi.org : pdfsegmenter
This library builds a graph representation of the content of a PDF. The graph is then clustered, and the resulting page segments are classified and returned. Tables are returned formatted as CSV.
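The listing does not show the package's API, so the following is only a minimal sketch of the general idea described above, not pdfsegmenter's own implementation. It assumes text boxes are given as `(x0, y0, x1, y1)` tuples, links boxes whose gap is below a threshold, and treats connected components as page segments. The names `gap`, `build_graph`, `cluster`, and the `max_gap` parameter are hypothetical, and connected components stand in for whatever clustering method the library actually uses.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical box format: (x0, y0, x1, y1) in page coordinates.
Box = tuple[float, float, float, float]


def gap(a: Box, b: Box) -> float:
    """Largest axis-wise gap between two boxes (0 if they overlap on both axes)."""
    dx = max(0.0, max(a[0], b[0]) - min(a[2], b[2]))
    dy = max(0.0, max(a[1], b[1]) - min(a[3], b[3]))
    return max(dx, dy)


def build_graph(boxes: list[Box], max_gap: float) -> dict[int, set[int]]:
    """Nodes are box indices; an edge joins two boxes closer than max_gap."""
    graph: dict[int, set[int]] = defaultdict(set)
    for i, j in combinations(range(len(boxes)), 2):
        if gap(boxes[i], boxes[j]) <= max_gap:
            graph[i].add(j)
            graph[j].add(i)
    return graph


def cluster(boxes: list[Box], max_gap: float = 10.0) -> list[list[int]]:
    """Group boxes into segments by finding connected components of the graph."""
    graph = build_graph(boxes, max_gap)
    seen: set[int] = set()
    segments: list[list[int]] = []
    for start in range(len(boxes)):
        if start in seen:
            continue
        stack, component = [start], []
        seen.add(start)
        while stack:  # depth-first traversal of one component
            node = stack.pop()
            component.append(node)
            for neighbour in graph[node]:
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        segments.append(sorted(component))
    return segments


boxes = [(0, 0, 50, 10), (0, 14, 50, 24), (200, 0, 250, 10), (200, 300, 250, 310)]
segments = cluster(boxes)  # the first two boxes are 4 units apart and end up together
```

A real pipeline would go on to classify each segment (for example as text, table, or figure) and write table segments out as CSV; those steps are omitted here because the listing gives no detail on how they are done.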
Registry - Source - Documentation - JSON
purl: pkg:pypi/pdfsegmenter
Keywords: pdf, document-processing, python, page-segmentation, layout-analysis, cluster-analysis, annotations, csv, table, detection-model
License: MIT
Latest release: over 4 years ago
First release: over 4 years ago
Dependent repositories: 1
Downloads: 38 last month
Stars: 22 on GitHub
Forks: 3 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 26 days ago