An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pdf-parsing" keyword

View the packages on the pypi.org package registry that are tagged with the "pdf-parsing" keyword.

Top 0.8% on pypi.org
pdfplumber 0.11.6
Plumb a PDF for detailed information about each char, rectangle, and line.
72 versions - Latest release: 22 days ago - 118 dependent packages - 1,210 dependent repositories - 3.59 million downloads last month - 7,545 stars on GitHub - 1 maintainer
pdfplumber-aemc 0.11.3
Plumb a PDF for detailed information about each char, rectangle, and line.
16 versions - Latest release: 12 months ago - 1 dependent repositories - 471 downloads last month - 7,545 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
pypdf 5.4.0
A pure-python PDF library capable of splitting, merging, cropping, and transforming PDF files
68 versions - Latest release: about 1 month ago - 390 dependent packages - 3,809 dependent repositories - 11.9 million downloads last month - 8,259 stars on GitHub - 2 maintainers
Top 0.4% on pypi.org
pypdf2 3.0.1
A pure-python PDF library capable of splitting, merging, cropping, and transforming PDF files
66 versions - Latest release: over 2 years ago - 160 dependent packages - 1,626 dependent repositories - 12.4 million downloads last month - 7,337 stars on GitHub - 2 maintainers
iqdmpdf 0.3.0
Scans a directory for IMRT QA results
17 versions - Latest release: about 4 years ago - 1 dependent repositories - 1.27 thousand downloads last month - 10 stars on GitHub - 1 maintainer
declatravaux 0.1.11
Utilitaire de transmission de déclarations issues de la plateforme http://www.reseaux-et-canalisa...
11 versions - Latest release: about 4 years ago - 1 dependent repositories - 452 downloads last month - 7,337 stars on GitHub - 1 maintainer
pdfdecrypt 1.0
Remove passwords from PDF documents
1 version - Latest release: over 2 years ago - 63 downloads last month - 7,337 stars on GitHub - 1 maintainer
depdf 0.2.2
PDF table & paragraph extractor
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 193 downloads last month - 11 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
py-pdf-parser 0.13.0
A tool to help extracting information from structured PDFs.
13 versions - Latest release: 9 months ago - 3 dependent repositories - 15.1 thousand downloads last month - 335 stars on GitHub - 1 maintainer
tikara 0.1.6
The metadata and text content extractor for almost every file type.
6 versions - Latest release: 3 months ago - 214 downloads last month - 1 stars on GitHub - 1 maintainer
pdf-bank-statement-parser 0.1.1
Command-line tool for converting PDF bank statements into CSV
2 versions - Latest release: 6 months ago - 94 downloads last month - 1 stars on GitHub - 1 maintainer
tetebeche 0.1.1
Script to generate a tėte-bêche book from two pdfs
2 versions - Latest release: over 2 years ago - 90 downloads last month - 8,259 stars on GitHub - 1 maintainer
pdf4py 0.1.0
A PDF parser written in Python3 with no external dependencies.
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 137 downloads last month - 57 stars on GitHub - 1 maintainer
pdfbook 0.1.0
Rearrange pages in PDFs for printing books
1 version - Latest release: about 2 years ago - 90 downloads last month - 7,337 stars on GitHub - 1 maintainer
pdftools.pdfposter 0.8.1
Scale and tile PDF images/pages to print on multiple pages.
8 versions - Latest release: over 2 years ago - 3 dependent repositories - 411 downloads last month - 8,259 stars on GitHub - 1 maintainer
flyer-composer 1.0rc2
Rearrange PDF pages to print as flyers on one paper
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 70 downloads last month - 8,259 stars on GitHub - 1 maintainer
refchaser 0.0.3
Written in python, for checking reference lists in systematic reviews and literature reviews, hel...
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 62 downloads last month - 21 stars on GitHub - 1 maintainer
Related Keywords
pdf 14 python 10 pdf-parser 9 help-wanted 8 pdf-documents 8 pdf-manipulation 8 pypdf2 8 table-extraction 3 document-parsing 2 text-mining 2 information-extraction 2 text-extraction 1 text-parsing 1 text-processing 1 text-recognition 1 tika 1 unstructured-data 1 word-documents 1 image-to-text 1 java 1 llm 1 text-analytics 1 structured-data 1 powerpoint 1 office-documents 1 ocr 1 mime-type 1 metadata 1 language-detection 1 image-extraction 1 format-identification 1 format-detection 1 file-type 1 systematic-reviews 1 systematic-literature-reviews 1 scihub 1 research-paper 1 pdf-downloader 1 literature-review 1 evidence-based-medicine 1 citation-managment-tool 1 cermine 1 bibliographic-references 1 leaflet 1 flyer 1 poster 1 book 1 parser 1 fnb 1 first-national-bank 1 financial-analysis 1 banking 1 bank 1 retrieval-augmented-generation 1 pdf-to-text 1 natural-language-processing 1 ml 1 metadata-extraction 1 data-processing 1 data-parsing 1 data-extraction 1 content-type 1 content-processing 1 content-parsing 1 content-management 1 content-intelligence 1 content-indexing 1 content-extraction 1 content-detection 1 apache-tika 1 py-pdf-parser 1 parsing 1 pdftk 1 paragraph-extraction 1 pdf-to-html 1 decrypt 1 radiation-oncology 1 qa 1 datamining 1 IMRT QA 1 radiation oncology 1 data mining 1 file-reader 1 file-processing 1 file-parsing 1 file-identification 1 file-format 1 file-conversion 1 file-analysis 1 excel 1 docx 1 document-understanding 1 document-text 1 document-reader 1 document-processing 1 document-ocr 1 document-metadata 1 document-management 1 document-intelligence 1 document-indexing 1