An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pdf-document-processor" keyword

View the packages on the pypi.org package registry that are tagged with the "pdf-document-processor" keyword.

llama-cloud-services 0.6.12
Tailored SDK clients for LlamaCloud services.
13 versions - Latest release: 9 days ago - 2.92 million downloads last month - 3,878 stars on GitHub - 1 maintainer
pypdfform 2.2.1
The Python library for PDF forms.
105 versions - Latest release: about 9 hours ago - 1 dependent repositories - 12.1 thousand downloads last month - 544 stars on GitHub - 1 maintainer
pdfconduit-api 0.1.27
PDF toolkit for preparing documents for distribution.
27 versions - Latest release: about 6 years ago - 1 dependent repositories - 589 downloads last month - 26 stars on GitHub - 1 maintainer
pagelabels 1.2.1 💰
Python library to manipulate PDF page numbers and labels.
7 versions - Latest release: 9 months ago - 3 dependent repositories - 697 downloads last month - 74 stars on GitHub - 1 maintainer
pdfconduit-modify 2.2.4
PDF toolkit for preparing documents for distribution.
13 versions - Latest release: almost 5 years ago - 1 dependent repositories - 344 downloads last month - 26 stars on GitHub - 1 maintainer
pdfconduit 4.7.3
PDF toolkit for preparing documents for distribution.
81 versions - Latest release: 16 days ago - 1 dependent repositories - 3.66 thousand downloads last month - 24 stars on GitHub - 1 maintainer
pdfconduit-convert 1.2.9
PDF toolkit for preparing documents for distribution.
17 versions - Latest release: about 4 years ago - 2 dependent repositories - 414 downloads last month - 26 stars on GitHub - 1 maintainer
llm-parse 0.1.4
Parse data from documents optimised for downstream llm tasks.
5 versions - Latest release: 7 months ago - 258 downloads last month - 3,859 stars on GitHub - 1 maintainer
pdfcatalog 1.0.2
Build catalogs for PDF documents automatically.
5 versions - Latest release: about 5 years ago - 1 dependent repositories - 265 downloads last month - 6 stars on GitHub - 1 maintainer
txt-from-pdf 1.3.1
Extract clean text from PDFs.
10 versions - Latest release: 9 months ago - 262 downloads last month - 1 stars on GitHub - 1 maintainer
pdfconduit-utils 1.1.2
PDF toolkit for preparing documents for distribution.
8 versions - Latest release: over 5 years ago - 1 dependent repositories - 230 downloads last month - 26 stars on GitHub - 1 maintainer
spark-pdf-python 0.1.1
PDF DataSource for Apache Spark in Python
3 versions - Latest release: 2 months ago - 93 downloads last month - 45 stars on GitHub - 1 maintainer
pyspark-pdf 0.1.0rc9
Spark-Pdf is a library for processing documents using Apache Spark
8 versions - Latest release: 5 months ago - 254 downloads last month - 45 stars on GitHub - 1 maintainer
fastpdf 1.0.4
SDK for PDF rendering, generation & transformation via Fast PDF Service.
7 versions - Latest release: over 1 year ago - 198 downloads last month - 0 stars on GitHub - 1 maintainer
pdfer 0.1.7
The package will help you manage and parse PDFs to text with OCR and not.
6 versions - Latest release: about 4 years ago - 1 dependent repositories - 141 downloads last month - 1 stars on GitHub - 1 maintainer
pdflex 0.1.9
Python tools for PDF automation.
7 versions - Latest release: 30 days ago - 391 downloads last month - 3 stars on GitHub - 1 maintainer
scaledp 0.2.2
ScaleDP is a library for processing documents using Apache Spark and LLMs
58 versions - Latest release: about 1 month ago - 1.94 thousand downloads last month - 9 stars on GitHub - 1 maintainer
pdfcontentconverter 0.3.1
A tool for converting PDF text as well as structural features into a pandas dataframe.
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 216 downloads last month - 8 stars on GitHub - 1 maintainer
llama-index-readers-llama-parse 0.4.0
llama-index readers llama-parse integration
8 versions - Latest release: 5 months ago - 6 dependent packages - 2.04 million downloads last month - 3,827 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
pdf2jpg 0.0.9
Wrapper to convert PDF files into jpg
10 versions - Latest release: about 6 years ago - 1 dependent package - 13 dependent repositories - 2.79 thousand downloads last month - 50 stars on GitHub - 1 maintainer
client-onedoc 0.0.21
Onedoc SDK for Python
21 versions - Latest release: 12 months ago - 1.29 thousand downloads last month - 69 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
pdfcropmargins 2.2.0
A command-line program to crop the margins of PDF files, with many options.
64 versions - Latest release: 4 months ago - 3 dependent packages - 9 dependent repositories - 9.19 thousand downloads last month - 357 stars on GitHub - 1 maintainer
auto-research 1.0
Geberate scientific survey with just a query
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 75 downloads last month - 57 stars on GitHub - 1 maintainer
stevenpy 0.0.2
Parallel Pooling Batch Document Processor
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 50 downloads last month - 0 stars on GitHub - 1 maintainer
pdfconduit-transform 1.2.2
PDF toolkit for preparing documents for distribution.
6 versions - Latest release: almost 5 years ago - 1 dependent repositories - 159 downloads last month - 26 stars on GitHub - 1 maintainer
pdfconduit-gui 1.2.0
GUI wrapper for pdfconduit.
25 versions - Latest release: about 6 years ago - 1 dependent repositories - 448 downloads last month - 26 stars on GitHub - 1 maintainer
burdoc 0.2.3
Advanced PDF parsing for python
4 versions - Latest release: 9 months ago - 153 downloads last month - 9 stars on GitHub - 1 maintainer
pdfdarkmode 1.0.5
Converts PDFs to have a grey background to be easier on the eyes
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 211 downloads last month - 17 stars on GitHub - 1 maintainer
pdfwork 0.4.0
基于 pikepdf 封装的命令行工具,处理 PDF 文件用
6 versions - Latest release: almost 4 years ago - 1 dependent repositories - 264 downloads last month - 2 stars on GitHub - 1 maintainer
ez-parse 0.1.2
A Python library for parsing PDFs of LinkedIn profiles
3 versions - Latest release: almost 2 years ago - 100 downloads last month - 0 stars on GitHub - 1 maintainer