Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pdf" keyword

llama-index-readers-pdf-table 0.1.3
llama-index readers pdf_table integration
5 versions - Latest release: 3 months ago - 383 downloads last month - 82,076 stars on GitHub - 1 maintainer
llama-index-readers-file 0.1.22
llama-index readers file integration
32 versions - Latest release: 17 days ago - 20 dependent packages - 526 thousand downloads last month - 31,781 stars on GitHub - 1 maintainer
etherpump 0.0.20 💰
Pumping text from etherpads into publications
20 versions - Latest release: about 3 years ago - 1 dependent repositories - 22 downloads last month - 15,384 stars on GitHub - 2 maintainers
Top 1.0% on pypi.org
ocrmypdf 16.2.0 💰
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
230 versions - Latest release: about 1 month ago - 10 dependent packages - 108 dependent repositories - 95.5 thousand downloads last month - 12,250 stars on GitHub - 1 maintainer
h2ogpt 0.2.0
Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports...
1 version - Latest release: 3 months ago - 36 downloads last month - 10,301 stars on GitHub - 1 maintainer
cell-bin 1.3.4 💰
A framework for generating single-cell gene expression data
10 versions - Latest release: 3 months ago - 1 dependent package - 246 downloads last month - 8,959 stars on GitHub - 1 maintainer
marker-pdf 0.2.6
Convert PDF to markdown with high speed and accuracy.
12 versions - Latest release: 9 days ago - 5.63 thousand downloads last month - 8,811 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
pypdf2 3.0.1
A pure-python PDF library capable of splitting, merging, cropping, and transforming PDF files
66 versions - Latest release: over 1 year ago - 160 dependent packages - 1,626 dependent repositories - 7.56 million downloads last month - 7,337 stars on GitHub - 2 maintainers
flyer-composer 1.0rc2
Rearrange PDF pages to print as flyers on one paper
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 26 downloads last month - 7,337 stars on GitHub - 1 maintainer
pdftools.pdfposter 0.8.1
Scale and tile PDF images/pages to print on multiple pages.
8 versions - Latest release: over 1 year ago - 3 dependent repositories - 262 downloads last month - 7,337 stars on GitHub - 1 maintainer
declatravaux 0.1.11
Utilitaire de transmission de déclarations issues de la plateforme http://www.reseaux-et-canalisa...
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 116 downloads last month - 7,337 stars on GitHub - 1 maintainer
pdfbook 0.1.0
Rearrange pages in PDFs for printing books
1 version - Latest release: about 1 year ago - 36 downloads last month - 7,337 stars on GitHub - 1 maintainer
tetebeche 0.1.1
Script to generate a tėte-bêche book from two pdfs
2 versions - Latest release: over 1 year ago - 3 downloads last month - 7,262 stars on GitHub - 1 maintainer
pdfdecrypt 1.0
Remove passwords from PDF documents
1 version - Latest release: over 1 year ago - 21 downloads last month - 7,262 stars on GitHub - 1 maintainer
Top 0.4% on pypi.org
pypdf 4.2.0
A pure-python PDF library capable of splitting, merging, cropping, and transforming PDF files
59 versions - Latest release: about 2 months ago - 390 dependent packages - 3,809 dependent repositories - 5.18 million downloads last month - 6,801 stars on GitHub - 2 maintainers
surya-ocr 0.4.6
OCR, layout, reading order, and line detection in 90+ languages
21 versions - Latest release: 8 days ago - 1 dependent package - 11 thousand downloads last month - 6,739 stars on GitHub - 1 maintainer
Top 0.6% on pypi.org
weasyprint 0.42.3 💰
The Awesome Document Factory
106 versions - Latest release: about 6 years ago - 114 dependent packages - 1,344 dependent repositories - 2.04 million downloads last month - 6,608 stars on GitHub - 3 maintainers
pdfplumber-aemc 0.11.3
Plumb a PDF for detailed information about each char, rectangle, and line.
16 versions - Latest release: 18 days ago - 1 dependent repositories - 471 downloads last month - 5,644 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
parsr-client 3.2.3
Python client for Parsr - Transforms PDF, Documents and Images into Enriched Structured Data
10 versions - Latest release: almost 4 years ago - 1 dependent package - 11 dependent repositories - 219 downloads last month - 5,634 stars on GitHub - 1 maintainer
20220429-pdfminer-jameslp310 0.0.2
PDF parser and analyzer
1 version - Latest release: about 2 years ago - 1 dependent repositories - 82 downloads last month - 5,496 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
pdfminer.six 20231228
PDF parser and analyzer
26 versions - Latest release: 5 months ago - 162 dependent packages - 2,496 dependent repositories - 3.66 million downloads last month - 5,496 stars on GitHub - 3 maintainers
e.pdfminer.six 0.0.1
PDF parser and analyzer
1 version - Latest release: over 4 years ago - 64 downloads last month - 5,496 stars on GitHub - 1 maintainer
marker-ocr 0.1.0 removed
Convert PDF to markdown with high speed and accuracy.
1 version - Latest release: 5 months ago - 4,998 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
pdfplumber 0.11.0
Plumb a PDF for detailed information about each char, rectangle, and line.
66 versions - Latest release: 3 months ago - 118 dependent packages - 1,210 dependent repositories - 975 thousand downloads last month - 4,885 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
khoj-assistant 1.12.0 💰
An AI copilot for your Second Brain
631 versions - Latest release: 25 days ago - 1 dependent repositories - 6.32 thousand downloads last month - 4,733 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
unstructured 0.14.0
A library that prepares raw documents for downstream ML tasks.
135 versions - Latest release: 8 days ago - 113 dependent packages - 3,374 dependent repositories - 1.16 million downloads last month - 4,064 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
pymupdfb 1.24.3
MuPDF shared libraries for PyMuPDF.
14 versions - Latest release: 16 days ago - 4 dependent packages - 133 dependent repositories - 1.97 million downloads last month - 4,025 stars on GitHub - 1 maintainer
aqpymupdf 1.23.7
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
1 version - Latest release: about 1 month ago - 39 downloads last month - 4,025 stars on GitHub - 1 maintainer
pdfautonup 1.9.0
Convert PDF files to 'n-up' PDF files, guessing the output layout.
21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 178 downloads last month - 4,025 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pymupdf 1.24.4
A high performance Python library for data extraction, analysis, conversion & manipulation of PDF...
115 versions - Latest release: 10 days ago - 206 dependent packages - 1,798 dependent repositories - 2.89 million downloads last month - 4,025 stars on GitHub - 1 maintainer
sphinx_pyppeteer_builder 1.0.0
A Sphinx PDF builder using pyppeteer
3 versions - Latest release: 8 months ago - 198 downloads last month - 3,445 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
borb 2.1.23 💰
borb is a library for reading, creating and manipulating PDF files in python.
59 versions - Latest release: 10 days ago - 1 dependent package - 26 dependent repositories - 21.7 thousand downloads last month - 3,304 stars on GitHub - 1 maintainer
grobid-client 0.8.5
A client library for accessing Grobid
15 versions - Latest release: 4 months ago - 2 dependent packages - 465 downloads last month - 3,022 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
mglib 1.3.9 💰
Common code used across all Papermerge project utilities
22 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 115 downloads last month - 2,340 stars on GitHub - 1 maintainer
limitpages 1.0.0 💰
Papermerge App to limit number of uploaded documents
1 version - Latest release: over 3 years ago - 1 dependent repositories - 9 downloads last month - 2,340 stars on GitHub - 1 maintainer
Top 0.9% on pypi.org
xhtml2pdf 0.2.15 💰
PDF generator using HTML and CSS
31 versions - Latest release: 4 months ago - 40 dependent packages - 2,256 dependent repositories - 726 thousand downloads last month - 2,185 stars on GitHub - 6 maintainers
Top 4.7% on pypi.org
pdftabextract 0.3.0
A set of tools for data mining (OCR-processed) PDFs
5 versions - Latest release: over 6 years ago - 12 dependent repositories - 1.29 thousand downloads last month - 2,159 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
tabula-py 2.9.1 💰
Simple wrapper for tabula-java, read tables from PDF into DataFrame
54 versions - Latest release: 11 days ago - 23 dependent packages - 565 dependent repositories - 308 thousand downloads last month - 2,070 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
pikepdf 8.15.1 💰
Read and write PDFs with Python, powered by qpdf
226 versions - Latest release: about 1 month ago - 59 dependent packages - 383 dependent repositories - 1.55 million downloads last month - 2,026 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
pdfrw 0.4
PDF file reader/writer library
4 versions - Latest release: over 6 years ago - 25 dependent packages - 1,188 dependent repositories - 288 thousand downloads last month - 1,828 stars on GitHub - 4 maintainers
Top 2.1% on pypi.org
invoice2data 0.4.5
Python parser to extract data from pdf invoice
102 versions - Latest release: 6 months ago - 5 dependent packages - 28 dependent repositories - 7.92 thousand downloads last month - 1,695 stars on GitHub - 2 maintainers
opticr 0.2.0 💰
expose a single interface and API to few OCR tools
2 versions - Latest release: over 1 year ago - 18 downloads last month - 1,459 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
pdf2image 1.17.0 💰
A wrapper around the pdftoppm and pdftocairo command line tools to convert PDF to a PIL Image list.
46 versions - Latest release: 5 months ago - 195 dependent packages - 2,760 dependent repositories - 1.72 million downloads last month - 1,459 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
excalibur-py 0.4.3 💰
A web interface to extract tabular data from PDFs.
9 versions - Latest release: about 4 years ago - 10 dependent repositories - 1.56 thousand downloads last month - 1,457 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
pdf2img 0.1.2 💰
A wrapper around the pdftoppm and pdftocairo command line tools to convert PDF to a PIL Image list.
3 versions - Latest release: about 3 years ago - 3 dependent repositories - 1.46 thousand downloads last month - 1,380 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
lightnovel-crawler 3.7.1 💰
An app to download novels from online sources and generate e-books.
180 versions - Latest release: 15 days ago - 1 dependent package - 1 dependent repositories - 2.92 thousand downloads last month - 1,164 stars on GitHub - 1 maintainer
Top 2.9% on pypi.org
rpaframework-pdf 7.3.2
PDF library of RPA Framework
44 versions - Latest release: about 2 months ago - 1 dependent package - 28 dependent repositories - 711 thousand downloads last month - 1,039 stars on GitHub - 5 maintainers
Top 3.4% on pypi.org
pdfx 1.4.1 💰
Extract metadata and URLs from PDF files, and download all referenced PDFs
11 versions - Latest release: about 3 years ago - 1 dependent package - 40 dependent repositories - 2.6 thousand downloads last month - 1,018 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
arxiv 2.1.0
Python wrapper for the arXiv API: https://arxiv.org/help/api/
33 versions - Latest release: 5 months ago - 77 dependent packages - 1,503 dependent repositories - 161 thousand downloads last month - 986 stars on GitHub - 1 maintainer
mathtranslate 3.1.2
Translate math-heavy papers
50 versions - Latest release: 9 months ago - 315 downloads last month - 971 stars on GitHub - 1 maintainer
llama-index-readers-smart-pdf-loader 0.1.4
llama-index readers smart_pdf_loader integration
6 versions - Latest release: about 1 month ago - 1 dependent package - 2.09 thousand downloads last month - 960 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
fpdf2 2.7.8
Simple & fast PDF generation for Python
45 versions - Latest release: 4 months ago - 65 dependent packages - 229 dependent repositories - 1.19 million downloads last month - 954 stars on GitHub - 2 maintainers
arxiv-client 0.3.2
Python3 client for the arXiv API
12 versions - Latest release: about 1 month ago - 508 downloads last month - 923 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
rows 0.4.1
A common, beautiful interface to tabular data, no matter the format
10 versions - Latest release: over 5 years ago - 42 dependent repositories - 1.76 thousand downloads last month - 861 stars on GitHub - 1 maintainer
slc.docconv 1.3
Add-on for collective.documentviewer that allows web service like conversion
2 versions - Latest release: almost 9 years ago - 2 dependent repositories - 13 downloads last month - 832 stars on GitHub - 4 maintainers
Top 2.0% on pypi.org
pdftotext 2.2.2 💰
Simple PDF text extraction
15 versions - Latest release: over 2 years ago - 17 dependent packages - 257 dependent repositories - 74.7 thousand downloads last month - 819 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
pypandoc-binary 1.8.1 💰
Thin wrapper for pandoc.
6 versions - Latest release: about 2 years ago - 14 dependent packages - 37 dependent repositories - 110 thousand downloads last month - 816 stars on GitHub - 1 maintainer
cagen 0.1.0 💰
A static site generator for cmpalgorithms project
56 versions - Latest release: over 1 year ago - 272 downloads last month - 816 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pypandoc 1.8.1 💰
Thin wrapper for pandoc.
57 versions - Latest release: about 2 years ago - 311 dependent packages - 2,617 dependent repositories - 3.24 million downloads last month - 746 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
cairosvg 2.7.1 💰
A Simple SVG Converter based on Cairo
62 versions - Latest release: 10 months ago - 146 dependent packages - 2,524 dependent repositories - 1.25 million downloads last month - 719 stars on GitHub - 3 maintainers
Top 5.7% on pypi.org
mayan-edms 4.6.4
Free Open Source Electronic Document Management System
247 versions - Latest release: about 1 month ago - 3 dependent repositories - 2.53 thousand downloads last month - 611 stars on GitLab.com - 1 maintainer
Top 4.4% on pypi.org
jupyterlab-latex 3.1.0 💰
JupyterLab extension for running LaTeX
11 versions - Latest release: over 2 years ago - 23 dependent repositories - 1.57 thousand downloads last month - 603 stars on GitHub - 1 maintainer
wereadscan 0.8.7
WeRead PDF Scanner
10 versions - Latest release: over 1 year ago - 1 dependent repositories - 116 downloads last month - 552 stars on GitHub - 1 maintainer
wereadscan-html 0.1.1
WeRead HTML Scanner
2 versions - Latest release: 9 months ago - 61 downloads last month - 552 stars on GitHub - 1 maintainer
pdf.tocgen 1.3.4 💰
Automatically generate table of contents for pdf files
16 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 1.02 thousand downloads last month - 549 stars on GitHub - 1 maintainer
pdf-outline-edit 24.2.3 💰
Mini PDF outline editor
10 versions - Latest release: 4 months ago - 58 downloads last month - 549 stars on GitHub - 1 maintainer
thepipe-api 0.3.5
Automate information extraction for multimodal LLMs.
24 versions - Latest release: 9 days ago - 1.5 thousand downloads last month - 537 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
rst2pdf 0.95.1
Convert reStructured Text to PDF via ReportLab.
36 versions - Latest release: over 3 years ago - 16 dependent packages - 485 dependent repositories - 91.2 thousand downloads last month - 536 stars on GitHub - 5 maintainers
texify 0.1.9
OCR for latex images
8 versions - Latest release: 8 days ago - 1 dependent package - 3.64 thousand downloads last month - 515 stars on GitHub - 1 maintainer
rinohtype-reloaded 0.3.3 💰
The Python document processor
1 version - Latest release: over 4 years ago - 19 downloads last month - 495 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
rinohtype 0.5.4 💰
The Python document processor
15 versions - Latest release: almost 2 years ago - 19 dependent packages - 190 dependent repositories - 22.6 thousand downloads last month - 495 stars on GitHub - 1 maintainer
dummypdf 2.0.0
Generate dummy pdf files with configurable paper size and number of pages.
11 versions - Latest release: 8 months ago - 1 dependent repositories - 504 downloads last month - 493 stars on GitHub - 1 maintainer
pdfimpose 2.5.0
Perform imposition of a PDF file.
15 versions - Latest release: 3 months ago - 1 dependent package - 5 dependent repositories - 1.72 thousand downloads last month - 493 stars on GitHub - 1 maintainer
niiif-niiif 0.3.1 💰
Création et dépôt de manifestes IIIF pour des données déposées sur Nakala
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 43 downloads last month - 474 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
pyhanko 0.25.0 💰
Tools for stamping and signing PDF files
37 versions - Latest release: 19 days ago - 2 dependent packages - 40 dependent repositories - 1.09 million downloads last month - 446 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
slate 0.5.2 💰
Extract text from PDF documents easily.
4 versions - Latest release: 9 months ago - 46 dependent repositories - 1.28 thousand downloads last month - 422 stars on GitHub - 1 maintainer
pdfsyntax 0.1.1
A Python library to inspect and modify the internal structure of a PDF file
10 versions - Latest release: 14 days ago - 1 dependent repositories - 203 downloads last month - 420 stars on GitHub - 1 maintainer
andreo1 0.0.1
Read text or text in images inside a pdf and turn it into string
1 version - Latest release: almost 2 years ago - 28 downloads last month - 412 stars on GitHub - 1 maintainer
poppdf 0.17.8
A wrapper around the poppler's and pdftoimage, pdftphtml and pdftotext command line tools to extr...
31 versions - Latest release: almost 3 years ago - 1 dependent repositories - 158 downloads last month - 412 stars on GitHub - 1 maintainer
pdftotree-mercurial 1.3
Convert PDF into hOCR with text, tables, and figures being recognized and preserved. (Without skl...
4 versions - Latest release: about 1 year ago - 35 downloads last month - 404 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
pdftotree 0.5.0
Convert PDF into hOCR with text, tables, and figures being recognized and preserved.
21 versions - Latest release: over 3 years ago - 22 dependent repositories - 2.59 thousand downloads last month - 403 stars on GitHub - 3 maintainers
Top 5.3% on pypi.org
tia 0.3.0
Toolkit for integration and analysis
3 versions - Latest release: over 8 years ago - 1 dependent package - 5 dependent repositories - 576 downloads last month - 400 stars on GitHub - 1 maintainer
audible-cli 0.3.1 💰
Command line interface (cli) for the audible package.
21 versions - Latest release: 2 months ago - 705 downloads last month - 399 stars on GitHub - 1 maintainer
notebook-as-pdf-updated 0.5.1
Jupyter extension to export notebooks as PDFs
2 versions - Latest release: 8 months ago - 19 downloads last month - 365 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
notebook-as-pdf 0.5.0
Jupyter extension to export notebooks as PDFs
8 versions - Latest release: about 3 years ago - 37 dependent repositories - 7.38 thousand downloads last month - 365 stars on GitHub - 1 maintainer
notebook-as-pdf-abirami 0.5.8
Jupyter extension to export notebooks as PDFs
6 versions - Latest release: 7 months ago - 34 downloads last month - 365 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
django-easy-pdf 0.1.1
Django PDF views, the easy way
3 versions - Latest release: about 7 years ago - 267 dependent repositories - 16.9 thousand downloads last month - 362 stars on GitHub - 1 maintainer
code2pdf 1.0.0
Converts given source code into pdf file with syntax highlighting, line numbe...
3 versions - Latest release: about 8 years ago - 2 dependent repositories - 31 downloads last month - 346 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
distfit 1.8.0 💰
distfit is a python library for probability density fitting.
48 versions - Latest release: 9 days ago - 8 dependent packages - 14 dependent repositories - 42 thousand downloads last month - 339 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
py-pdf-parser 0.12.0
A tool to help extracting information from structured PDFs.
12 versions - Latest release: 7 months ago - 3 dependent repositories - 8.43 thousand downloads last month - 335 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
pdfcropmargins 2.1.2
A command-line program to crop the margins of PDF files, with many options.
61 versions - Latest release: about 1 month ago - 3 dependent packages - 9 dependent repositories - 3.87 thousand downloads last month - 323 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
django-wkhtmltopdf 3.4.0
Converts HTML to PDF using wkhtmltopdf.
21 versions - Latest release: about 2 years ago - 2 dependent packages - 190 dependent repositories - 58.1 thousand downloads last month - 323 stars on GitHub - 2 maintainers
Top 3.4% on pypi.org
mkdocs-pdf-export-plugin 0.5.10
An MkDocs plugin to export content pages as PDF files
24 versions - Latest release: over 2 years ago - 1 dependent package - 171 dependent repositories - 28 thousand downloads last month - 308 stars on GitHub - 2 maintainers
reportbro-plus-lib 1.6.3
Generate PDF and Excel reports from visually designed templates
12 versions - Latest release: over 3 years ago - 1 dependent repositories - 55 downloads last month - 306 stars on GitHub - 1 maintainer
svglibwheel 0.1
A pure-Python library for reading and converting SVG
1 version - Latest release: about 2 months ago - 11 downloads last month - 300 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
svglib 1.5.1
A pure-Python library for reading and converting SVG
22 versions - Latest release: over 1 year ago - 93 dependent packages - 1,712 dependent repositories - 912 thousand downloads last month - 300 stars on GitHub - 2 maintainers
Top 2.7% on pypi.org
mkdocs-with-pdf 0.9.3
Generate a single PDF file from MkDocs repository
28 versions - Latest release: almost 3 years ago - 4 dependent packages - 165 dependent repositories - 45.7 thousand downloads last month - 299 stars on GitHub - 1 maintainer
mkdocs-with-pdf-multiply-docs 0.0.2
Generate a single PDF file from MkDocs repository
2 versions - Latest release: 9 months ago - 13 downloads last month - 299 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
pypdfium2 4.30.0
Python bindings to PDFium
113 versions - Latest release: 16 days ago - 65 dependent packages - 546 dependent repositories - 1.22 million downloads last month - 289 stars on GitHub - 4 maintainers
Top 7.6% on pypi.org
scipdf-parser 0.52
Python parser for scientific PDF based on GROBID.
7 versions - Latest release: 7 months ago - 2 dependent packages - 1 dependent repositories - 12.5 thousand downloads last month - 287 stars on GitHub - 1 maintainer