An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "documents" keyword

View the packages on the pypi.org package registry that are tagged with the "documents" keyword.

stencila_types 2.0.0b2
Python types for Stencila
7 versions - Latest release: 9 months ago - 217 downloads last month - 820 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
stencila 2.0.0b2
Python SDK for Stencila
12 versions - Latest release: 9 months ago - 1 dependent repositories - 2.3 thousand downloads last month - 820 stars on GitHub - 1 maintainer
stencila_plugin 2.0.0b3
Library for building Stencila Plugins
12 versions - Latest release: 9 months ago - 312 downloads last month - 820 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
cerberus 1.3.7 💰
Lightweight, extensible schema and data validation tool for Pythondictionaries.
29 versions - Latest release: 4 months ago - 140 dependent packages - 2,028 dependent repositories - 3.63 million downloads last month - 3,114 stars on GitHub - 2 maintainers
etherpump 0.0.20 💰
Pumping text from etherpads into publications
20 versions - Latest release: about 4 years ago - 1 dependent repositories - 518 downloads last month - 17,317 stars on GitHub - 2 maintainers
Top 1.6% on pypi.org
pyocr 0.8.5
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc)
30 versions - Latest release: over 1 year ago - 5 dependent packages - 255 dependent repositories - 20.5 thousand downloads last month - 7,831 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
svglib 1.5.1
A pure-Python library for reading and converting SVG
22 versions - Latest release: over 2 years ago - 93 dependent packages - 1,712 dependent repositories - 1.44 million downloads last month - 315 stars on GitHub - 2 maintainers
wdoc 3.0.2
A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (ur...
99 versions - Latest release: about 16 hours ago - 3.35 thousand downloads last month - 46 stars on GitHub - 1 maintainer
papermerge-core 2.1.5
Open source document management system for digital archives
77 versions - Latest release: about 2 years ago - 4 dependent repositories - 1.55 thousand downloads last month - 347 stars on GitHub - 1 maintainer
stencila-pyla 0.3.1
Python interpreter for executable documents
9 versions - Latest release: over 4 years ago - 1 dependent repositories - 240 downloads last month - 2 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
jsonpath-ng 1.7.0
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...
10 versions - Latest release: 6 months ago - 214 dependent packages - 1,548 dependent repositories - 36.5 million downloads last month - 647 stars on GitHub - 2 maintainers
docoskin 0.1.0
"Onion-skin" visual differences between a reference document image and a scanned copy
1 version - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 1 maintainer
kindlepy 1.0.0
CLI tool for mailing your documents to your kindle device.
1 version - Latest release: over 8 years ago - 1 dependent repositories - 25 downloads last month - 2 stars on GitHub - 1 maintainer
index 0.5.0.dev1
Index System
3 versions - Latest release: 2 months ago - 36 dependent repositories - 864 downloads last month - 0 stars on GitHub - 1 maintainer
unoserver-fork 1.3.1
A server for file conversions with Libre Office. With function to update index
2 versions - Latest release: over 2 years ago - 31 downloads last month - 0 stars on GitHub - 1 maintainer
similar-documents 0.1.4
Generate similarity scores for documents from cli
6 versions - Latest release: almost 4 years ago - 1 dependent repositories - 240 downloads last month - 8 stars on GitHub - 1 maintainer
word2html 0.3.0
A quick and dirty script to convert a Word (docx) document to html.
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 208 downloads last month - 53 stars on GitHub - 1 maintainer
nr-documents-records-model-builder 1.0.0
OARepo model builder extension for NR document records
2 versions - Latest release: over 2 years ago - 85 downloads last month - 1 maintainer
zen_document_parser 0.11
A library for parsing various government documents as well as general PDFs
1 version - Latest release: about 9 years ago - 2 dependent repositories - 29 downloads last month - 2 stars on GitHub - 1 maintainer
docling-sdg 0.1.3
Docling for Synthetic Data Generation (SDG) provides a set of tools to create artificial data fro...
2 versions - Latest release: 23 days ago - 300 downloads last month - 6 stars on GitHub - 1 maintainer
tei-chunker 0.1.0
Hierarchical document chunking for TEI XML documents
1 version - Latest release: 2 months ago - 53 downloads last month - 1 maintainer
ffmulticonverter 1.7.1
GUI File Format Converter
13 versions - Latest release: over 1 year ago - 2 dependent repositories - 56 downloads last month - 1 maintainer
doxx 0.9.4
A Simple, Flexible Text Templating, Build, & Project Distribution System
5 versions - Latest release: about 10 years ago - 2 dependent repositories - 282 downloads last month - 1 maintainer
oldp 0.8.0
Open Legal Data Platform
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 99 downloads last month - 109 stars on GitHub - 1 maintainer
llama-index-readers-docling 0.3.2
llama-index readers docling integration
5 versions - Latest release: about 1 month ago - 5.73 thousand downloads last month - 27,013 stars on GitHub - 1 maintainer
unstructured-expanded 0.16.11
Expansion to the unstructured package, adding support for image extraction.
9 versions - Latest release: 4 months ago - 317 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
unoserver 3.0.1
A server for file conversions with Libre Office
30 versions - Latest release: 6 months ago - 9 dependent repositories - 36 thousand downloads last month - 557 stars on GitHub - 2 maintainers
unoserver-appx 1.0.1
A server for file conversions with Libre Office
2 versions - Latest release: 10 months ago - 115 downloads last month - 557 stars on GitHub - 1 maintainer
redacted-py 1.0.8
Redacting classified documents
4 versions - Latest release: 3 months ago - 228 downloads last month - 3 stars on GitHub - 1 maintainer
invenio-documents 0.1.0.post1
Invenio module that adds filesystem abstraction.
3 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 82 downloads last month - 1 stars on GitHub - 4 maintainers
cdpdumpingutils 0.2.1
Download all your courses filles from cahier de prepa
2 versions - Latest release: about 2 years ago - 108 downloads last month - 13 stars on GitHub - 1 maintainer
pyugt 1.0.10
Universal Game Translator from on-screen text in Python
32 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.21 thousand downloads last month - 31 stars on GitHub - 1 maintainer
docling 2.29.0
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
95 versions - Latest release: 9 days ago - 405 thousand downloads last month - 78 stars on GitHub - 1 maintainer
hades-nlp 0.1.2
Homologous Automated Document Exploration and Summarization - A powerful tool for comparing simil...
3 versions - Latest release: over 1 year ago - 124 downloads last month - 8 stars on GitHub - 2 maintainers
classify-bills 1.0.1
Automatically sort and archive PDF bills and statements
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 59 downloads last month - 1 stars on GitHub - 1 maintainer
extended-docling 2.12.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
1 version - Latest release: 4 months ago - 74 downloads last month - 26,056 stars on GitHub - 1 maintainer
preprocess-docs 0.0.6
An open source document preprocessor for AI.
6 versions - Latest release: about 1 year ago - 199 downloads last month - 0 stars on GitHub - 1 maintainer
jsonpath-ig 1.5.3
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 59 downloads last month - 589 stars on GitHub - 1 maintainer
jsonpath-ng-i 1.0.3
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...
4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 111 downloads last month - 590 stars on GitHub - 1 maintainer
ffconverter 2.4.6
File Format Converter with Qt GUI
22 versions - Latest release: 6 months ago - 275 downloads last month - 5 stars on GitHub - 1 maintainer
svglibwheel 0.1
A pure-Python library for reading and converting SVG
1 version - Latest release: about 1 year ago - 28 downloads last month - 338 stars on GitHub - 1 maintainer
sleepyconvert 1.0.3
Converts data files, images and documents to different formats
7 versions - Latest release: 8 days ago - 552 downloads last month - 0 stars on GitHub - 1 maintainer
docman 0.0.6
Document Manager
6 versions - Latest release: 8 months ago - 178 downloads last month - 0 stars on GitHub - 1 maintainer
dsw-tdk 4.17.0 💰
Data Stewardship Wizard Template Development Toolkit
98 versions - Latest release: 18 days ago - 1 dependent repositories - 2.42 thousand downloads last month - 4 stars on GitHub - 1 maintainer
scribd-downloader 1.3.1
Download documents, books and audiobooks off Scribd
1 version - Latest release: about 2 years ago - 2 dependent repositories - 48 downloads last month - 1 maintainer
docling-google-ocr 2.13.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
2 versions - Latest release: 3 months ago - 117 downloads last month - 26,056 stars on GitHub - 1 maintainer
doctoolsllm 0.99.0
(Now winston_doc) A perfect AI powered RAG for document query and summary. Supports ~all LLM and ...
21 versions - Latest release: 9 months ago - 534 downloads last month - 46 stars on GitHub - 1 maintainer
llama-index-node-parser-docling 0.3.1
llama-index node_parser docling integration
4 versions - Latest release: 2 months ago - 3.86 thousand downloads last month - 26,056 stars on GitHub - 1 maintainer
llama-index-readers-preprocess 0.3.0
llama-index readers preprocess integration
8 versions - Latest release: 5 months ago - 210 downloads last month - 4 stars on GitHub - 1 maintainer
scrapontologies 1.1.0 💰
Library for extracting schemas and building ontologies from documents using LLM
2 versions - Latest release: 6 months ago - 95 downloads last month - 18,931 stars on GitHub - 2 maintainers
expose-text 0.1.6
A Python module that exposes text for modification in multiple file types.
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 197 downloads last month - 17 stars on GitHub - 1 maintainer
docsvault 0.1.4
Web app used to securely version your documents on git
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 179 downloads last month - 1 maintainer
dedoc 2.3.2
Extract content and logical tree structure from textual documents
29 versions - Latest release: 4 months ago - 1.43 thousand downloads last month - 226 stars on GitHub - 1 maintainer
nr-documents-records 1.0.0
NR documents data model
2 versions - Latest release: over 2 years ago - 95 downloads last month - 1 maintainer
fauxdoc 1.1.0
Python package for generating fake (faux) record or document (doc) data conforming to bespoke req...
2 versions - Latest release: about 2 years ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
boxdetect 1.0.2 💰
boxdetect is a Python package based on OpenCV which allows you to easily detect rectangular shape...
5 versions - Latest release: about 2 years ago - 1 dependent package - 3 dependent repositories - 50 thousand downloads last month - 109 stars on GitHub - 1 maintainer
mimeogram 1.2
Exchange of file collections with LLMs.
15 versions - Latest release: 20 days ago - 673 downloads last month - 1 stars on GitHub - 1 maintainer
blazingdocs 1.0.1
BlazingDocs Python client
1 version - Latest release: over 3 years ago - 1 dependent repositories - 49 downloads last month - 2 stars on GitHub - 1 maintainer
docdump 1.0.4
A package to extract text from common document types.
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 0 stars on GitHub - 1 maintainer
epaper 0.0.0
A simple and productive personal documents library
1 version - Latest release: over 1 year ago - 2 dependent repositories - 2 stars on GitHub - 1 maintainer
pwr 2.6
This program helps you write html pages like documents in word processors
3 versions - Latest release: over 11 years ago - 4 dependent repositories - 89 downloads last month - 0 stars on GitHub - 1 maintainer
frat 2.0.7
Fast Rectangle Annotation Tool
6 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 9 stars on GitHub - 1 maintainer
draftable-compare-api 1.4.2
Client library for the Draftable document comparison API
19 versions - Latest release: 3 months ago - 2 dependent repositories - 3.79 thousand downloads last month - 9 stars on GitHub - 2 maintainers
nr-documents-app 1.0.8
Application package for NR documents site
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 399 downloads last month - 0 stars on GitHub - 1 maintainer
peslac 0.1.4
A Python package for the Peslac API
5 versions - Latest release: 3 months ago - 158 downloads last month - 0 stars on GitHub - 1 maintainer
unstructured-platform 0.4.3
Python SDK for the Unstructured Platform API
4 versions - Latest release: 3 months ago - 176 downloads last month - 1 maintainer
snaketex 0.1.3
A LaTeX template system for large and multi-user projects.
5 versions - Latest release: almost 9 years ago - 1 dependent repositories - 83 downloads last month - 1 stars on GitHub - 1 maintainer
marshmallow-br 0.1.1
An unofficial extension to Marshmallow fields and validators for Brazilian documents
3 versions - Latest release: over 2 years ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
srsparser 1.4.9
A library that translates semi-structured documents into a structured form and contains natural l...
64 versions - Latest release: almost 3 years ago - 1 dependent repositories - 768 downloads last month - 2 stars on GitHub - 1 maintainer
edocuments 1.1.0
eDocuments - a simple and productive personal documents library
10 versions - Latest release: almost 7 years ago - 2 dependent repositories - 440 downloads last month - 2 stars on GitHub - 1 maintainer
quickdocs 1.6.3
Creates HTML docs from a project's readme and sphinx-apidoc.
10 versions - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 279 downloads last month - 1 stars on GitHub - 1 maintainer
modoboa-pdfcredentials 1.5.0 💰
Generate PDF documents containing user credentials
15 versions - Latest release: almost 3 years ago - 2 dependent repositories - 309 downloads last month - 8 stars on GitHub - 1 maintainer
iddt 0.1.13
Internet Document Discovery Tool
12 versions - Latest release: over 9 years ago - 2 dependent repositories - 174 downloads last month - 0 stars on GitHub - 1 maintainer
pythonrlsa 1.0.0
Python Run Length Smoothing Algorithm for Document Processing
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 241 downloads last month - 28 stars on GitHub - 1 maintainer
doc-curation 0.1.20 💰
A package for curating doc file collections, with ability to sync with youtube and archive.org do...
28 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 587 downloads last month - 7 stars on GitHub - 1 maintainer
gradio-test2 0.0.2
This is a test component
1 version - Latest release: over 1 year ago - 62 downloads last month - 1 maintainer
libreserver 0.1.1
A server for file conversions with Libre Office
2 versions - Latest release: about 1 year ago - 124 downloads last month - 0 stars on GitHub - 1 maintainer
mongoengine_fuel 1.0.3
Factory for MongoDB documents created with mongoengine
4 versions - Latest release: over 1 year ago - 2 dependent repositories - 84 downloads last month - 25 stars on GitHub - 1 maintainer
dict-curation 0.0.3
A package for curating dictionaries (esp in babylon and stardict formats).
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 102 downloads last month - 1 maintainer
gdoc-down 0.0.10
Download Google documents to files
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 208 downloads last month - 15 stars on GitHub - 2 maintainers
docowling 1.0.17
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...
17 versions - Latest release: 3 months ago - 525 downloads last month - 1 stars on GitHub - 1 maintainer
ashtadhyayi-data 0.0.3 💰
A package for curating doc file collections, with ability to sync with youtube and archive.org do...
2 versions - Latest release: over 2 years ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
unstructured-platform-sdk 0.1.0 removed
Python SDK for the Unstructured Platform API
1 version - Latest release: 3 months ago - 1 maintainer
whintpy 1.0.1
WhintPy is a pure Python-based solution to manage documents with or without authentication access.
3 versions - Latest release: 10 months ago - 94 downloads last month - 1 maintainer
scrapontology 💰
Library for extracting schemas and building ontologies from documents using LLM
4 versions - 222 downloads last month - 12,938 stars on GitHub - 1 maintainer
smog 0.0.4 removed
simple media organizer
4 versions - Latest release: over 2 years ago - 19 downloads last month - 1 stars on GitHub - 1 maintainer