pypi.org "documents" keyword
View the packages on the pypi.org package registry that are tagged with the "documents" keyword.
stencila_types 2.0.0b2
Python types for Stencila7 versions - Latest release: 9 months ago - 217 downloads last month - 820 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
12 versions - Latest release: 9 months ago - 1 dependent repositories - 2.3 thousand downloads last month - 820 stars on GitHub - 1 maintainer
stencila 2.0.0b2
Python SDK for Stencila12 versions - Latest release: 9 months ago - 1 dependent repositories - 2.3 thousand downloads last month - 820 stars on GitHub - 1 maintainer
stencila_plugin 2.0.0b3
Library for building Stencila Plugins12 versions - Latest release: 9 months ago - 312 downloads last month - 820 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
29 versions - Latest release: 4 months ago - 140 dependent packages - 2,028 dependent repositories - 3.63 million downloads last month - 3,114 stars on GitHub - 2 maintainers
cerberus 1.3.7 💰
Lightweight, extensible schema and data validation tool for Pythondictionaries.29 versions - Latest release: 4 months ago - 140 dependent packages - 2,028 dependent repositories - 3.63 million downloads last month - 3,114 stars on GitHub - 2 maintainers
etherpump 0.0.20 💰
Pumping text from etherpads into publications20 versions - Latest release: about 4 years ago - 1 dependent repositories - 518 downloads last month - 17,317 stars on GitHub - 2 maintainers
Top 1.6% on pypi.org
30 versions - Latest release: over 1 year ago - 5 dependent packages - 255 dependent repositories - 20.5 thousand downloads last month - 7,831 stars on GitHub - 1 maintainer
pyocr 0.8.5
A Python wrapper for OCR engines (Tesseract, Cuneiform, etc)30 versions - Latest release: over 1 year ago - 5 dependent packages - 255 dependent repositories - 20.5 thousand downloads last month - 7,831 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
22 versions - Latest release: over 2 years ago - 93 dependent packages - 1,712 dependent repositories - 1.44 million downloads last month - 315 stars on GitHub - 2 maintainers
svglib 1.5.1
A pure-Python library for reading and converting SVG22 versions - Latest release: over 2 years ago - 93 dependent packages - 1,712 dependent repositories - 1.44 million downloads last month - 315 stars on GitHub - 2 maintainers
wdoc 3.0.2
A perfect AI powered RAG for document query and summary. Supports ~all LLM and ~all filetypes (ur...99 versions - Latest release: about 16 hours ago - 3.35 thousand downloads last month - 46 stars on GitHub - 1 maintainer
papermerge-core 2.1.5
Open source document management system for digital archives77 versions - Latest release: about 2 years ago - 4 dependent repositories - 1.55 thousand downloads last month - 347 stars on GitHub - 1 maintainer
stencila-pyla 0.3.1
Python interpreter for executable documents9 versions - Latest release: over 4 years ago - 1 dependent repositories - 240 downloads last month - 2 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
10 versions - Latest release: 6 months ago - 214 dependent packages - 1,548 dependent repositories - 36.5 million downloads last month - 647 stars on GitHub - 2 maintainers
jsonpath-ng 1.7.0
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...10 versions - Latest release: 6 months ago - 214 dependent packages - 1,548 dependent repositories - 36.5 million downloads last month - 647 stars on GitHub - 2 maintainers
docoskin 0.1.0
"Onion-skin" visual differences between a reference document image and a scanned copy1 version - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 2 stars on GitHub - 1 maintainer
kindlepy 1.0.0
CLI tool for mailing your documents to your kindle device.1 version - Latest release: over 8 years ago - 1 dependent repositories - 25 downloads last month - 2 stars on GitHub - 1 maintainer
index 0.5.0.dev1
Index System3 versions - Latest release: 2 months ago - 36 dependent repositories - 864 downloads last month - 0 stars on GitHub - 1 maintainer
unoserver-fork 1.3.1
A server for file conversions with Libre Office. With function to update index2 versions - Latest release: over 2 years ago - 31 downloads last month - 0 stars on GitHub - 1 maintainer
similar-documents 0.1.4
Generate similarity scores for documents from cli6 versions - Latest release: almost 4 years ago - 1 dependent repositories - 240 downloads last month - 8 stars on GitHub - 1 maintainer
word2html 0.3.0
A quick and dirty script to convert a Word (docx) document to html.5 versions - Latest release: over 3 years ago - 1 dependent repositories - 208 downloads last month - 53 stars on GitHub - 1 maintainer
nr-documents-records-model-builder 1.0.0
OARepo model builder extension for NR document records2 versions - Latest release: over 2 years ago - 85 downloads last month - 1 maintainer
zen_document_parser 0.11
A library for parsing various government documents as well as general PDFs1 version - Latest release: about 9 years ago - 2 dependent repositories - 29 downloads last month - 2 stars on GitHub - 1 maintainer
docling-sdg 0.1.3
Docling for Synthetic Data Generation (SDG) provides a set of tools to create artificial data fro...2 versions - Latest release: 23 days ago - 300 downloads last month - 6 stars on GitHub - 1 maintainer
tei-chunker 0.1.0
Hierarchical document chunking for TEI XML documents1 version - Latest release: 2 months ago - 53 downloads last month - 1 maintainer
ffmulticonverter 1.7.1
GUI File Format Converter13 versions - Latest release: over 1 year ago - 2 dependent repositories - 56 downloads last month - 1 maintainer
doxx 0.9.4
A Simple, Flexible Text Templating, Build, & Project Distribution System5 versions - Latest release: about 10 years ago - 2 dependent repositories - 282 downloads last month - 1 maintainer
oldp 0.8.0
Open Legal Data Platform2 versions - Latest release: about 6 years ago - 1 dependent repositories - 99 downloads last month - 109 stars on GitHub - 1 maintainer
llama-index-readers-docling 0.3.2
llama-index readers docling integration5 versions - Latest release: about 1 month ago - 5.73 thousand downloads last month - 27,013 stars on GitHub - 1 maintainer
unstructured-expanded 0.16.11
Expansion to the unstructured package, adding support for image extraction.9 versions - Latest release: 4 months ago - 317 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
30 versions - Latest release: 6 months ago - 9 dependent repositories - 36 thousand downloads last month - 557 stars on GitHub - 2 maintainers
unoserver 3.0.1
A server for file conversions with Libre Office30 versions - Latest release: 6 months ago - 9 dependent repositories - 36 thousand downloads last month - 557 stars on GitHub - 2 maintainers
unoserver-appx 1.0.1
A server for file conversions with Libre Office2 versions - Latest release: 10 months ago - 115 downloads last month - 557 stars on GitHub - 1 maintainer
redacted-py 1.0.8
Redacting classified documents4 versions - Latest release: 3 months ago - 228 downloads last month - 3 stars on GitHub - 1 maintainer
invenio-documents 0.1.0.post1
Invenio module that adds filesystem abstraction.3 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 82 downloads last month - 1 stars on GitHub - 4 maintainers
cdpdumpingutils 0.2.1
Download all your courses filles from cahier de prepa2 versions - Latest release: about 2 years ago - 108 downloads last month - 13 stars on GitHub - 1 maintainer
pyugt 1.0.10
Universal Game Translator from on-screen text in Python32 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.21 thousand downloads last month - 31 stars on GitHub - 1 maintainer
docling 2.29.0
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...95 versions - Latest release: 9 days ago - 405 thousand downloads last month - 78 stars on GitHub - 1 maintainer
hades-nlp 0.1.2
Homologous Automated Document Exploration and Summarization - A powerful tool for comparing simil...3 versions - Latest release: over 1 year ago - 124 downloads last month - 8 stars on GitHub - 2 maintainers
classify-bills 1.0.1
Automatically sort and archive PDF bills and statements2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 59 downloads last month - 1 stars on GitHub - 1 maintainer
extended-docling 2.12.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...1 version - Latest release: 4 months ago - 74 downloads last month - 26,056 stars on GitHub - 1 maintainer
preprocess-docs 0.0.6
An open source document preprocessor for AI.6 versions - Latest release: about 1 year ago - 199 downloads last month - 0 stars on GitHub - 1 maintainer
jsonpath-ig 1.5.3
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 59 downloads last month - 589 stars on GitHub - 1 maintainer
jsonpath-ng-i 1.0.3
A final implementation of JSONPath for Python that aims to be standard compliant, including arith...4 versions - Latest release: almost 4 years ago - 1 dependent repositories - 111 downloads last month - 590 stars on GitHub - 1 maintainer
ffconverter 2.4.6
File Format Converter with Qt GUI22 versions - Latest release: 6 months ago - 275 downloads last month - 5 stars on GitHub - 1 maintainer
svglibwheel 0.1
A pure-Python library for reading and converting SVG1 version - Latest release: about 1 year ago - 28 downloads last month - 338 stars on GitHub - 1 maintainer
sleepyconvert 1.0.3
Converts data files, images and documents to different formats7 versions - Latest release: 8 days ago - 552 downloads last month - 0 stars on GitHub - 1 maintainer
docman 0.0.6
Document Manager6 versions - Latest release: 8 months ago - 178 downloads last month - 0 stars on GitHub - 1 maintainer
dsw-tdk 4.17.0 💰
Data Stewardship Wizard Template Development Toolkit98 versions - Latest release: 18 days ago - 1 dependent repositories - 2.42 thousand downloads last month - 4 stars on GitHub - 1 maintainer
scribd-downloader 1.3.1
Download documents, books and audiobooks off Scribd1 version - Latest release: about 2 years ago - 2 dependent repositories - 48 downloads last month - 1 maintainer
docling-google-ocr 2.13.1
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...2 versions - Latest release: 3 months ago - 117 downloads last month - 26,056 stars on GitHub - 1 maintainer
doctoolsllm 0.99.0
(Now winston_doc) A perfect AI powered RAG for document query and summary. Supports ~all LLM and ...21 versions - Latest release: 9 months ago - 534 downloads last month - 46 stars on GitHub - 1 maintainer
llama-index-node-parser-docling 0.3.1
llama-index node_parser docling integration4 versions - Latest release: 2 months ago - 3.86 thousand downloads last month - 26,056 stars on GitHub - 1 maintainer
llama-index-readers-preprocess 0.3.0
llama-index readers preprocess integration8 versions - Latest release: 5 months ago - 210 downloads last month - 4 stars on GitHub - 1 maintainer
scrapontologies 1.1.0 💰
Library for extracting schemas and building ontologies from documents using LLM2 versions - Latest release: 6 months ago - 95 downloads last month - 18,931 stars on GitHub - 2 maintainers
expose-text 0.1.6
A Python module that exposes text for modification in multiple file types.4 versions - Latest release: over 4 years ago - 1 dependent repositories - 197 downloads last month - 17 stars on GitHub - 1 maintainer
docsvault 0.1.4
Web app used to securely version your documents on git6 versions - Latest release: over 4 years ago - 1 dependent repositories - 179 downloads last month - 1 maintainer
dedoc 2.3.2
Extract content and logical tree structure from textual documents29 versions - Latest release: 4 months ago - 1.43 thousand downloads last month - 226 stars on GitHub - 1 maintainer
nr-documents-records 1.0.0
NR documents data model2 versions - Latest release: over 2 years ago - 95 downloads last month - 1 maintainer
fauxdoc 1.1.0
Python package for generating fake (faux) record or document (doc) data conforming to bespoke req...2 versions - Latest release: about 2 years ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
5 versions - Latest release: about 2 years ago - 1 dependent package - 3 dependent repositories - 50 thousand downloads last month - 109 stars on GitHub - 1 maintainer
boxdetect 1.0.2 💰
boxdetect is a Python package based on OpenCV which allows you to easily detect rectangular shape...5 versions - Latest release: about 2 years ago - 1 dependent package - 3 dependent repositories - 50 thousand downloads last month - 109 stars on GitHub - 1 maintainer
mimeogram 1.2
Exchange of file collections with LLMs.15 versions - Latest release: 20 days ago - 673 downloads last month - 1 stars on GitHub - 1 maintainer
blazingdocs 1.0.1
BlazingDocs Python client1 version - Latest release: over 3 years ago - 1 dependent repositories - 49 downloads last month - 2 stars on GitHub - 1 maintainer
docdump 1.0.4
A package to extract text from common document types.5 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 0 stars on GitHub - 1 maintainer
epaper 0.0.0
A simple and productive personal documents library1 version - Latest release: over 1 year ago - 2 dependent repositories - 2 stars on GitHub - 1 maintainer
pwr 2.6
This program helps you write html pages like documents in word processors3 versions - Latest release: over 11 years ago - 4 dependent repositories - 89 downloads last month - 0 stars on GitHub - 1 maintainer
frat 2.0.7
Fast Rectangle Annotation Tool6 versions - Latest release: about 2 years ago - 1 dependent repositories - 177 downloads last month - 9 stars on GitHub - 1 maintainer
draftable-compare-api 1.4.2
Client library for the Draftable document comparison API19 versions - Latest release: 3 months ago - 2 dependent repositories - 3.79 thousand downloads last month - 9 stars on GitHub - 2 maintainers
nr-documents-app 1.0.8
Application package for NR documents site9 versions - Latest release: over 2 years ago - 1 dependent repositories - 399 downloads last month - 0 stars on GitHub - 1 maintainer
peslac 0.1.4
A Python package for the Peslac API5 versions - Latest release: 3 months ago - 158 downloads last month - 0 stars on GitHub - 1 maintainer
unstructured-platform 0.4.3
Python SDK for the Unstructured Platform API4 versions - Latest release: 3 months ago - 176 downloads last month - 1 maintainer
snaketex 0.1.3
A LaTeX template system for large and multi-user projects.5 versions - Latest release: almost 9 years ago - 1 dependent repositories - 83 downloads last month - 1 stars on GitHub - 1 maintainer
marshmallow-br 0.1.1
An unofficial extension to Marshmallow fields and validators for Brazilian documents3 versions - Latest release: over 2 years ago - 93 downloads last month - 0 stars on GitHub - 1 maintainer
srsparser 1.4.9
A library that translates semi-structured documents into a structured form and contains natural l...64 versions - Latest release: almost 3 years ago - 1 dependent repositories - 768 downloads last month - 2 stars on GitHub - 1 maintainer
edocuments 1.1.0
eDocuments - a simple and productive personal documents library10 versions - Latest release: almost 7 years ago - 2 dependent repositories - 440 downloads last month - 2 stars on GitHub - 1 maintainer
quickdocs 1.6.3
Creates HTML docs from a project's readme and sphinx-apidoc.10 versions - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 279 downloads last month - 1 stars on GitHub - 1 maintainer
modoboa-pdfcredentials 1.5.0 💰
Generate PDF documents containing user credentials15 versions - Latest release: almost 3 years ago - 2 dependent repositories - 309 downloads last month - 8 stars on GitHub - 1 maintainer
iddt 0.1.13
Internet Document Discovery Tool12 versions - Latest release: over 9 years ago - 2 dependent repositories - 174 downloads last month - 0 stars on GitHub - 1 maintainer
pythonrlsa 1.0.0
Python Run Length Smoothing Algorithm for Document Processing3 versions - Latest release: about 4 years ago - 1 dependent repositories - 241 downloads last month - 28 stars on GitHub - 1 maintainer
doc-curation 0.1.20 💰
A package for curating doc file collections, with ability to sync with youtube and archive.org do...28 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 587 downloads last month - 7 stars on GitHub - 1 maintainer
gradio-test2 0.0.2
This is a test component1 version - Latest release: over 1 year ago - 62 downloads last month - 1 maintainer
libreserver 0.1.1
A server for file conversions with Libre Office2 versions - Latest release: about 1 year ago - 124 downloads last month - 0 stars on GitHub - 1 maintainer
mongoengine_fuel 1.0.3
Factory for MongoDB documents created with mongoengine4 versions - Latest release: over 1 year ago - 2 dependent repositories - 84 downloads last month - 25 stars on GitHub - 1 maintainer
dict-curation 0.0.3
A package for curating dictionaries (esp in babylon and stardict formats).2 versions - Latest release: about 4 years ago - 1 dependent repositories - 102 downloads last month - 1 maintainer
gdoc-down 0.0.10
Download Google documents to files5 versions - Latest release: over 4 years ago - 1 dependent repositories - 208 downloads last month - 15 stars on GitHub - 2 maintainers
docowling 1.0.17
SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for power...17 versions - Latest release: 3 months ago - 525 downloads last month - 1 stars on GitHub - 1 maintainer
ashtadhyayi-data 0.0.3 💰
A package for curating doc file collections, with ability to sync with youtube and archive.org do...2 versions - Latest release: over 2 years ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
unstructured-platform-sdk 0.1.0 removed
Python SDK for the Unstructured Platform API1 version - Latest release: 3 months ago - 1 maintainer
whintpy 1.0.1
WhintPy is a pure Python-based solution to manage documents with or without authentication access.3 versions - Latest release: 10 months ago - 94 downloads last month - 1 maintainer
scrapontology 💰
Library for extracting schemas and building ontologies from documents using LLM4 versions - 222 downloads last month - 12,938 stars on GitHub - 1 maintainer
smog 0.0.4 removed
simple media organizer4 versions - Latest release: over 2 years ago - 19 downloads last month - 1 stars on GitHub - 1 maintainer
Related Keywords
pdf
22
python
20
docx
16
document
14
html
13
ai
10
convert
8
nlp
7
llm
7
search
7
unoconv
6
conversion
6
tables
6
markdown
6
pptx
6
libreoffice
5
ocr
5
doc
5
openai
5
text
5
images
5
api
5
json
4
layout model
4
pdf-to-text
4
xlsx
4
natural language processing
4
data
4
segmentation
4
langchain
4
pdf-to-json
4
pdf-converter
4
document-parsing
4
document-parser
4
template
4
table structure
4
uno
4
machine learning
4
executable
4
docling
4
table former
4
question-answering
3
OCR
3
library
3
unstructured
3
scanned-documents
3
filter
3
jsonpath
3
path
3
query
3
xpath
3
scan
3
opencv
3
internet-archive
3
books
3
artificial intelligence
3
pdf-generation
3
word
3
rag
3
PDF
3
python3
3
SDK
3
programmable
3
interactive
3
reproducible
3
AI
3
parser
2
webscraping
2
templates
2
automated-scraper
2
gpt-3
2
imagemagick
2
ffmpeg
2
gpt-4
2
llama3
2
video
2
machine-learning
2
sc
2
scraping-python
2
audio
2
scrapingweb
2
chunking
2
extension
2
file format
2
odt
2
xml
2
visualization
2
preprocess
2
index
2
personal
2
productive
2
simple
2
converter
2
sdk
2
text extraction
2
cli
2
command-line
2
gpt
2
graph
2
knowledge graph
2