An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "document processing" keyword

chunklet-py 2.2.0
High-fidelity context-aware chunking and interactive visualization for RAG. Advanced segmentation...
8 versions - Latest release: 16 days ago - 494 downloads last month - 62 stars on GitHub - 1 maintainer
docnav 1.0.1
AI-powered document querying with citations
2 versions - Latest release: about 2 months ago - 88 downloads last month - 1 maintainer
onnxtr 0.8.1 💰
Onnx Text Recognition (OnnxTR): docTR Onnx-Wrapper for high-performance OCR on documents.
18 versions - Latest release: about 1 month ago - 56.4 thousand downloads last month - 159 stars on GitHub - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
19 versions - Latest release: 6 months ago - 162 downloads last month - 23 stars on GitHub - 1 maintainer
lex-pdftotext 1.0.0
Extract and structure text from Brazilian legal PDF documents (PJe format)
1 version - Latest release: 3 months ago - 19 downloads last month - 1 maintainer
arkeo 0.2.6
markdown archiver betasaurus
8 versions - Latest release: 3 months ago - 185 downloads last month - 1 maintainer
pdfsmith 0.2.0
PDF to Markdown conversion with multiple backend support
1 version - Latest release: 3 months ago - 23 downloads last month - 1 stars on GitHub - 1 maintainer
textextraction 0.1.4
Extract and process text from images and PDFs
5 versions - Latest release: 11 months ago - 27 downloads last month - 0 stars on GitHub - 1 maintainer
doc2data 0.2.0
Integrated document processing with machine learning.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 123 downloads last month - 10 stars on GitHub - 1 maintainer
llama-index-readers-layoutir 0.1.1
llama-index readers LayoutIR integration
1 version - Latest release: 20 days ago - 84 downloads last month - 1 maintainer
file-parse-by-bajirao 0.1.0
Universal Document Processor for LLM Processing - extracts text, tables, numeric data, and metada...
1 version - Latest release: 3 months ago - 42 downloads last month - 1 maintainer
docling-ocr-onnxtr 0.2.1 💰
Onnx Text Recognition (OnnxTR) OCR plugin for docling
6 versions - Latest release: about 1 month ago - 27 thousand downloads last month - 11 stars on GitHub - 1 maintainer
doctr-labeler 0.2.1
A Python package for labeling and annotating documents
9 versions - Latest release: 3 months ago - 168 downloads last month - 15 stars on GitHub - 1 maintainer
intelisys 0.5.6
Intelligence/AI services for the Lifsys Enterprise with enhanced max_history_words, efficient his...
37 versions - Latest release: over 1 year ago - 218 downloads last month - 0 stars on GitHub - 1 maintainer
docu-devs-api-client 1.6.1
A client library for accessing DocuDevs API
36 versions - Latest release: 29 days ago - 1.02 thousand downloads last month - 1 maintainer
detect-row 2.0.5
Hệ thống trích xuất bảng, hàng, cột hoàn chỉnh với AI và GPU support
13 versions - Latest release: 9 months ago - 87 downloads last month - 1 maintainer
rainbow-pdf-processor 0.1.0
A powerful PDF processing tool with text extraction, table recognition, and image extraction capa...
1 version - Latest release: 11 months ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
llmgraphtransformer 0.1.0
A powerful tool for transforming documents into graph-based structures using Large Language Model...
3 versions - Latest release: about 1 year ago - 716 downloads last month - 6 stars on GitHub - 1 maintainer
bbox-align 0.2.8
A python library that reorders bounding boxes generated by OCR engines into the correct reading o...
11 versions - Latest release: 8 months ago - 860 downloads last month - 2 stars on GitHub - 1 maintainer
pydocai 0.1.0
Extract text from PDFs using pypdfium2 with OCR fallback via pytesseract
1 version - Latest release: about 1 month ago
docutray 0.1.0
Python library for the DocuTray API
1 version - Latest release: about 1 month ago
py-document-chunker 0.3.0 removed
A state-of-the-art Python package for advanced text segmentation (chunking).
2 versions - Latest release: 6 months ago - 265 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords