pypi.org "pdf-processing" keyword
visual-rag-toolkit 0.2.0
End-to-end visual document retrieval with ColPali, featuring two-stage pooling for scalable search7 versions - Latest release: 26 days ago - 323 downloads last month - 1 stars on GitHub - 1 maintainer
pdftl 0.11.1
A capable CLI tool for PDF manipulation inspired by pdftk.17 versions - Latest release: 27 days ago - 499 downloads last month - 1 stars on GitHub - 1 maintainer
journal-vetter 1.0.1
uses tokenized & carefully summarized journals to query an LLM for analysis, based also on user-d...2 versions - Latest release: 7 months ago - 13 downloads last month - 0 stars on GitHub - 1 maintainer
mcp-pdf 2.0.14
Secure FastMCP server for comprehensive PDF processing - text extraction, OCR, table extraction, ...20 versions - Latest release: 19 days ago - 348 downloads last month - 0 stars on GitHub - 1 maintainer
pdfmcp-tools 0.1.1
MCP server for comprehensive PDF processing with 18 specialized tools2 versions - Latest release: 6 months ago - 11 downloads last month - 1 stars on GitHub - 1 maintainer
pdf-snip 0.0.3
A package to help manage pdf pages, images and their conversions during different NLP, CV or othe...2 versions - Latest release: about 1 year ago - 17 downloads last month - 3 stars on GitHub - 1 maintainer
nutrient-dws 3.0.0
Python client library for Nutrient Document Web Services API4 versions - Latest release: 21 days ago - 128 downloads last month - 54 stars on GitHub - 1 maintainer
flockparser 1.0.9
Distributed document RAG system with intelligent GPU/CPU orchestration8 versions - Latest release: 4 months ago - 96 downloads last month - 3 stars on GitHub - 1 maintainer
vlense 0.1.4
A Python package to extract text from images and PDFs using Vision Language Model (VLM).5 versions - Latest release: over 1 year ago - 34 downloads last month - 1 stars on GitHub - 1 maintainer
peslac 0.1.4
A Python package for the Peslac API5 versions - Latest release: about 1 year ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
preocr 1.4.0
A fast, layout-aware OCR decision engine for document processing pipelines. Detects whether files...26 versions - Latest release: 23 days ago - 1 maintainer
aikitx 1.0.0
A comprehensive GUI toolkit for Large Language Models (LLMs) with GGUF support, document processi...1 version - Latest release: 8 months ago - 56 downloads last month - 0 stars on GitHub - 1 maintainer
papermage 0.20.0
Papermage. Casting magic over scientific PDFs.8 versions - Latest release: almost 2 years ago - 158 downloads last month - 786 stars on GitHub - 3 maintainers
docling-extractor 1.0.0
Production-grade document extraction with intelligent fallback chain: Docling -> PyMuPDF -> pdfpl...1 version - Latest release: 2 months ago - 1 maintainer
fileseek 0.1.3
FileSeek – AI-Powered Local Document Archive&Search3 versions - Latest release: about 1 year ago - 23 downloads last month - 1 maintainer
Related Keywords
ocr
8
document-processing
6
python
4
api
4
llm
4
pdf
4
text-extraction
4
mcp
3
nlp
3
cli
3
machine-learning
3
ai
3
rag
2
image-processing
2
fastmcp
2
semantic-search
2
computer-vision
2
ocr-optimization
1
layout-analysis
1
pdf-analysis
1
pre-ocr
1
ocr-routing
1
document-ai
1
gguf
1
transformers
1
ocr-decision
1
pdf-merger
1
pdf-splitter
1
pdf-ocr
1
image-ocr
1
document-ocr
1
file-processing
1
remote-file-processing
1
remote-file
1
nodejs
1
tools
1
documents
1
peslac
1
vision-language-model
1
workload-orchestration
1
web-ui
1
document-similarity
1
text-analysis
1
file-archival
1
document-indexing
1
offline-search
1
local-search
1
file-monitoring
1
vector-search
1
document-management
1
databricks
1
clinical-trials
1
data-engineering
1
docling
1
document-extraction
1
scientific-papers
1
natural-language-processing
1
multimodal
1
conversation-ai
1
text-generation
1
language-models
1
deep-learning
1
neural-networks
1
inference
1
summarization
1
email-automation
1
huggingface
1
ctransformers
1
llama-cpp
1
pyside6
1
gui
1
chatbot
1
vram-aware
1
text-summarization
1
research-tool
1
pypi-package
1
pypi
1
pymupdf-fitz
1
pymupdf
1
openai
1
llms
1
langchain-python
1
langchain
1
gpt-4
1
document-embeddings
1
document-embedding
1
chatgpt
1
academic-journals
1
cli-app
1
automation
1
manipulation
1
pdftl
1
pdftk
1
visual-search
1
visual-rag
1
qdrant
1
multimodal-rag
1
late-interaction
1
document-retrieval
1
colpali
1