An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "information-retrieval" keyword

View the packages on the pypi.org package registry that are tagged with the "information-retrieval" keyword.

irtm 0.0.4
A toolbox for Information Retrieval & Text Mining.
4 versions - Latest release: about 4 years ago - 1 dependent repositories - 18 downloads last month - 1 stars on GitHub - 1 maintainer
ragl 0.10.1
ragl: retrieval-augmented generation (RAG) for text.
23 versions - Latest release: 2 months ago - 83 downloads last month - 1 maintainer
rag-engine 0.1.2
A Retrieval-Augmented Generation (RAG) Engine for managing embeddings and similarity search
2 versions - Latest release: over 1 year ago - 67 downloads last month - 3 stars on GitHub - 1 maintainer
h2ogpte 1.6.43
Client library for Enterprise h2oGPTe
176 versions - Latest release: 14 days ago - 1 dependent package - 1 dependent repositories - 4.79 thousand downloads last month - 1 maintainer
dynamic-prompting 0.2.6
Dynamic Few-Shot Prompting is a Python package that dynamically selects N samples that are contex...
4 versions - Latest release: over 1 year ago - 17 downloads last month - 2 stars on GitHub - 1 maintainer
viincci-rag 2.0.0
Universal multi-domain research system with RAG (Retrieval-Augmented Generation) capabilities
1 version - Latest release: about 11 hours ago
ir-metrics 0.1.6
The most common information retrieval (IR) metrics
15 versions - Latest release: almost 5 years ago - 5 dependent repositories - 4 thousand downloads last month - 5 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
easyocr 1.7.2
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
33 versions - Latest release: about 1 year ago - 90 dependent packages - 671 dependent repositories - 1.45 million downloads last month - 28,302 stars on GitHub - 1 maintainer
langroid-examples 0.1.25 💰
Add your description here
19 versions - Latest release: 11 months ago - 47 downloads last month - 3,293 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
haystack-ai 2.19.0
LLM framework to build customizable, production-ready LLM applications. Connect components (model...
288 versions - Latest release: 23 days ago - 51 dependent packages - 2 dependent repositories - 388 thousand downloads last month - 16,679 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
rank-bm25 0.2.2
Various BM25 algorithms for document ranking
4 versions - Latest release: over 3 years ago - 69 dependent packages - 345 dependent repositories - 2.54 million downloads last month - 1,260 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
text2text 1.9.5
Text2Text Language Modeling Toolkit
192 versions - Latest release: 10 months ago - 4 dependent repositories - 454 downloads last month - 302 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
catalyst 22.2.1 💰
Catalyst. Accelerated deep learning R&D with PyTorch.
104 versions - Latest release: over 3 years ago - 13 dependent packages - 179 dependent repositories - 54.5 thousand downloads last month - 3,235 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
gensim 4.4.0 💰
Python framework for fast Vector Space Modelling
92 versions - Latest release: 26 days ago - 426 dependent packages - 13,895 dependent repositories - 5.57 million downloads last month - 15,255 stars on GitHub - 2 maintainers
web-research-agent 1.2.1
An AI agent using ReAct methodology for autonomous web research tasks
22 versions - Latest release: about 1 month ago - 96 downloads last month - 0 stars on GitHub - 1 maintainer
open-retrievals 0.0.14
Text Embeddings for Retrieval and RAG based on transformers
16 versions - Latest release: 10 months ago - 46 downloads last month - 69 stars on GitHub - 1 maintainer
toolfront 0.4.1
Build AI applications in Markdown.
28 versions - Latest release: 2 days ago - 599 downloads last month - 785 stars on GitHub - 2 maintainers
vke 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: over 4 years ago - 1 dependent repositories - 15 downloads last month - 1,583 stars on GitHub - 1 maintainer
pkelambda 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 12 downloads last month - 1,582 stars on GitHub - 1 maintainer
gensim-bz2-nsml 3.8.0 💰
Python framework for fast Vector Space Modelling
1 version - Latest release: over 6 years ago - 1 dependent repositories - 45 downloads last month - 15,949 stars on GitHub - 1 maintainer
megabots 0.0.11
🤖 Megabots provides State-of-the-art, production ready bots made mega-easy, so you don't have to ...
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 14 downloads last month - 341 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
allrank 1.4.3
allRank is a framework for training learning-to-rank neural models
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 384 downloads last month - 959 stars on GitHub - 1 maintainer
ir-kit 1.0.1
Utilities for information retrieval in python
5 versions - Latest release: over 8 years ago - 1 dependent repositories - 392 downloads last month - 2 stars on GitHub - 1 maintainer
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains
1 version - Latest release: over 3 years ago - 10 downloads last month - 13 stars on GitHub - 3 maintainers
continuous-eval 0.3.14
Open-Source Evaluation for GenAI Applications.
28 versions - Latest release: 10 months ago - 4.57 thousand downloads last month - 505 stars on GitHub - 1 maintainer
langroid 0.59.18 💰
Harness LLMs with Multi-Agent Programming
503 versions - Latest release: 4 days ago - 1 dependent repositories - 103 thousand downloads last month - 2,245 stars on GitHub - 1 maintainer
zensols.spanmatch 0.0.1
An API to match spans of semantically similar text across documents.
1 version - Latest release: over 2 years ago - 8 downloads last month - 1 stars on GitHub - 1 maintainer
billm 0.1.6
Tool for converting LLMs from uni-directional to bi-directional for tasks like classification and...
7 versions - Latest release: over 1 year ago - 129 downloads last month - 441 stars on GitHub - 1 maintainer
yabm25 0.1.1
Fast BM25 search engine for Python with RAG support
2 versions - Latest release: 9 months ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
similaripy 0.3.1
High-performance KNN similarity functions in Python, optimized for sparse matrices
20 versions - Latest release: 5 days ago - 3 dependent repositories - 2.09 thousand downloads last month - 57 stars on GitHub - 1 maintainer
neuralqa 0.0.31a0 💰
NeuralQA: Question Answering on Large Datasets
12 versions - Latest release: about 5 years ago - 2 dependent repositories - 22 downloads last month - 233 stars on GitHub - 1 maintainer
freediscovery-stabilizer 1.5.dev0
Open source software for E-Discovery and Information Retrieval
2 versions - Latest release: over 1 year ago - 1 dependent package - 32 downloads last month - 75 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
unstructured 0.18.18
A library that prepares raw documents for downstream ML tasks.
194 versions - Latest release: 6 days ago - 113 dependent packages - 3,374 dependent repositories - 3.97 million downloads last month - 13,122 stars on GitHub - 1 maintainer
easy-elasticsearch 0.0.9
An easy-to-use Elasticsearch BM25 interface
8 versions - Latest release: about 3 years ago - 3 dependent repositories - 1.66 thousand downloads last month - 31 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
tevatron 0.1.0
Tevatron: A toolkit for learning and running deep dense retrieval models.
1 version - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 291 downloads last month - 697 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.
13 versions - Latest release: over 1 year ago - 54 downloads last month - 12,775 stars on GitHub - 1 maintainer
retrivex 0.1.0
Retrieval Models Explainability Toolkit. Explain embedding similarity prediction.
1 version - Latest release: about 1 month ago - 59 downloads last month - 1 stars on GitHub - 1 maintainer
ccpke 2.0.0
Python Keyphrase Extraction module
1 version - Latest release: almost 3 years ago - 1 dependent package - 50 downloads last month - 1,582 stars on GitHub - 1 maintainer
rakun2 0.31
RaKUn 2.0; Better faster stronger lighter
13 versions - Latest release: 7 months ago - 1 dependent repositories - 598 downloads last month - 68 stars on GitHub - 1 maintainer
promptcache 0.0.1a1
A tool for caching prompts and compleation based on embedding
1 version - Latest release: almost 2 years ago - 6 downloads last month - 1,947 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
farm-haystack 1.26.4
LLM framework to build customizable, production-ready LLM applications. Connect components (model...
116 versions - Latest release: about 1 year ago - 21 dependent packages - 237 dependent repositories - 77.8 thousand downloads last month - 23,124 stars on GitHub - 1 maintainer
nalcos 0.1.1 💰
Search Git commits in natural language
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 13 downloads last month - 56 stars on GitHub - 1 maintainer
retriv 0.2.3
retriv: A Python Search Engine for Humans.
10 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 540 downloads last month - 231 stars on GitHub - 1 maintainer
pyplexity 0.2.12
Perplexity filter for documents and bulk HTML and WARC boilerplate removal.
18 versions - Latest release: over 2 years ago - 1 dependent repositories - 44.9 thousand downloads last month - 38 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
txtai 9.1.0
All-in-one open-source AI framework for semantic search, LLM orchestration and language model wor...
54 versions - Latest release: 8 days ago - 13 dependent packages - 14 dependent repositories - 29.7 thousand downloads last month - 11,746 stars on GitHub - 1 maintainer
bikidata 0.3.6
Developer-friendly Queries over RDF triples
31 versions - Latest release: 8 days ago - 540 downloads last month - 10 stars on GitHub - 1 maintainer
geesedb 0.0.2
Graph Engine for Exploration and Search over Evolving DataBases
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 17 downloads last month - 33 stars on GitHub - 1 maintainer
fastrag 3.1.2
An Efficient Retrieval Augmentation and Generation Framework for Intel Hardware.
6 versions - Latest release: 12 months ago - 90 downloads last month - 945 stars on GitHub - 1 maintainer
pvoctopus 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 68 downloads last month - 37 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.
15 versions - Latest release: 7 months ago - 1 dependent repositories - 64 downloads last month - 37 stars on GitHub - 1 maintainer
wiki-passage-retriever 0.1.4
A small tool for retrieving relevant passages to a question from Wikipedia
8 versions - Latest release: almost 5 years ago - 1 dependent repositories - 19 downloads last month - 2 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
ranx 0.3.21
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion
47 versions - Latest release: 3 months ago - 4 dependent packages - 7 dependent repositories - 33.8 thousand downloads last month - 594 stars on GitHub - 1 maintainer
asoen-ocr 1.0.0
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: over 2 years ago - 19 downloads last month - 26,519 stars on GitHub - 1 maintainer
patzilla 0.169.3
PatZilla is a modular patent information research platform and data integration toolkit. It featu...
48 versions - Latest release: about 6 years ago - 1 dependent repositories - 175 downloads last month - 108 stars on GitHub - 1 maintainer
pyranker 0.1.3
A Python based package consisiting of BM25 and Vector Space Rankers for Information Retri...
3 versions - Latest release: over 2 years ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
qnabot 0.0.6
Create a question answering over docs bot with one line of code.
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 20 downloads last month - 350 stars on GitHub - 1 maintainer
fabricator-ai 0.2.0
Conveniently generating datasets with large language models.
3 versions - Latest release: about 2 years ago - 26 downloads last month - 20,653 stars on GitHub - 1 maintainer
cogexpke 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 14 downloads last month - 1,582 stars on GitHub - 1 maintainer
repro-eval 0.4.0
A tool to quantify the replicability and reproducibility of system-oriented IR experiments.
8 versions - Latest release: over 3 years ago - 2 dependent repositories - 34 downloads last month - 11 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
pke-tool 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 323 downloads last month - 1,582 stars on GitHub - 1 maintainer
horusner 0.1.5
HORUS Framework
3 versions - Latest release: over 8 years ago - 1 dependent repositories - 6 downloads last month - 48 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
pyserini 1.3.0
A Python toolkit for reproducible information retrieval research with sparse and dense representa...
49 versions - Latest release: 11 days ago - 9 dependent packages - 71 dependent repositories - 23.8 thousand downloads last month - 1,946 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
gpl 0.1.4
GPL is an unsupervised domain adaptation method for training dense retrievers. It is based on que...
14 versions - Latest release: about 3 years ago - 2 dependent repositories - 138 downloads last month - 337 stars on GitHub - 1 maintainer
judie 0.0.2
Judge and evaluate your chunk quality with Judie, the Owl! Quick, easy, and effective!
2 versions - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
bm25-fusion 0.1.4
An ultra-fast BM25 retriever with support for multiple variants, metadata filtering, and stopword...
8 versions - Latest release: 8 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
pyndri 0.4
pyndri is a Python interface to the Indri search engine
3 versions - Latest release: over 7 years ago - 3 dependent repositories - 21 downloads last month - 89 stars on GitHub - 1 maintainer
mseep-txtai 9.1.1
All-in-one open-source AI framework for semantic search, LLM orchestration and language model wor...
1 version - Latest release: 2 months ago - 15 downloads last month - 11,716 stars on GitHub - 1 maintainer
libraft-cu13 25.10.0
RAFT: Reusable Algorithms Functions and other Tools (C++)
2 versions - Latest release: about 1 month ago - 418 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu11 25.6.0
RAFT: Reusable Algorithms Functions and other Tools
20 versions - Latest release: 5 months ago - 4 dependent packages - 1 dependent repositories - 1.33 thousand downloads last month - 948 stars on GitHub - 2 maintainers
libraft-cu12 25.10.0
RAFT: Reusable Algorithms Functions and other Tools (C++)
5 versions - Latest release: about 1 month ago - 55.2 thousand downloads last month - 948 stars on GitHub - 1 maintainer
raft-dask-cu11 25.6.0
Reusable Accelerated Functions & Tools Dask Infrastructure
20 versions - Latest release: 5 months ago - 2 dependent packages - 1 dependent repositories - 1.08 thousand downloads last month - 948 stars on GitHub - 2 maintainers
libraft-cu11 25.6.0
RAFT: Reusable Algorithms Functions and other Tools (C++)
3 versions - Latest release: 5 months ago - 986 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu13 25.10.0
RAFT: Reusable Algorithms Functions and other Tools
2 versions - Latest release: about 1 month ago - 575 downloads last month - 948 stars on GitHub - 1 maintainer
raft-dask-cu13 25.10.0
Reusable Accelerated Functions & Tools Dask Infrastructure
2 versions - Latest release: about 1 month ago - 528 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu12 25.10.0
RAFT: Reusable Algorithms Functions and other Tools
17 versions - Latest release: about 1 month ago - 4 dependent packages - 1 dependent repositories - 55.7 thousand downloads last month - 602 stars on GitHub - 1 maintainer
raft-dask-cu12 25.10.0
Reusable Accelerated Functions & Tools Dask Infrastructure
17 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 52 thousand downloads last month - 602 stars on GitHub - 1 maintainer
goldenretriever-core 1.0.0
Dense Retriever
7 versions - Latest release: over 1 year ago - 19 downloads last month - 9 stars on GitHub - 1 maintainer
sycamore-ai 0.1.33
Sycamore is an LLM-powered semantic data preparation system for building search applications.
33 versions - Latest release: 4 months ago - 1.67 thousand downloads last month - 572 stars on GitHub - 2 maintainers
atarashi 0.0.11
An intelligent license scanner.
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 48 downloads last month - 31 stars on GitHub - 4 maintainers
fast-plaid 1.2.0
Fast Plaid.
29 versions - Latest release: 2 months ago - 11.4 thousand downloads last month - 162 stars on GitHub - 1 maintainer
ocrpy 0.3.10
unified interface to google vision, aws textract, azure & tesseract OCR tools.
12 versions - Latest release: about 3 years ago - 1 dependent repositories - 103 downloads last month - 216 stars on GitHub - 1 maintainer
vtext 0.2.0
Natural Language Processing in Rust with Python bidings
4 versions - Latest release: over 5 years ago - 4 dependent repositories - 48 downloads last month - 153 stars on GitHub - 1 maintainer
sinapsis-ocr 0.1.9
Implements Sinapsis templates to perform optical character recognition on images
9 versions - Latest release: 2 months ago - 120 downloads last month - 20 stars on GitHub - 1 maintainer
sinapsis-doctr 0.1.7
Perform optical character recognition using the DocTR library
7 versions - Latest release: 3 months ago - 109 downloads last month - 20 stars on GitHub - 1 maintainer
sinapsis-easyocr 0.1.9
Perform optical character recognition using the EasyOCR library
9 versions - Latest release: 2 months ago - 125 downloads last month - 20 stars on GitHub - 1 maintainer
embed-anything-gpu 0.6.6
Embed anything at lightning speed
21 versions - Latest release: 15 days ago - 232 downloads last month - 753 stars on GitHub - 1 maintainer
embed-anything 0.6.6
Embed anything at lightning speed
47 versions - Latest release: 15 days ago - 562 downloads last month - 753 stars on GitHub - 1 maintainer
stealerlib 0.0.1
Python Information Stealer Library (for Windows)
1 version - Latest release: over 2 years ago - 16 downloads last month - 7 stars on GitHub - 1 maintainer
archive-query-log 1.2.26
Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.
48 versions - Latest release: almost 2 years ago - 750 downloads last month - 31 stars on GitHub - 4 maintainers
pylate 1.3.4
A library for training and retrieval with ColBERT.
15 versions - Latest release: 27 days ago - 12.2 thousand downloads last month - 609 stars on GitHub - 1 maintainer
tirex-tracker 0.2.16
Automatic resource and metadata tracking for information retrieval experiments.
17 versions - Latest release: about 1 month ago - 332 downloads last month - 8 stars on GitHub - 2 maintainers
stark-qa 0.1.3
Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases
4 versions - Latest release: about 1 year ago - 161 downloads last month - 319 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
mteb 2.1.1
Massive Text Embedding Benchmark
589 versions - Latest release: 16 days ago - 6 dependent packages - 25 dependent repositories - 407 thousand downloads last month - 1,441 stars on GitHub - 4 maintainers
skifts 0.1.0
Search for the most relevant documents containing words from a query
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
docling-haystack 0.1.1
Docling Haystack converter
1 version - Latest release: 11 months ago - 1.96 thousand downloads last month - 20,653 stars on GitHub - 1 maintainer
gritlm 1.0.2
GritLM
9 versions - Latest release: over 1 year ago - 7.02 thousand downloads last month - 668 stars on GitHub - 1 maintainer
llmemory 0.4.0
High-performance document memory system with vector search capabilities for Python applications
1 version - Latest release: 17 days ago
hinteval 0.0.6
A Python framework designed for both generating and evaluating hints.
6 versions - Latest release: 2 months ago - 53 downloads last month - 33 stars on GitHub - 1 maintainer
colpali-engine 0.3.12
The code used to train and run inference with the ColPali architecture.
21 versions - Latest release: 4 months ago - 61.2 thousand downloads last month - 2,185 stars on GitHub - 2 maintainers
freediscovery 1.3.1
Open source software for E-Discovery and Information Retrieval
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 45 downloads last month - 75 stars on GitHub - 1 maintainer