Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "information-retrieval" keyword

archive-query-log 1.2.26
Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.
29 versions - Latest release: 6 months ago - 230 downloads last month - 20 stars on GitHub - 4 maintainers
Top 3.3% on pypi.org
pytrec-eval 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
3 versions - Latest release: over 3 years ago - 9 dependent packages - 36 dependent repositories - 104 thousand downloads last month - 249 stars on GitHub - 1 maintainer
raft-dask-cu11 24.4.0
Reusable Accelerated Functions & Tools Dask Infrastructure
13 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 105 downloads last month - 602 stars on GitHub - 3 maintainers
Top 9.6% on pypi.org
tevatron 0.1.0
Tevatron: A toolkit for learning and running deep dense retrieval models.
1 version - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 303 downloads last month - 373 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
catalyst 22.2.1 πŸ’°
Catalyst. Accelerated deep learning R&D with PyTorch.
104 versions - Latest release: about 2 years ago - 13 dependent packages - 179 dependent repositories - 12.3 thousand downloads last month - 3,233 stars on GitHub - 1 maintainer
catalyst-pdm 22.4.1 πŸ’°
Catalyst fork compatible with PDM
1 version - Latest release: about 1 year ago - 37 downloads last month - 3,233 stars on GitHub - 1 maintainer
pylibraft-cu11 24.4.0
RAFT: Reusable Algorithms Functions and other Tools
13 versions - Latest release: about 1 month ago - 4 dependent packages - 1 dependent repositories - 216 downloads last month - 602 stars on GitHub - 3 maintainers
pylibraft-cu12 24.4.0
RAFT: Reusable Algorithms Functions and other Tools
8 versions - Latest release: about 1 month ago - 4 dependent packages - 1 dependent repositories - 251 downloads last month - 602 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
unstructured 0.13.7
A library that prepares raw documents for downstream ML tasks.
132 versions - Latest release: 7 days ago - 113 dependent packages - 3,374 dependent repositories - 1.09 million downloads last month - 4,064 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
txtai 7.1.0
All-in-one open-source embeddings database for semantic search, LLM orchestration and language mo...
38 versions - Latest release: 26 days ago - 13 dependent packages - 14 dependent repositories - 7.85 thousand downloads last month - 6,914 stars on GitHub - 1 maintainer
fabricator-ai 0.2.0
Conveniently generating datasets with large language models.
3 versions - Latest release: 7 months ago - 36 downloads last month - 13,508 stars on GitHub - 1 maintainer
langroid 0.1.243
Harness LLMs with Multi-Agent Programming
218 versions - Latest release: about 13 hours ago - 1 dependent repositories - 7.11 thousand downloads last month - 1,497 stars on GitHub - 1 maintainer
stringzilla 3.8.3
SIMD-accelerated string search, sort, hashes, fingerprints, & edit distances
37 versions - Latest release: 18 days ago - 1 dependent package - 23.8 thousand downloads last month - 1,749 stars on GitHub - 1 maintainer
simsimd 4.3.1
Fastest SIMD-Accelerated Vector Similarity Functions for x86 and Arm
43 versions - Latest release: about 1 month ago - 5 dependent packages - 1 dependent repositories - 45.8 thousand downloads last month - 727 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
ranx 0.3.19
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion
45 versions - Latest release: 6 months ago - 4 dependent packages - 7 dependent repositories - 13.2 thousand downloads last month - 348 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
rank-bm25 0.2.2
Various BM25 algorithms for document ranking
4 versions - Latest release: about 2 years ago - 69 dependent packages - 345 dependent repositories - 304 thousand downloads last month - 755 stars on GitHub - 1 maintainer
raft-dask-cu12 24.4.0
Reusable Accelerated Functions & Tools Dask Infrastructure
8 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 221 downloads last month - 602 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
pytrec-eval-terrier 0.5.6
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
6 versions - Latest release: 7 months ago - 1 dependent package - 11 dependent repositories - 31.6 thousand downloads last month - 248 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
pyserini 0.35.0
A Python toolkit for reproducible information retrieval research with sparse and dense representa...
36 versions - Latest release: about 1 month ago - 9 dependent packages - 71 dependent repositories - 9.17 thousand downloads last month - 1,254 stars on GitHub - 1 maintainer
pvoctopus 2.0.0
Octopus Speech-to-Index engine.
13 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 126 downloads last month - 34 stars on GitHub - 1 maintainer
persianstemmer 1.0.0
Persian Stemmer for Python
1 version - Latest release: about 7 years ago - 2 dependent repositories - 23 downloads last month - 50 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
mteb 2.0.0
Massive Text Embedding Benchmark
141 versions - Latest release: about 2 months ago - 6 dependent packages - 25 dependent repositories - 197 thousand downloads last month - 1,441 stars on GitHub - 4 maintainers
Top 6.2% on pypi.org
lexicalrichness 0.5.1
A small module to compute textual lexical richness (aka lexical diversity).
15 versions - Latest release: 9 months ago - 5 dependent packages - 7 dependent repositories - 987 downloads last month - 75 stars on GitHub - 1 maintainer
cuvs-cu12 24.4.0
cuVS: Vector Search on the GPU
1 version - Latest release: about 1 month ago - 37 downloads last month - 72 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
ir-datasets 0.5.7
provides a common interface to many IR ad-hoc ranking benchmarks, training datasets, etc.
18 versions - Latest release: 16 days ago - 10 dependent packages - 21 dependent repositories - 31.5 thousand downloads last month - 296 stars on GitHub - 1 maintainer
promptcache 0.0.1a1
A tool for caching prompts and compleation based on embedding
1 version - Latest release: 4 months ago - 12 downloads last month - 1,714 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
instructorembedding 1.0.1
Text embedding tool
2 versions - Latest release: 12 months ago - 14 dependent packages - 34 dependent repositories - 116 thousand downloads last month - 1,714 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
haystack-ai 2.1.0
LLM framework to build customizable, production-ready LLM applications. Connect components (model...
190 versions - Latest release: 9 days ago - 51 dependent packages - 2 dependent repositories - 92.1 thousand downloads last month - 11,976 stars on GitHub - 1 maintainer
h2ogpte 1.4.13
Client library for Enterprise h2oGPTe
49 versions - Latest release: 13 days ago - 1 dependent package - 1 dependent repositories - 18.4 thousand downloads last month - 1 maintainer
Top 0.2% on pypi.org
gensim 4.3.2 πŸ’°
Python framework for fast Vector Space Modelling
90 versions - Latest release: 9 months ago - 426 dependent packages - 13,895 dependent repositories - 5.5 million downloads last month - 15,255 stars on GitHub - 2 maintainers
cuvs-cu11 24.4.0
cuVS: Vector Search on the GPU
1 version - Latest release: about 1 month ago - 21 downloads last month - 72 stars on GitHub - 1 maintainer
freediscovery-stabilizer 1.5.dev0
Open source software for E-Discovery and Information Retrieval
2 versions - Latest release: 29 days ago - 1 dependent package - 206 downloads last month - 73 stars on GitHub - 1 maintainer
c-mteb 1.1.1
Chinese Massive Text Embedding Benchmark
3 versions - Latest release: 28 days ago - 192 downloads last month - 5,126 stars on GitHub - 1 maintainer
lm-cocktail 0.0.4
LM_Cocktail
5 versions - Latest release: 4 months ago - 128 downloads last month - 5,126 stars on GitHub - 1 maintainer
Top 7.0% on pypi.org
flagembedding 1.2.9
FlagEmbedding
24 versions - Latest release: 28 days ago - 4 dependent packages - 1 dependent repositories - 74.1 thousand downloads last month - 5,126 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
farm-haystack 1.25.5
LLM framework to build customizable, production-ready LLM applications. Connect components (model...
108 versions - Latest release: 21 days ago - 21 dependent packages - 237 dependent repositories - 105 thousand downloads last month - 13,508 stars on GitHub - 4 maintainers
Top 0.7% on pypi.org
easyocr 1.7.1
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
32 versions - Latest release: 8 months ago - 90 dependent packages - 671 dependent repositories - 225 thousand downloads last month - 22,043 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
beir 2.0.0
A Heterogeneous Benchmark for Information Retrieval
29 versions - Latest release: 10 months ago - 9 dependent packages - 30 dependent repositories - 168 thousand downloads last month - 1,370 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
asone-ocr 1.6.2
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
2 versions - Latest release: about 1 year ago - 2 dependent packages - 1 dependent repositories - 231 downloads last month - 20,452 stars on GitHub - 1 maintainer
agent-search 0.1.0
AgentSearch: An open source framework and dataset for webscale local search.
9 versions - Latest release: 4 months ago - 1 dependent package - 1.27 thousand downloads last month - 360 stars on GitHub - 1 maintainer
sycamore-ai 0.1.16
Sycamore is an LLM-powered semantic data preparation system for building search applications.
16 versions - Latest release: 8 days ago - 530 downloads last month - 169 stars on GitHub - 1 maintainer
gritlm 1.0.0
GritLM
7 versions - Latest release: 3 months ago - 4.89 thousand downloads last month - 404 stars on GitHub - 1 maintainer
showsys 1.0.1
A simple package to get system spec information
2 versions - Latest release: 3 months ago - 26 downloads last month - 2 stars on GitHub - 1 maintainer
marqo1 2.1.0
Tensor search for humans
1 version - Latest release: 4 months ago - 16 downloads last month - 4,085 stars on GitHub - 1 maintainer
elasticsearch-ir-evaluator 0.4.4
A Python package for easily calculating information retrieval (IR) accuracy metrics using Elastic...
9 versions - Latest release: about 1 month ago - 371 downloads last month - 0 stars on GitHub - 1 maintainer
continuous-eval 0.3.7
Open-Source Evaluation for GenAI Application Pipelines.
19 versions - Latest release: 20 days ago - 1.24 thousand downloads last month - 311 stars on GitHub - 1 maintainer
brevia 0.0.27
Extensible API and framework to build your Retrieval Augmented Generation (RAG) and Information E...
27 versions - Latest release: 5 days ago - 1 thousand downloads last month - 21 stars on GitHub - 1 maintainer
llmware 0.2.13
An enterprise-grade LLM-based development framework, tools, and fine-tuned models
32 versions - Latest release: 4 days ago - 2.35 thousand downloads last month - 1,647 stars on GitHub - 1 maintainer
tiledb-vector-search 0.4.0
TileDB Vector Search Python client
18 versions - Latest release: 1 day ago - 3.05 thousand downloads last month - 44 stars on GitHub - 4 maintainers
easyocr-itgn 1.2.3
Modified Easyorc By IntoThatGoodNight
3 versions - Latest release: 10 months ago - 44 downloads last month - 20,429 stars on GitHub - 1 maintainer
zensols.spanmatch 0.0.1
An API to match spans of semantically similar text across documents.
1 version - Latest release: 11 months ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
nocv2easyocr 0.1.1
This is a fork of the EasyOCR library without the opencv requirement
2 versions - Latest release: about 1 year ago - 91 downloads last month - 20,452 stars on GitHub - 1 maintainer
megabots 0.0.11
πŸ€– Megabots provides State-of-the-art, production ready bots made mega-easy, so you don't have to ...
5 versions - Latest release: about 1 year ago - 1 dependent repositories - 51 downloads last month - 335 stars on GitHub - 1 maintainer
pyranker 0.1.3
A Python based package consisiting of BM25 and Vector Space Rankers for Information Retri...
3 versions - Latest release: about 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
qnabot 0.0.6
Create a question answering over docs bot with one line of code.
6 versions - Latest release: about 1 year ago - 1 dependent repositories - 45 downloads last month - 335 stars on GitHub - 1 maintainer
pydoxtools 0.8.0
This library contains a set of tools in order to extract and synthesize structured information fr...
12 versions - Latest release: 4 months ago - 73 downloads last month - 55 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
axcelocr 1.6.3
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: about 1 year ago - 29 downloads last month - 20,441 stars on GitHub - 1 maintainer
asoen-ocr 1.0.0
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: about 1 year ago - 29 downloads last month - 22,008 stars on GitHub - 1 maintainer
ccpke 2.0.0
Python Keyphrase Extraction module
1 version - Latest release: over 1 year ago - 1 dependent package - 15 downloads last month - 1,522 stars on GitHub - 1 maintainer
pyterrier-sentence-transformers 0.2.2
Create an pyterrier index using any sentence-transformers model
1 version - Latest release: over 1 year ago - 1 dependent repositories - 14 downloads last month - 5 stars on GitHub - 1 maintainer
website-report 1.0.1
a simple library that generates website reports
2 versions - Latest release: over 1 year ago - 16 downloads last month - 4 stars on GitHub - 1 maintainer
autograph-obsidian 0.3
Automatic knowledge graph generation.
3 versions - Latest release: 8 months ago - 1 dependent repositories - 12 downloads last month - 18 stars on GitHub - 1 maintainer
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains
1 version - Latest release: almost 2 years ago - 11 downloads last month - 10 stars on GitHub - 3 maintainers
ocrpy 0.3.10
unified interface to google vision, aws textract, azure & tesseract OCR tools.
12 versions - Latest release: almost 2 years ago - 1 dependent repositories - 264 downloads last month - 216 stars on GitHub - 1 maintainer
weaviate-cli 2.2.0
Comand line interface to interact with weaviate
21 versions - Latest release: about 1 year ago - 1 dependent repositories - 1.43 thousand downloads last month - 9,582 stars on GitHub - 2 maintainers
wayward 0.3.2
Wayward is a Python package that helps to identify characteristic terms from single documents or ...
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 30 downloads last month - 9 stars on GitHub - 1 maintainer
vtext 0.2.0
Natural Language Processing in Rust with Python bidings
4 versions - Latest release: almost 4 years ago - 4 dependent repositories - 191 downloads last month - 147 stars on GitHub - 1 maintainer
useb 0.0.1
Heterogenous, Task- and Domain-Specific Benchmark for Unsupervised Sentence Embeddings used in th...
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 12 downloads last month - 31 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
text2text 1.4.4
Text2Text: Crosslingual NLP/G toolkit
142 versions - Latest release: 3 months ago - 4 dependent repositories - 984 downloads last month - 272 stars on GitHub - 1 maintainer
skifts 0.1.0
Search for the most relevant documents containing words from a query
1 version - Latest release: over 2 years ago - 1 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
shpinfo 0.2.0 πŸ’°
A command line program to print meta information about the given shapefile.
3 versions - Latest release: almost 9 years ago - 2 dependent repositories - 24 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
rocketqa 1.1.0
RocketQA development tools and examples, building on top of PaddlePaddle2.0.
2 versions - Latest release: about 2 years ago - 1 dependent package - 5 dependent repositories - 163 downloads last month - 736 stars on GitHub - 1 maintainer
rmdl 1.0.8
RMDL: Random Multimodel Deep Learning for Classification
6 versions - Latest release: almost 4 years ago - 1 dependent repositories - 59 downloads last month - 412 stars on GitHub - 1 maintainer
repro-eval 0.4.0
A tool to quantify the replicability and reproducibility of system-oriented IR experiments.
8 versions - Latest release: about 2 years ago - 2 dependent repositories - 77 downloads last month - 10 stars on GitHub - 1 maintainer
rank-eval 0.1.3
rank_eval: A Blazing Fast Python Library for Ranking Evaluation and Comparison
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 236 downloads last month - 352 stars on GitHub - 1 maintainer
pytrec-eval-git 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 10 downloads last month - 235 stars on GitHub - 1 maintainer
pyplexity 0.2.12
Perplexity filter for documents and bulk HTML and WARC boilerplate removal.
18 versions - Latest release: 11 months ago - 1 dependent repositories - 200 downloads last month - 38 stars on GitHub - 2 maintainers
pynutshell 1.0.2
An unsupervised text summarization and information retrieval library under the hood using natural...
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 30 downloads last month - 13 stars on GitHub - 1 maintainer
pyndri 0.4
pyndri is a Python interface to the Indri search engine
3 versions - Latest release: about 6 years ago - 3 dependent repositories - 23 downloads last month - 88 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.0
Octopus Speech-to-Index engine demo.
13 versions - Latest release: 6 months ago - 1 dependent repositories - 136 downloads last month - 34 stars on GitHub - 1 maintainer
posscore 0.0.1
This is a package for the POSSCORE metric.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 6 downloads last month - 4 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
pke-tool 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 454 downloads last month - 1,522 stars on GitHub - 1 maintainer
pkelambda 1.8.1
Python Keyphrase Extraction module
1 version - Latest release: over 2 years ago - 1 dependent repositories - 22 downloads last month - 1,522 stars on GitHub - 1 maintainer
patzilla 0.169.3
PatZilla is a modular patent information research platform and data integration toolkit. It featu...
48 versions - Latest release: over 4 years ago - 1 dependent repositories - 69 downloads last month - 93 stars on GitHub - 1 maintainer
neuralqa 0.0.31a0 πŸ’°
NeuralQA: Question Answering on Large Datasets
12 versions - Latest release: over 3 years ago - 2 dependent repositories - 25 downloads last month - 230 stars on GitHub - 1 maintainer
netizenship 0.2.3
Tool to check the username with popular websites for membership
13 versions - Latest release: over 4 years ago - 1 dependent repositories - 85 downloads last month - 45 stars on GitHub - 1 maintainer
nalcos 0.1.1 πŸ’°
Search Git commits in natural language
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 21 downloads last month - 53 stars on GitHub - 1 maintainer
myeasyocr 1.2.3
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution
1 version - Latest release: over 3 years ago - 1 dependent repositories - 44 downloads last month - 20,452 stars on GitHub - 1 maintainer
multiplex-plot 0.5.0
Multiplex: visualizations that tell storiesβ€”A Python library to create and annotate beautiful net...
14 versions - Latest release: over 3 years ago - 1 dependent repositories - 108 downloads last month - 104 stars on GitHub - 1 maintainer
metriks 0.0.2
metriks is a Python package of commonly used metrics for evaluating information retrieval models.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 17 downloads last month - 25 stars on GitHub - 2 maintainers
kex 2.0.6
Light/easy keyword extraction from documents.
5 versions - Latest release: 11 months ago - 1 dependent repositories - 40 downloads last month - 53 stars on GitHub - 1 maintainer
irtm 0.0.4
A toolbox for Information Retrieval & Text Mining.
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 22 downloads last month - 1 stars on GitHub - 1 maintainer
ir-kit 1.0.1
Utilities for information retrieval in python
5 versions - Latest release: almost 7 years ago - 1 dependent repositories - 8 downloads last month - 2 stars on GitHub - 1 maintainer
himpy 0.0.1
Histogram model
1 version - Latest release: over 2 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
hdltex 1.0.5
HDLTex: Hierarchical Deep Learning for Text Classification
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 16 downloads last month - 255 stars on GitHub - 1 maintainer
greparl 0.3
Grep the Greek Parliament
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 6 downloads last month - 2 stars on GitHub - 1 maintainer
gensim-plural 3.8.1 πŸ’°
Python framework for fast Vector Space Modelling
1 version - Latest release: over 4 years ago - 1 dependent repositories - 62 downloads last month - 15,198 stars on GitHub - 1 maintainer
gensim-bz2-nsml 3.8.0 πŸ’°
Python framework for fast Vector Space Modelling
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 63 downloads last month - 15,255 stars on GitHub - 1 maintainer
geesedb 0.0.2
Graph Engine for Exploration and Search over Evolving DataBases
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 18 downloads last month - 33 stars on GitHub - 1 maintainer
fornax 0.1.1
Approximate fuzzy subgraph matching in polynomial time
3 versions - Latest release: almost 5 years ago - 1 dependent repositories - 10 downloads last month - 21 stars on GitHub - 1 maintainer
Related Keywords