pypi.org "information-retrieval" keyword
View the packages on the pypi.org package registry that are tagged with the "information-retrieval" keyword.
irtm 0.0.4
A toolbox for Information Retrieval & Text Mining.4 versions - Latest release: about 4 years ago - 1 dependent repositories - 18 downloads last month - 1 stars on GitHub - 1 maintainer
ragl 0.10.1
ragl: retrieval-augmented generation (RAG) for text.23 versions - Latest release: 2 months ago - 83 downloads last month - 1 maintainer
rag-engine 0.1.2
A Retrieval-Augmented Generation (RAG) Engine for managing embeddings and similarity search2 versions - Latest release: over 1 year ago - 67 downloads last month - 3 stars on GitHub - 1 maintainer
h2ogpte 1.6.43
Client library for Enterprise h2oGPTe176 versions - Latest release: 14 days ago - 1 dependent package - 1 dependent repositories - 4.79 thousand downloads last month - 1 maintainer
dynamic-prompting 0.2.6
Dynamic Few-Shot Prompting is a Python package that dynamically selects N samples that are contex...4 versions - Latest release: over 1 year ago - 17 downloads last month - 2 stars on GitHub - 1 maintainer
viincci-rag 2.0.0
Universal multi-domain research system with RAG (Retrieval-Augmented Generation) capabilities1 version - Latest release: about 11 hours ago
ir-metrics 0.1.6
The most common information retrieval (IR) metrics15 versions - Latest release: almost 5 years ago - 5 dependent repositories - 4 thousand downloads last month - 5 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
33 versions - Latest release: about 1 year ago - 90 dependent packages - 671 dependent repositories - 1.45 million downloads last month - 28,302 stars on GitHub - 1 maintainer
easyocr 1.7.2
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution33 versions - Latest release: about 1 year ago - 90 dependent packages - 671 dependent repositories - 1.45 million downloads last month - 28,302 stars on GitHub - 1 maintainer
langroid-examples 0.1.25 💰
Add your description here19 versions - Latest release: 11 months ago - 47 downloads last month - 3,293 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
288 versions - Latest release: 23 days ago - 51 dependent packages - 2 dependent repositories - 388 thousand downloads last month - 16,679 stars on GitHub - 1 maintainer
haystack-ai 2.19.0
LLM framework to build customizable, production-ready LLM applications. Connect components (model...288 versions - Latest release: 23 days ago - 51 dependent packages - 2 dependent repositories - 388 thousand downloads last month - 16,679 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
4 versions - Latest release: over 3 years ago - 69 dependent packages - 345 dependent repositories - 2.54 million downloads last month - 1,260 stars on GitHub - 1 maintainer
rank-bm25 0.2.2
Various BM25 algorithms for document ranking4 versions - Latest release: over 3 years ago - 69 dependent packages - 345 dependent repositories - 2.54 million downloads last month - 1,260 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
192 versions - Latest release: 10 months ago - 4 dependent repositories - 454 downloads last month - 302 stars on GitHub - 1 maintainer
text2text 1.9.5
Text2Text Language Modeling Toolkit192 versions - Latest release: 10 months ago - 4 dependent repositories - 454 downloads last month - 302 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
104 versions - Latest release: over 3 years ago - 13 dependent packages - 179 dependent repositories - 54.5 thousand downloads last month - 3,235 stars on GitHub - 1 maintainer
catalyst 22.2.1 💰
Catalyst. Accelerated deep learning R&D with PyTorch.104 versions - Latest release: over 3 years ago - 13 dependent packages - 179 dependent repositories - 54.5 thousand downloads last month - 3,235 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
92 versions - Latest release: 26 days ago - 426 dependent packages - 13,895 dependent repositories - 5.57 million downloads last month - 15,255 stars on GitHub - 2 maintainers
gensim 4.4.0 💰
Python framework for fast Vector Space Modelling92 versions - Latest release: 26 days ago - 426 dependent packages - 13,895 dependent repositories - 5.57 million downloads last month - 15,255 stars on GitHub - 2 maintainers
web-research-agent 1.2.1
An AI agent using ReAct methodology for autonomous web research tasks22 versions - Latest release: about 1 month ago - 96 downloads last month - 0 stars on GitHub - 1 maintainer
open-retrievals 0.0.14
Text Embeddings for Retrieval and RAG based on transformers16 versions - Latest release: 10 months ago - 46 downloads last month - 69 stars on GitHub - 1 maintainer
toolfront 0.4.1
Build AI applications in Markdown.28 versions - Latest release: 2 days ago - 599 downloads last month - 785 stars on GitHub - 2 maintainers
vke 1.8.1
Python Keyphrase Extraction module1 version - Latest release: over 4 years ago - 1 dependent repositories - 15 downloads last month - 1,583 stars on GitHub - 1 maintainer
pkelambda 1.8.1
Python Keyphrase Extraction module1 version - Latest release: almost 4 years ago - 1 dependent repositories - 12 downloads last month - 1,582 stars on GitHub - 1 maintainer
gensim-bz2-nsml 3.8.0 💰
Python framework for fast Vector Space Modelling1 version - Latest release: over 6 years ago - 1 dependent repositories - 45 downloads last month - 15,949 stars on GitHub - 1 maintainer
megabots 0.0.11
🤖 Megabots provides State-of-the-art, production ready bots made mega-easy, so you don't have to ...5 versions - Latest release: over 2 years ago - 1 dependent repositories - 14 downloads last month - 341 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 384 downloads last month - 959 stars on GitHub - 1 maintainer
allrank 1.4.3
allRank is a framework for training learning-to-rank neural models8 versions - Latest release: over 4 years ago - 1 dependent repositories - 384 downloads last month - 959 stars on GitHub - 1 maintainer
ir-kit 1.0.1
Utilities for information retrieval in python5 versions - Latest release: over 8 years ago - 1 dependent repositories - 392 downloads last month - 2 stars on GitHub - 1 maintainer
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains1 version - Latest release: over 3 years ago - 10 downloads last month - 13 stars on GitHub - 3 maintainers
continuous-eval 0.3.14
Open-Source Evaluation for GenAI Applications.28 versions - Latest release: 10 months ago - 4.57 thousand downloads last month - 505 stars on GitHub - 1 maintainer
langroid 0.59.18 💰
Harness LLMs with Multi-Agent Programming503 versions - Latest release: 4 days ago - 1 dependent repositories - 103 thousand downloads last month - 2,245 stars on GitHub - 1 maintainer
zensols.spanmatch 0.0.1
An API to match spans of semantically similar text across documents.1 version - Latest release: over 2 years ago - 8 downloads last month - 1 stars on GitHub - 1 maintainer
billm 0.1.6
Tool for converting LLMs from uni-directional to bi-directional for tasks like classification and...7 versions - Latest release: over 1 year ago - 129 downloads last month - 441 stars on GitHub - 1 maintainer
yabm25 0.1.1
Fast BM25 search engine for Python with RAG support2 versions - Latest release: 9 months ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
similaripy 0.3.1
High-performance KNN similarity functions in Python, optimized for sparse matrices20 versions - Latest release: 5 days ago - 3 dependent repositories - 2.09 thousand downloads last month - 57 stars on GitHub - 1 maintainer
neuralqa 0.0.31a0 💰
NeuralQA: Question Answering on Large Datasets12 versions - Latest release: about 5 years ago - 2 dependent repositories - 22 downloads last month - 233 stars on GitHub - 1 maintainer
freediscovery-stabilizer 1.5.dev0
Open source software for E-Discovery and Information Retrieval2 versions - Latest release: over 1 year ago - 1 dependent package - 32 downloads last month - 75 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
194 versions - Latest release: 6 days ago - 113 dependent packages - 3,374 dependent repositories - 3.97 million downloads last month - 13,122 stars on GitHub - 1 maintainer
unstructured 0.18.18
A library that prepares raw documents for downstream ML tasks.194 versions - Latest release: 6 days ago - 113 dependent packages - 3,374 dependent repositories - 3.97 million downloads last month - 13,122 stars on GitHub - 1 maintainer
easy-elasticsearch 0.0.9
An easy-to-use Elasticsearch BM25 interface8 versions - Latest release: about 3 years ago - 3 dependent repositories - 1.66 thousand downloads last month - 31 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
1 version - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 291 downloads last month - 697 stars on GitHub - 1 maintainer
tevatron 0.1.0
Tevatron: A toolkit for learning and running deep dense retrieval models.1 version - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 291 downloads last month - 697 stars on GitHub - 1 maintainer
unstructured-cpu 0.15.1
A library that prepares raw documents for downstream ML tasks.13 versions - Latest release: over 1 year ago - 54 downloads last month - 12,775 stars on GitHub - 1 maintainer
retrivex 0.1.0
Retrieval Models Explainability Toolkit. Explain embedding similarity prediction.1 version - Latest release: about 1 month ago - 59 downloads last month - 1 stars on GitHub - 1 maintainer
ccpke 2.0.0
Python Keyphrase Extraction module1 version - Latest release: almost 3 years ago - 1 dependent package - 50 downloads last month - 1,582 stars on GitHub - 1 maintainer
rakun2 0.31
RaKUn 2.0; Better faster stronger lighter13 versions - Latest release: 7 months ago - 1 dependent repositories - 598 downloads last month - 68 stars on GitHub - 1 maintainer
promptcache 0.0.1a1
A tool for caching prompts and compleation based on embedding1 version - Latest release: almost 2 years ago - 6 downloads last month - 1,947 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
116 versions - Latest release: about 1 year ago - 21 dependent packages - 237 dependent repositories - 77.8 thousand downloads last month - 23,124 stars on GitHub - 1 maintainer
farm-haystack 1.26.4
LLM framework to build customizable, production-ready LLM applications. Connect components (model...116 versions - Latest release: about 1 year ago - 21 dependent packages - 237 dependent repositories - 77.8 thousand downloads last month - 23,124 stars on GitHub - 1 maintainer
nalcos 0.1.1 💰
Search Git commits in natural language2 versions - Latest release: about 4 years ago - 1 dependent repositories - 13 downloads last month - 56 stars on GitHub - 1 maintainer
retriv 0.2.3
retriv: A Python Search Engine for Humans.10 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 540 downloads last month - 231 stars on GitHub - 1 maintainer
pyplexity 0.2.12
Perplexity filter for documents and bulk HTML and WARC boilerplate removal.18 versions - Latest release: over 2 years ago - 1 dependent repositories - 44.9 thousand downloads last month - 38 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
54 versions - Latest release: 8 days ago - 13 dependent packages - 14 dependent repositories - 29.7 thousand downloads last month - 11,746 stars on GitHub - 1 maintainer
txtai 9.1.0
All-in-one open-source AI framework for semantic search, LLM orchestration and language model wor...54 versions - Latest release: 8 days ago - 13 dependent packages - 14 dependent repositories - 29.7 thousand downloads last month - 11,746 stars on GitHub - 1 maintainer
bikidata 0.3.6
Developer-friendly Queries over RDF triples31 versions - Latest release: 8 days ago - 540 downloads last month - 10 stars on GitHub - 1 maintainer
geesedb 0.0.2
Graph Engine for Exploration and Search over Evolving DataBases2 versions - Latest release: over 4 years ago - 1 dependent repositories - 17 downloads last month - 33 stars on GitHub - 1 maintainer
fastrag 3.1.2
An Efficient Retrieval Augmentation and Generation Framework for Intel Hardware.6 versions - Latest release: 12 months ago - 90 downloads last month - 945 stars on GitHub - 1 maintainer
pvoctopus 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.15 versions - Latest release: 7 months ago - 1 dependent package - 1 dependent repositories - 68 downloads last month - 37 stars on GitHub - 1 maintainer
pvoctopusdemo 2.0.2
⚠️ DEPRECATED: This package is no longer maintained.15 versions - Latest release: 7 months ago - 1 dependent repositories - 64 downloads last month - 37 stars on GitHub - 1 maintainer
wiki-passage-retriever 0.1.4
A small tool for retrieving relevant passages to a question from Wikipedia8 versions - Latest release: almost 5 years ago - 1 dependent repositories - 19 downloads last month - 2 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
47 versions - Latest release: 3 months ago - 4 dependent packages - 7 dependent repositories - 33.8 thousand downloads last month - 594 stars on GitHub - 1 maintainer
ranx 0.3.21
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion47 versions - Latest release: 3 months ago - 4 dependent packages - 7 dependent repositories - 33.8 thousand downloads last month - 594 stars on GitHub - 1 maintainer
asoen-ocr 1.0.0
End-to-End Multi-Lingual Optical Character Recognition (OCR) Solution1 version - Latest release: over 2 years ago - 19 downloads last month - 26,519 stars on GitHub - 1 maintainer
patzilla 0.169.3
PatZilla is a modular patent information research platform and data integration toolkit. It featu...48 versions - Latest release: about 6 years ago - 1 dependent repositories - 175 downloads last month - 108 stars on GitHub - 1 maintainer
pyranker 0.1.3
A Python based package consisiting of BM25 and Vector Space Rankers for Information Retri...3 versions - Latest release: over 2 years ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
qnabot 0.0.6
Create a question answering over docs bot with one line of code.6 versions - Latest release: over 2 years ago - 1 dependent repositories - 20 downloads last month - 350 stars on GitHub - 1 maintainer
fabricator-ai 0.2.0
Conveniently generating datasets with large language models.3 versions - Latest release: about 2 years ago - 26 downloads last month - 20,653 stars on GitHub - 1 maintainer
cogexpke 1.8.1
Python Keyphrase Extraction module1 version - Latest release: almost 4 years ago - 1 dependent repositories - 14 downloads last month - 1,582 stars on GitHub - 1 maintainer
repro-eval 0.4.0
A tool to quantify the replicability and reproducibility of system-oriented IR experiments.8 versions - Latest release: over 3 years ago - 2 dependent repositories - 34 downloads last month - 11 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
1 version - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 323 downloads last month - 1,582 stars on GitHub - 1 maintainer
pke-tool 1.8.1
Python Keyphrase Extraction module1 version - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 323 downloads last month - 1,582 stars on GitHub - 1 maintainer
horusner 0.1.5
HORUS Framework3 versions - Latest release: over 8 years ago - 1 dependent repositories - 6 downloads last month - 48 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
49 versions - Latest release: 11 days ago - 9 dependent packages - 71 dependent repositories - 23.8 thousand downloads last month - 1,946 stars on GitHub - 1 maintainer
pyserini 1.3.0
A Python toolkit for reproducible information retrieval research with sparse and dense representa...49 versions - Latest release: 11 days ago - 9 dependent packages - 71 dependent repositories - 23.8 thousand downloads last month - 1,946 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
14 versions - Latest release: about 3 years ago - 2 dependent repositories - 138 downloads last month - 337 stars on GitHub - 1 maintainer
gpl 0.1.4
GPL is an unsupervised domain adaptation method for training dense retrievers. It is based on que...14 versions - Latest release: about 3 years ago - 2 dependent repositories - 138 downloads last month - 337 stars on GitHub - 1 maintainer
judie 0.0.2
Judge and evaluate your chunk quality with Judie, the Owl! Quick, easy, and effective!2 versions - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
bm25-fusion 0.1.4
An ultra-fast BM25 retriever with support for multiple variants, metadata filtering, and stopword...8 versions - Latest release: 8 months ago - 92 downloads last month - 0 stars on GitHub - 1 maintainer
pyndri 0.4
pyndri is a Python interface to the Indri search engine3 versions - Latest release: over 7 years ago - 3 dependent repositories - 21 downloads last month - 89 stars on GitHub - 1 maintainer
mseep-txtai 9.1.1
All-in-one open-source AI framework for semantic search, LLM orchestration and language model wor...1 version - Latest release: 2 months ago - 15 downloads last month - 11,716 stars on GitHub - 1 maintainer
libraft-cu13 25.10.0
RAFT: Reusable Algorithms Functions and other Tools (C++)2 versions - Latest release: about 1 month ago - 418 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu11 25.6.0
RAFT: Reusable Algorithms Functions and other Tools20 versions - Latest release: 5 months ago - 4 dependent packages - 1 dependent repositories - 1.33 thousand downloads last month - 948 stars on GitHub - 2 maintainers
libraft-cu12 25.10.0
RAFT: Reusable Algorithms Functions and other Tools (C++)5 versions - Latest release: about 1 month ago - 55.2 thousand downloads last month - 948 stars on GitHub - 1 maintainer
raft-dask-cu11 25.6.0
Reusable Accelerated Functions & Tools Dask Infrastructure20 versions - Latest release: 5 months ago - 2 dependent packages - 1 dependent repositories - 1.08 thousand downloads last month - 948 stars on GitHub - 2 maintainers
libraft-cu11 25.6.0
RAFT: Reusable Algorithms Functions and other Tools (C++)3 versions - Latest release: 5 months ago - 986 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu13 25.10.0
RAFT: Reusable Algorithms Functions and other Tools2 versions - Latest release: about 1 month ago - 575 downloads last month - 948 stars on GitHub - 1 maintainer
raft-dask-cu13 25.10.0
Reusable Accelerated Functions & Tools Dask Infrastructure2 versions - Latest release: about 1 month ago - 528 downloads last month - 948 stars on GitHub - 1 maintainer
pylibraft-cu12 25.10.0
RAFT: Reusable Algorithms Functions and other Tools17 versions - Latest release: about 1 month ago - 4 dependent packages - 1 dependent repositories - 55.7 thousand downloads last month - 602 stars on GitHub - 1 maintainer
raft-dask-cu12 25.10.0
Reusable Accelerated Functions & Tools Dask Infrastructure17 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 52 thousand downloads last month - 602 stars on GitHub - 1 maintainer
goldenretriever-core 1.0.0
Dense Retriever7 versions - Latest release: over 1 year ago - 19 downloads last month - 9 stars on GitHub - 1 maintainer
sycamore-ai 0.1.33
Sycamore is an LLM-powered semantic data preparation system for building search applications.33 versions - Latest release: 4 months ago - 1.67 thousand downloads last month - 572 stars on GitHub - 2 maintainers
atarashi 0.0.11
An intelligent license scanner.2 versions - Latest release: about 5 years ago - 1 dependent repositories - 48 downloads last month - 31 stars on GitHub - 4 maintainers
fast-plaid 1.2.0
Fast Plaid.29 versions - Latest release: 2 months ago - 11.4 thousand downloads last month - 162 stars on GitHub - 1 maintainer
ocrpy 0.3.10
unified interface to google vision, aws textract, azure & tesseract OCR tools.12 versions - Latest release: about 3 years ago - 1 dependent repositories - 103 downloads last month - 216 stars on GitHub - 1 maintainer
vtext 0.2.0
Natural Language Processing in Rust with Python bidings4 versions - Latest release: over 5 years ago - 4 dependent repositories - 48 downloads last month - 153 stars on GitHub - 1 maintainer
sinapsis-ocr 0.1.9
Implements Sinapsis templates to perform optical character recognition on images9 versions - Latest release: 2 months ago - 120 downloads last month - 20 stars on GitHub - 1 maintainer
sinapsis-doctr 0.1.7
Perform optical character recognition using the DocTR library7 versions - Latest release: 3 months ago - 109 downloads last month - 20 stars on GitHub - 1 maintainer
sinapsis-easyocr 0.1.9
Perform optical character recognition using the EasyOCR library9 versions - Latest release: 2 months ago - 125 downloads last month - 20 stars on GitHub - 1 maintainer
embed-anything-gpu 0.6.6
Embed anything at lightning speed21 versions - Latest release: 15 days ago - 232 downloads last month - 753 stars on GitHub - 1 maintainer
embed-anything 0.6.6
Embed anything at lightning speed47 versions - Latest release: 15 days ago - 562 downloads last month - 753 stars on GitHub - 1 maintainer
stealerlib 0.0.1
Python Information Stealer Library (for Windows)1 version - Latest release: over 2 years ago - 16 downloads last month - 7 stars on GitHub - 1 maintainer
archive-query-log 1.2.26
Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.48 versions - Latest release: almost 2 years ago - 750 downloads last month - 31 stars on GitHub - 4 maintainers
pylate 1.3.4
A library for training and retrieval with ColBERT.15 versions - Latest release: 27 days ago - 12.2 thousand downloads last month - 609 stars on GitHub - 1 maintainer
tirex-tracker 0.2.16
Automatic resource and metadata tracking for information retrieval experiments.17 versions - Latest release: about 1 month ago - 332 downloads last month - 8 stars on GitHub - 2 maintainers
stark-qa 0.1.3
Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases4 versions - Latest release: about 1 year ago - 161 downloads last month - 319 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
589 versions - Latest release: 16 days ago - 6 dependent packages - 25 dependent repositories - 407 thousand downloads last month - 1,441 stars on GitHub - 4 maintainers
mteb 2.1.1
Massive Text Embedding Benchmark589 versions - Latest release: 16 days ago - 6 dependent packages - 25 dependent repositories - 407 thousand downloads last month - 1,441 stars on GitHub - 4 maintainers
skifts 0.1.0
Search for the most relevant documents containing words from a query1 version - Latest release: almost 4 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
docling-haystack 0.1.1
Docling Haystack converter1 version - Latest release: 11 months ago - 1.96 thousand downloads last month - 20,653 stars on GitHub - 1 maintainer
gritlm 1.0.2
GritLM9 versions - Latest release: over 1 year ago - 7.02 thousand downloads last month - 668 stars on GitHub - 1 maintainer
llmemory 0.4.0
High-performance document memory system with vector search capabilities for Python applications1 version - Latest release: 17 days ago
hinteval 0.0.6
A Python framework designed for both generating and evaluating hints.6 versions - Latest release: 2 months ago - 53 downloads last month - 33 stars on GitHub - 1 maintainer
colpali-engine 0.3.12
The code used to train and run inference with the ColPali architecture.21 versions - Latest release: 4 months ago - 61.2 thousand downloads last month - 2,185 stars on GitHub - 2 maintainers
freediscovery 1.3.1
Open source software for E-Discovery and Information Retrieval6 versions - Latest release: over 7 years ago - 1 dependent repositories - 45 downloads last month - 75 stars on GitHub - 1 maintainer
Related Keywords
python
65
machine-learning
65
nlp
55
llm
52
rag
40
vector-search
35
natural-language-processing
34
deep-learning
27
search
26
pytorch
26
semantic-search
24
ai
23
retrieval-augmented-generation
23
embeddings
20
retrieval
19
question-answering
18
transformers
18
gpu
18
bm25
17
ranking
16
distance
16
clustering
16
vector-store
16
statistics
16
search-engine
15
vector-similarity
15
cuda
15
sparse
15
nearest-neighbors
15
neighborhood-methods
15
anns
15
data-mining
14
image-processing
14
language-model
13
ocr
13
large-language-models
13
information retrieval
11
scene-text-recognition
10
summarization
10
text-classification
10
data-science
10
keyword-extraction
10
easyocr
10
vector-database
10
chatgpt
10
keyphrase-extraction
9
linear-algebra
9
generative-ai
9
building-blocks
9
NLP
9
primitives
9
random-sampling
9
solvers
9
cnn
8
agents
8
learning
8
similarity-search
8
keyword
8
evaluation
8
embedding
8
sentence-embeddings
7
keyphrase
7
computational-linguistics
7
scene-text
7
transformer
7
bert
7
information
7
text-mining
7
llama
7
dataset
7
text-embedding
7
deep
7
neural
7
network
7
crnn
7
lstm
7
optical-character-recognition
7
numba
6
agent
6
chatbot
6
text-semantic-similarity
6
sentence-transformers
6
RAG
6
tf-idf
6
research
6
langchain
6
gpt-4
6
recognition
6
character
6
optical
6
evaluation-metrics
6
artificial-intelligence
5
colpali
5
LLM
5
trec_eval
5
lexical-search
5
gpt
5
text-processing
5
metrics
5
ml
5