Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "evaluation-framework" keyword
Top 1.5% on pypi.org
6 versions - Latest release: 3 months ago - 15 dependent packages - 252 dependent repositories - 187 thousand downloads last month - 5,485 stars on GitHub - 4 maintainers
lm-eval 0.4.2
A framework for evaluating language models6 versions - Latest release: 3 months ago - 15 dependent packages - 252 dependent repositories - 187 thousand downloads last month - 5,485 stars on GitHub - 4 maintainers
repsys-framework 0.4.1
Framework for developing and analyzing recommender systems.21 versions - Latest release: 10 months ago - 2 dependent repositories - 25 downloads last month - 34 stars on GitHub - 1 maintainer
vectory 0.1.7
Streamline the benchmark and experimentation process of your models that rely on generating embed...8 versions - Latest release: over 1 year ago - 51 downloads last month - 64 stars on GitHub - 1 maintainer
pactus 0.4.2
Framework to evaluate Trajectory Classification Algorithms14 versions - Latest release: 9 months ago - 31 downloads last month - 43 stars on GitHub - 1 maintainer
gval 0.2.5
Flexible, portable, and efficient geospatial evaluations for a variety of data.13 versions - Latest release: 5 months ago - 179 downloads last month - 22 stars on GitHub - 1 maintainer
evaluation-framework 1.3
Evaluation Framework for testing and comparing graph embedding techniques2 versions - Latest release: over 4 years ago - 1 dependent repositories - 11 downloads last month - 10 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
219 versions - Latest release: 28 days ago - 7 dependent packages - 1 dependent repositories - 49.1 thousand downloads last month - 1,929 stars on GitHub - 2 maintainers
deepeval 0.21.40
The open-source evaluation framework for LLMs.219 versions - Latest release: 28 days ago - 7 dependent packages - 1 dependent repositories - 49.1 thousand downloads last month - 1,929 stars on GitHub - 2 maintainers
tvallogging 1.0.0
Logging for Tonic Validate4 versions - Latest release: 6 months ago - 18 downloads last month - 208 stars on GitHub - 1 maintainer
tieval 0.1.2
A framework for evaluation and development of temporal-aware models.11 versions - Latest release: 3 months ago - 1 dependent repositories - 41 downloads last month - 15 stars on GitHub - 1 maintainer
nlproc-tools 0.1.0
Load and manage data collection with nlproc.tools3 versions - Latest release: 11 months ago - 8 downloads last month - 14 stars on GitHub - 1 maintainer
athina 1.2.19
Python SDK to configure and run evaluations for your LLM-based application63 versions - Latest release: 25 days ago - 3.23 thousand downloads last month - 139 stars on GitHub - 1 maintainer
rankeval 0.8.2
Tool for the analysis and evaluation of Learning to Rank models based on ensembles of regression ...8 versions - Latest release: over 4 years ago - 1 dependent repositories - 395 downloads last month - 87 stars on GitHub - 1 maintainer
kolena 1.18.0
Client for Kolena's machine learning testing platform.68 versions - Latest release: 27 days ago - 1 dependent repositories - 7.13 thousand downloads last month - 38 stars on GitHub - 1 maintainer
kolena-client 1.18.0
Client for Kolena's machine learning testing platform.73 versions - Latest release: 27 days ago - 1.19 thousand downloads last month - 38 stars on GitHub - 1 maintainer
corl 3.16.2
Core ACT3 Reinforcement Learning (RL) Library - Core framework and base implementations of common...6 versions - Latest release: about 2 months ago - 1 dependent package - 83 downloads last month - 29 stars on GitHub - 1 maintainer
zenoml-image-classification 0.0.3
Image Classification for Zeno3 versions - Latest release: almost 2 years ago - 1 dependent repositories - 32 downloads last month - 209 stars on GitHub - 1 maintainer
evalify 0.1.4
Evaluate your face or voice verification models literally in seconds.5 versions - Latest release: 12 months ago - 1 dependent repositories - 41 downloads last month - 19 stars on GitHub - 1 maintainer
continuous-eval 0.3.7
Open-Source Evaluation for GenAI Application Pipelines.22 versions - Latest release: about 2 months ago - 1.25 thousand downloads last month - 330 stars on GitHub - 1 maintainer
zenoml 0.6.4
Interactive Evaluation Framework for Machine Learning51 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repositories - 164 downloads last month - 209 stars on GitHub - 1 maintainer
zenoml-text-classification 0.0.2
Text Classification for Zeno2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 9 downloads last month - 209 stars on GitHub - 1 maintainer
kaiko-eva 0.0.1
Evaluation Framework for oncology foundation models.5 versions - Latest release: 3 months ago - 45 downloads last month - 45 stars on GitHub - 1 maintainer
gan-evaluator 1.15
GAN Evaluator for IS and FID9 versions - Latest release: about 1 year ago - 32 downloads last month - 10 stars on GitHub - 1 maintainer
sim4rec 0.0.2
Simulator for recommendation algorithms2 versions - Latest release: 10 months ago - 32 downloads last month - 42 stars on GitHub - 1 maintainer
quica 0.2.5
Quick Inter Coder Agreement in Python6 versions - Latest release: over 3 years ago - 1 dependent repositories - 67 downloads last month - 23 stars on GitHub - 1 maintainer
pydgn 1.5.6
A Python Package for Deep Graph Networks39 versions - Latest release: about 2 months ago - 1 dependent repositories - 302 downloads last month - 214 stars on GitHub - 1 maintainer
orbis-eval 2.3.5
An Extendable Evaluation Pipeline for Named Entity Drill-Down Analysis21 versions - Latest release: over 2 years ago - 1 dependent repositories - 65 downloads last month - 8 stars on GitHub - 3 maintainers
zenoml-image-segmentation 0.0.1
Image Segmentation for Zeno1 version - Latest release: almost 2 years ago - 1 dependent repositories - 11 downloads last month - 209 stars on GitHub - 1 maintainer
edgerun-galileo 0.10.4
Galileo: A framework for distributed load testing experiments20 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 69 downloads last month - 10 stars on GitHub - 2 maintainers
deepevals 0.2.0
Eval1 version - Latest release: 10 months ago - 27 downloads last month - 1,635 stars on GitHub - 1 maintainer
thresh 1.1.1
Load and manage data collection with thresh.tools5 versions - Latest release: 10 months ago - 119 downloads last month - 13 stars on GitHub - 1 maintainer
tonic-validate 6.0.0
RAG evaluation metrics.24 versions - Latest release: about 1 month ago - 2 dependent packages - 2.3 thousand downloads last month - 208 stars on GitHub - 1 maintainer
projectmoonshot-imda 0.3.4
A simple and modular tool to evaluate and red-team any LLM application.10 versions - Latest release: 3 months ago - 162 downloads last month - 21 stars on GitHub - 1 maintainer
testllm 0.14.1
Deep eval provides evaluation platform to accelerate development of LLMs and Agents1 version - Latest release: 9 months ago - 32 downloads last month - 1,635 stars on GitHub - 1 maintainer
tvalmetrics 1.0.2
RAG evaluation metrics.6 versions - Latest release: 6 months ago - 1 dependent package - 78 downloads last month - 208 stars on GitHub - 1 maintainer
open-llm-benchmark 0.1.0
Evaluate the capability of open-source LLMs in Agent, formatted output, instruction following, lo...1 version - Latest release: about 1 month ago - 99 downloads last month - 0 stars on GitHub - 1 maintainer
boteval 0.1
Chat Bot Evaluation1 version - Latest release: 3 months ago - 17 downloads last month - 5 stars on GitHub - 1 maintainer
lighteval 0.3.0
A lightweight and configurable evaluation package8 versions - Latest release: 2 months ago - 2.07 thousand downloads last month - 299 stars on GitHub - 3 maintainers
llmevals 0.1.0
Eval2 versions - Latest release: 10 months ago - 30 downloads last month - 1,635 stars on GitHub - 1 maintainer
zenoml-audio-transcription 0.0.4
Audio Transcription for Zeno4 versions - Latest release: almost 2 years ago - 1 dependent repositories - 14 downloads last month - 205 stars on GitHub - 1 maintainer
irspack 0.3.1
Implicit feedback-based recommender systems, packed for practitioners.29 versions - Latest release: about 1 year ago - 1 dependent repositories - 3.24 thousand downloads last month - 28 stars on GitHub - 1 maintainer
lapixdl 0.10.0
Utils for Computer Vision Deep Learning research37 versions - Latest release: 12 months ago - 1 dependent repositories - 330 downloads last month - 9 stars on GitHub - 1 maintainer
Related Keywords
evaluation-metrics
17
evaluation
13
python
12
machine-learning
10
llmops
7
testing
7
llm-evaluation
6
llm
5
data-science
5
ai
5
ml
5
retrieval-augmented-generation
4
rag
4
large-language-models
4
llm-evaluation-metrics
4
llm-evaluation-framework
4
nlp
4
deep-learning
3
llms
3
Kolena
2
ML
2
evaluate-models
2
mlops
2
recommender-systems
2
huggingface
2
analysis-framework
2
thresh
2
natural-language-processing
2
annotation-tool
2
fid-score
1
fid
1
dcgan
1
frechet-inception-distance
1
gan
1
gan-evaluation
1
inception
1
FID
1
IS
1
evaluator
1
GAN
1
foundation-models
1
oncology
1
machine learning
1
information-retrieval
1
face-verification
1
face-recognition
1
reinforcement-learning-environments
1
reinforcement-learning-algorithms
1
reinforcement-learning
1
ACT3
1
act3
1
distributed-load-testing
1
load-testing
1
benchmarking
1
red-teaming
1
trustworthy-ai
1
llamacpp
1
llm-agent
1
llms-benchmarking
1
openai
1
vllm
1
chatbot
1
chatbot-framework
1
mturk
1
eigen
1
hyperparameter-optimization
1
knn-algorithm
1
matrix-factorization
1
optuna
1
pybind11
1
computer-vision
1
image-processing
1
generation
1
generative-adversarial-network
1
generative-model
1
inception-score
1
metrics
1
pytorch
1
pytorch-gan
1
pytorch-implmention
1
svhn-dataset
1
torchvision
1
recommendation
1
recommender-system
1
rl-training
1
synthetic-data
1
user-modeling
1
quica
1
inter-coder-agreement
1
inter-rater-agreement
1
deep-graph-networks
1
deep-learning-for-graphs
1
nel
1
geospatial
1
evaluations
1
climate
1
earth-science
1
environment
1
flood-inundation
1
forecast-skill
1