Ecosyste.ms: Packages
An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.
pypi.org "evaluation-framework" keyword
athina 1.2.18
Python SDK to configure and run evaluations for your LLM-based application
49 versions - Latest release: 1 day ago - 1.37 thousand downloads last month - 136 stars on GitHub - 1 maintainer
deepevals 0.2.0
Eval
1 version - Latest release: 9 months ago - 27 downloads last month - 1,635 stars on GitHub - 1 maintainer
kaiko-eva 0.0.1
Evaluation Framework for oncology foundation models.
4 versions - Latest release: about 2 months ago - 57 downloads last month - 44 stars on GitHub - 1 maintainer
thresh 1.1.1
Load and manage data collection with thresh.tools
5 versions - Latest release: 9 months ago - 119 downloads last month - 13 stars on GitHub - 1 maintainer
tonic-validate 6.0.0
RAG evaluation metrics.
24 versions - Latest release: 7 days ago - 2 dependent packages - 2.3 thousand downloads last month - 208 stars on GitHub - 1 maintainer
projectmoonshot-imda 0.3.4
A simple and modular tool to evaluate and red-team any LLM application.
10 versions - Latest release: about 2 months ago - 162 downloads last month - 21 stars on GitHub - 1 maintainer
lm-eval 0.4.2
Top 1.5% on pypi.org
A framework for evaluating language models
6 versions - Latest release: about 2 months ago - 15 dependent packages - 252 dependent repositories - 287 thousand downloads last month - 5,205 stars on GitHub - 4 maintainers
deepeval 0.21.40
Top 9.3% on pypi.org
The open-source evaluation framework for LLMs.
208 versions - Latest release: 3 days ago - 7 dependent packages - 1 dependent repository - 34.5 thousand downloads last month - 1,635 stars on GitHub - 2 maintainers
corl 3.16.2
Core ACT3 Reinforcement Learning (RL) Library - Core framework and base implementations of common...
6 versions - Latest release: about 1 month ago - 1 dependent package - 194 downloads last month - 27 stars on GitHub - 1 maintainer
testllm 0.14.1
Deep eval provides evaluation platform to accelerate development of LLMs and Agents
1 version - Latest release: 8 months ago - 32 downloads last month - 1,635 stars on GitHub - 1 maintainer
kolena 1.18.0
Client for Kolena's machine learning testing platform.
65 versions - Latest release: 2 days ago - 1 dependent repository - 7.63 thousand downloads last month - 38 stars on GitHub - 1 maintainer
kolena-client 1.18.0
Client for Kolena's machine learning testing platform.
70 versions - Latest release: 2 days ago - 1.91 thousand downloads last month - 38 stars on GitHub - 1 maintainer
tvalmetrics 1.0.2
RAG evaluation metrics.
6 versions - Latest release: 5 months ago - 1 dependent package - 78 downloads last month - 208 stars on GitHub - 1 maintainer
open-llm-benchmark 0.1.0
Evaluate the capability of open-source LLMs in Agent, formatted output, instruction following, lo...
1 version - Latest release: 13 days ago - 99 downloads last month - 0 stars on GitHub - 1 maintainer
boteval 0.1
Chat Bot Evaluation
1 version - Latest release: 2 months ago - 17 downloads last month - 5 stars on GitHub - 1 maintainer
continuous-eval 0.3.7
Open-Source Evaluation for GenAI Application Pipelines.
19 versions - Latest release: 21 days ago - 1.24 thousand downloads last month - 311 stars on GitHub - 1 maintainer
tvallogging 1.0.0
Logging for Tonic Validate
4 versions - Latest release: 5 months ago - 47 downloads last month - 198 stars on GitHub - 1 maintainer
lighteval 0.3.0
A lightweight and configurable evaluation package
8 versions - Latest release: about 2 months ago - 2.07 thousand downloads last month - 299 stars on GitHub - 3 maintainers
llmevals 0.1.0
Eval
2 versions - Latest release: 9 months ago - 30 downloads last month - 1,635 stars on GitHub - 1 maintainer
sim4rec 0.0.2
Simulator for recommendation algorithms
2 versions - Latest release: 10 months ago - 40 downloads last month - 43 stars on GitHub - 1 maintainer
zenoml-audio-transcription 0.0.4
Audio Transcription for Zeno
4 versions - Latest release: over 1 year ago - 1 dependent repository - 14 downloads last month - 205 stars on GitHub - 1 maintainer
zenoml-text-classification 0.0.2
Text Classification for Zeno
2 versions - Latest release: over 1 year ago - 1 dependent repository - 24 downloads last month - 209 stars on GitHub - 1 maintainer
zenoml-image-classification 0.0.3
Image Classification for Zeno
3 versions - Latest release: over 1 year ago - 1 dependent repository - 32 downloads last month - 209 stars on GitHub - 1 maintainer
rankeval 0.8.2
Tool for the analysis and evaluation of Learning to Rank models based on ensembles of regression ...
8 versions - Latest release: over 4 years ago - 1 dependent repository - 395 downloads last month - 87 stars on GitHub - 1 maintainer
quica 0.2.5
Quick Inter Coder Agreement in Python
6 versions - Latest release: over 3 years ago - 1 dependent repository - 60 downloads last month - 23 stars on GitHub - 1 maintainer
pydgn 1.5.6
A Python Package for Deep Graph Networks
39 versions - Latest release: 22 days ago - 1 dependent repository - 288 downloads last month - 214 stars on GitHub - 1 maintainer
orbis-eval 2.3.5
An Extendable Evaluation Pipeline for Named Entity Drill-Down Analysis
21 versions - Latest release: about 2 years ago - 1 dependent repository - 135 downloads last month - 8 stars on GitHub - 3 maintainers
irspack 0.3.1
Implicit feedback-based recommender systems, packed for practitioners.
29 versions - Latest release: 12 months ago - 1 dependent repository - 3.24 thousand downloads last month - 28 stars on GitHub - 1 maintainer
evalify 0.1.4
Evaluate your face or voice verification models literally in seconds.
5 versions - Latest release: 11 months ago - 1 dependent repository - 16 downloads last month - 19 stars on GitHub - 1 maintainer
vectory 0.1.7
Streamline the benchmark and experimentation process of your models that rely on generating embed...
8 versions - Latest release: over 1 year ago - 89 downloads last month - 64 stars on GitHub - 1 maintainer
lapixdl 0.10.0
Utils for Computer Vision Deep Learning research
37 versions - Latest release: 11 months ago - 1 dependent repository - 330 downloads last month - 9 stars on GitHub - 1 maintainer
repsys-framework 0.4.1
Framework for developing and analyzing recommender systems.
21 versions - Latest release: 9 months ago - 2 dependent repositories - 154 downloads last month - 34 stars on GitHub - 1 maintainer
pactus 0.4.2
Framework to evaluate Trajectory Classification Algorithms
14 versions - Latest release: 8 months ago - 114 downloads last month - 43 stars on GitHub - 1 maintainer
evaluation-framework 1.3
Evaluation Framework for testing and comparing graph embedding techniques
2 versions - Latest release: over 4 years ago - 1 dependent repository - 19 downloads last month - 10 stars on GitHub - 1 maintainer
tieval 0.1.2
A framework for evaluation and development of temporal-aware models.
11 versions - Latest release: 2 months ago - 1 dependent repository - 72 downloads last month - 14 stars on GitHub - 1 maintainer
nlproc-tools 0.1.0
Load and manage data collection with nlproc.tools
3 versions - Latest release: 10 months ago - 19 downloads last month - 10 stars on GitHub - 1 maintainer
zenoml 0.6.4
Interactive Evaluation Framework for Machine Learning
51 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repository - 577 downloads last month - 208 stars on GitHub - 1 maintainer
gan-evaluator 1.15
GAN Evaluator for IS and FID
9 versions - Latest release: about 1 year ago - 16 downloads last month - 10 stars on GitHub - 1 maintainer
gval 0.2.5
Flexible, portable, and efficient geospatial evaluations for a variety of data.
13 versions - Latest release: 4 months ago - 144 downloads last month - 22 stars on GitHub - 1 maintainer
zenoml-image-segmentation 0.0.1
Image Segmentation for Zeno
1 version - Latest release: almost 2 years ago - 1 dependent repository - 8 downloads last month - 208 stars on GitHub - 1 maintainer
edgerun-galileo 0.10.4
Galileo: A framework for distributed load testing experiments
20 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 61 downloads last month - 10 stars on GitHub - 2 maintainers
Related Keywords
evaluation-metrics (17)
evaluation (13)
python (12)
machine-learning (10)
testing (7)
llmops (7)
llm-evaluation (6)
ml (5)
llm (5)
ai (5)
data-science (5)
retrieval-augmented-generation (4)
rag (4)
llm-evaluation-framework (4)
large-language-models (4)
nlp (4)
llm-evaluation-metrics (4)
llms (3)
deep-learning (3)
thresh (2)
natural-language-processing (2)
annotation-tool (2)
analysis-framework (2)
huggingface (2)
mlops (2)
evaluate-models (2)
recommender-systems (2)
ML (2)
Kolena (2)
information-extraction (1)
event classification (1)
fid-score (1)
event identification (1)
trajectory (1)
temporal-relations (1)
machine learning (1)
GAN (1)
evaluator (1)
IS (1)
FID (1)
inception (1)
dcgan (1)
fid (1)
classification (1)
classification-models (1)
trajectory-analysis (1)
transformers (1)
graph-embedding (1)
library (1)
benchmark-framework (1)
machine learning tasks (1)
semantic tasks (1)
comparison (1)
embedding-algorithm (1)
machine-learning-algorithms (1)
semantic-tasks (1)
temporal information (1)
temporal information extraction (1)
temporal relation classification (1)
temporal relation extraction (1)
temporal expression identification (1)
environment (1)
flood-inundation (1)
forecast-skill (1)
gdal (1)
geography (1)
geopandas (1)
hydrology (1)
land-surface (1)
landcover (1)
research (1)
science (1)
spatial-analysis (1)
spatial-temporal (1)
statistics (1)
xarray (1)
distributed-load-testing (1)
load-testing (1)
frechet-inception-distance (1)
gan (1)
gan-evaluation (1)
generation (1)
generative-adversarial-network (1)
generative-model (1)
inception-score (1)
metrics (1)
pytorch (1)
pytorch-gan (1)
pytorch-implmention (1)
svhn-dataset (1)
torchvision (1)
geospatial (1)
evaluations (1)
climate (1)
earth-science (1)
llm-eval (1)
reinforcement-learning-algorithms (1)
reinforcement-learning-environments (1)
llamacpp (1)
llm-agent (1)