Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "evaluation-framework" keyword

athina 1.2.18
Python SDK to configure and run evaluations for your LLM-based application
49 versions - Latest release: 1 day ago - 1.37 thousand downloads last month - 136 stars on GitHub - 1 maintainer
deepevals 0.2.0
Eval
1 version - Latest release: 9 months ago - 27 downloads last month - 1,635 stars on GitHub - 1 maintainer
kaiko-eva 0.0.1
Evaluation Framework for oncology foundation models.
4 versions - Latest release: about 2 months ago - 57 downloads last month - 44 stars on GitHub - 1 maintainer
thresh 1.1.1
Load and manage data collection with thresh.tools
5 versions - Latest release: 9 months ago - 119 downloads last month - 13 stars on GitHub - 1 maintainer
tonic-validate 6.0.0
RAG evaluation metrics.
24 versions - Latest release: 7 days ago - 2 dependent packages - 2.3 thousand downloads last month - 208 stars on GitHub - 1 maintainer
projectmoonshot-imda 0.3.4
A simple and modular tool to evaluate and red-team any LLM application.
10 versions - Latest release: about 2 months ago - 162 downloads last month - 21 stars on GitHub - 1 maintainer
Top 1.5% on pypi.org
lm-eval 0.4.2
A framework for evaluating language models
6 versions - Latest release: about 2 months ago - 15 dependent packages - 252 dependent repositories - 287 thousand downloads last month - 5,205 stars on GitHub - 4 maintainers
Top 9.3% on pypi.org
deepeval 0.21.40
The open-source evaluation framework for LLMs.
208 versions - Latest release: 3 days ago - 7 dependent packages - 1 dependent repository - 34.5 thousand downloads last month - 1,635 stars on GitHub - 2 maintainers
corl 3.16.2
Core ACT3 Reinforcement Learning (RL) Library - Core framework and base implementations of common...
6 versions - Latest release: about 1 month ago - 1 dependent package - 194 downloads last month - 27 stars on GitHub - 1 maintainer
testllm 0.14.1
Deep eval provides evaluation platform to accelerate development of LLMs and Agents
1 version - Latest release: 8 months ago - 32 downloads last month - 1,635 stars on GitHub - 1 maintainer
kolena 1.18.0
Client for Kolena's machine learning testing platform.
65 versions - Latest release: 2 days ago - 1 dependent repository - 7.63 thousand downloads last month - 38 stars on GitHub - 1 maintainer
kolena-client 1.18.0
Client for Kolena's machine learning testing platform.
70 versions - Latest release: 2 days ago - 1.91 thousand downloads last month - 38 stars on GitHub - 1 maintainer
tvalmetrics 1.0.2
RAG evaluation metrics.
6 versions - Latest release: 5 months ago - 1 dependent package - 78 downloads last month - 208 stars on GitHub - 1 maintainer
open-llm-benchmark 0.1.0
Evaluate the capability of open-source LLMs in Agent, formatted output, instruction following, lo...
1 version - Latest release: 13 days ago - 99 downloads last month - 0 stars on GitHub - 1 maintainer
boteval 0.1
Chat Bot Evaluation
1 version - Latest release: 2 months ago - 17 downloads last month - 5 stars on GitHub - 1 maintainer
continuous-eval 0.3.7
Open-Source Evaluation for GenAI Application Pipelines.
19 versions - Latest release: 21 days ago - 1.24 thousand downloads last month - 311 stars on GitHub - 1 maintainer
tvallogging 1.0.0
Logging for Tonic Validate
4 versions - Latest release: 5 months ago - 47 downloads last month - 198 stars on GitHub - 1 maintainer
lighteval 0.3.0
A lightweight and configurable evaluation package
8 versions - Latest release: about 2 months ago - 2.07 thousand downloads last month - 299 stars on GitHub - 3 maintainers
llmevals 0.1.0
Eval
2 versions - Latest release: 9 months ago - 30 downloads last month - 1,635 stars on GitHub - 1 maintainer
sim4rec 0.0.2
Simulator for recommendation algorithms
2 versions - Latest release: 10 months ago - 40 downloads last month - 43 stars on GitHub - 1 maintainer
zenoml-audio-transcription 0.0.4
Audio Transcription for Zeno
4 versions - Latest release: over 1 year ago - 1 dependent repository - 14 downloads last month - 205 stars on GitHub - 1 maintainer
zenoml-text-classification 0.0.2
Text Classification for Zeno
2 versions - Latest release: over 1 year ago - 1 dependent repository - 24 downloads last month - 209 stars on GitHub - 1 maintainer
zenoml-image-classification 0.0.3
Image Classification for Zeno
3 versions - Latest release: over 1 year ago - 1 dependent repository - 32 downloads last month - 209 stars on GitHub - 1 maintainer
rankeval 0.8.2
Tool for the analysis and evaluation of Learning to Rank models based on ensembles of regression ...
8 versions - Latest release: over 4 years ago - 1 dependent repository - 395 downloads last month - 87 stars on GitHub - 1 maintainer
quica 0.2.5
Quick Inter Coder Agreement in Python
6 versions - Latest release: over 3 years ago - 1 dependent repository - 60 downloads last month - 23 stars on GitHub - 1 maintainer
pydgn 1.5.6
A Python Package for Deep Graph Networks
39 versions - Latest release: 22 days ago - 1 dependent repository - 288 downloads last month - 214 stars on GitHub - 1 maintainer
orbis-eval 2.3.5
An Extendable Evaluation Pipeline for Named Entity Drill-Down Analysis
21 versions - Latest release: about 2 years ago - 1 dependent repository - 135 downloads last month - 8 stars on GitHub - 3 maintainers
irspack 0.3.1
Implicit feedback-based recommender systems, packed for practitioners.
29 versions - Latest release: 12 months ago - 1 dependent repository - 3.24 thousand downloads last month - 28 stars on GitHub - 1 maintainer
evalify 0.1.4
Evaluate your face or voice verification models literally in seconds.
5 versions - Latest release: 11 months ago - 1 dependent repository - 16 downloads last month - 19 stars on GitHub - 1 maintainer
vectory 0.1.7
Streamline the benchmark and experimentation process of your models that rely on generating embed...
8 versions - Latest release: over 1 year ago - 89 downloads last month - 64 stars on GitHub - 1 maintainer
lapixdl 0.10.0
Utils for Computer Vision Deep Learning research
37 versions - Latest release: 11 months ago - 1 dependent repository - 330 downloads last month - 9 stars on GitHub - 1 maintainer
repsys-framework 0.4.1
Framework for developing and analyzing recommender systems.
21 versions - Latest release: 9 months ago - 2 dependent repositories - 154 downloads last month - 34 stars on GitHub - 1 maintainer
pactus 0.4.2
Framework to evaluate Trajectory Classification Algorithms
14 versions - Latest release: 8 months ago - 114 downloads last month - 43 stars on GitHub - 1 maintainer
evaluation-framework 1.3
Evaluation Framework for testing and comparing graph embedding techniques
2 versions - Latest release: over 4 years ago - 1 dependent repository - 19 downloads last month - 10 stars on GitHub - 1 maintainer
tieval 0.1.2
A framework for evaluation and development of temporal-aware models.
11 versions - Latest release: 2 months ago - 1 dependent repository - 72 downloads last month - 14 stars on GitHub - 1 maintainer
nlproc-tools 0.1.0
Load and manage data collection with nlproc.tools
3 versions - Latest release: 10 months ago - 19 downloads last month - 10 stars on GitHub - 1 maintainer
zenoml 0.6.4
Interactive Evaluation Framework for Machine Learning
51 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repository - 577 downloads last month - 208 stars on GitHub - 1 maintainer
gan-evaluator 1.15
GAN Evaluator for IS and FID
9 versions - Latest release: about 1 year ago - 16 downloads last month - 10 stars on GitHub - 1 maintainer
gval 0.2.5
Flexible, portable, and efficient geospatial evaluations for a variety of data.
13 versions - Latest release: 4 months ago - 144 downloads last month - 22 stars on GitHub - 1 maintainer
zenoml-image-segmentation 0.0.1
Image Segmentation for Zeno
1 version - Latest release: almost 2 years ago - 1 dependent repository - 8 downloads last month - 208 stars on GitHub - 1 maintainer
edgerun-galileo 0.10.4
Galileo: A framework for distributed load testing experiments
20 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 61 downloads last month - 10 stars on GitHub - 2 maintainers
Related Keywords
evaluation-metrics (17), evaluation (13), python (12), machine-learning (10), testing (7), llmops (7), llm-evaluation (6), ml (5), llm (5), ai (5), data-science (5), retrieval-augmented-generation (4), rag (4), llm-evaluation-framework (4), large-language-models (4), nlp (4), llm-evaluation-metrics (4), llms (3), deep-learning (3), thresh (2), natural-language-processing (2), annotation-tool (2), analysis-framework (2), huggingface (2), mlops (2), evaluate-models (2), recommender-systems (2), ML (2), Kolena (2), information-extraction (1), event classification (1), fid-score (1), event identification (1), trajectory (1), temporal-relations (1), machine learning (1), GAN (1), evaluator (1), IS (1), FID (1), inception (1), dcgan (1), fid (1), classification (1), classification-models (1), trajectory-analysis (1), transformers (1), graph-embedding (1), library (1), benchmark-framework (1), machine learning tasks (1), semantic tasks (1), comparison (1), embedding-algorithm (1), machine-learning-algorithms (1), semantic-tasks (1), temporal information (1), temporal information extraction (1), temporal relation classification (1), temporal relation extraction (1), temporal expression identification (1), environment (1), flood-inundation (1), forecast-skill (1), gdal (1), geography (1), geopandas (1), hydrology (1), land-surface (1), landcover (1), research (1), science (1), spatial-analysis (1), spatial-temporal (1), statistics (1), xarray (1), distributed-load-testing (1), load-testing (1), frechet-inception-distance (1), gan (1), gan-evaluation (1), generation (1), generative-adversarial-network (1), generative-model (1), inception-score (1), metrics (1), pytorch (1), pytorch-gan (1), pytorch-implmention (1), svhn-dataset (1), torchvision (1), geospatial (1), evaluations (1), climate (1), earth-science (1), llm-eval (1), reinforcement-learning-algorithms (1), reinforcement-learning-environments (1), llamacpp (1), llm-agent (1)