Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "evaluation" keyword
Top 3.1% on pypi.org
166 versions - Latest release: 4 days ago - 86 dependent packages - 2,234 dependent repositories - 7.97 million downloads last month - 217 stars on GitHub - 1 maintainer
langsmith 0.1.57
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.166 versions - Latest release: 4 days ago - 86 dependent packages - 2,234 dependent repositories - 7.97 million downloads last month - 217 stars on GitHub - 1 maintainer
Top 1.2% on pypi.org
15 versions - Latest release: 15 days ago - 222 dependent packages - 2,474 dependent repositories - 2.58 million downloads last month - 1,762 stars on GitHub - 3 maintainers
evaluate 0.4.2
HuggingFace community-driven open-source library of evaluation15 versions - Latest release: 15 days ago - 222 dependent packages - 2,474 dependent repositories - 2.58 million downloads last month - 1,762 stars on GitHub - 3 maintainers
Top 1.3% on pypi.org
69 versions - Latest release: about 1 month ago - 96 dependent packages - 4,263 dependent repositories - 1.84 million downloads last month - 967 stars on GitHub - 3 maintainers
sacrebleu 2.4.2
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores69 versions - Latest release: about 1 month ago - 96 dependent packages - 4,263 dependent repositories - 1.84 million downloads last month - 967 stars on GitHub - 3 maintainers
Top 1.8% on pypi.org
18 versions - Latest release: about 1 year ago - 59 dependent packages - 290 dependent repositories - 1.22 million downloads last month - 424 stars on GitHub - 1 maintainer
simpleeval 0.9.13 💰
A simple, safe single expression evaluator library.18 versions - Latest release: about 1 year ago - 59 dependent packages - 290 dependent repositories - 1.22 million downloads last month - 424 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
3 versions - Latest release: almost 3 years ago - 9 dependent packages - 773 dependent repositories - 282 thousand downloads last month - 870 stars on GitHub - 1 maintainer
torch-fidelity 0.3.0
High-fidelity performance metrics for generative models in PyTorch3 versions - Latest release: almost 3 years ago - 9 dependent packages - 773 dependent repositories - 282 thousand downloads last month - 870 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
321 versions - Latest release: about 5 hours ago - 17 dependent packages - 1 dependent repositories - 225 thousand downloads last month - 2,823 stars on GitHub - 1 maintainer
langfuse 2.31.0
A client library for accessing langfuse321 versions - Latest release: about 5 hours ago - 17 dependent packages - 1 dependent repositories - 225 thousand downloads last month - 2,823 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
9 versions - Latest release: over 1 year ago - 13 dependent packages - 398 dependent repositories - 127 thousand downloads last month - 1,326 stars on GitHub - 1 maintainer
motmetrics 1.4.0
Metrics for multiple object tracker benchmarking.9 versions - Latest release: over 1 year ago - 13 dependent packages - 398 dependent repositories - 127 thousand downloads last month - 1,326 stars on GitHub - 1 maintainer
Top 1.1% on pypi.org
43 versions - Latest release: 10 months ago - 31 dependent packages - 56 dependent repositories - 108 thousand downloads last month - 186 stars on GitHub - 2 maintainers
configspace 0.7.2
Creation and manipulation of parameter configuration spaces for automated algorithm configuration...43 versions - Latest release: 10 months ago - 31 dependent packages - 56 dependent repositories - 108 thousand downloads last month - 186 stars on GitHub - 2 maintainers
Top 3.3% on pypi.org
3 versions - Latest release: over 3 years ago - 9 dependent packages - 36 dependent repositories - 104 thousand downloads last month - 249 stars on GitHub - 1 maintainer
pytrec-eval 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.3 versions - Latest release: over 3 years ago - 9 dependent packages - 36 dependent repositories - 104 thousand downloads last month - 249 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
100 versions - Latest release: 6 days ago - 18 dependent repositories - 94.4 thousand downloads last month - 3,023 stars on GitHub - 1 maintainer
evo 1.28.0
Python package for the evaluation of odometry and SLAM100 versions - Latest release: 6 days ago - 18 dependent repositories - 94.4 thousand downloads last month - 3,023 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
9 versions - Latest release: about 1 month ago - 4 dependent repositories - 79.5 thousand downloads last month - 10 stars on GitHub - 1 maintainer
fore 0.1.7
fore ai packages9 versions - Latest release: about 1 month ago - 4 dependent repositories - 79.5 thousand downloads last month - 10 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
2 versions - Latest release: almost 3 years ago - 1 dependent package - 14 dependent repositories - 71.8 thousand downloads last month - 21 stars on GitHub - 1 maintainer
nf1 0.0.4
NF1: Normalized F1 score for community evaluation against ground truth2 versions - Latest release: almost 3 years ago - 1 dependent package - 14 dependent repositories - 71.8 thousand downloads last month - 21 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
7 versions - Latest release: 9 months ago - 25 dependent packages - 8 dependent repositories - 70 thousand downloads last month - 194 stars on GitHub - 1 maintainer
torcheval 0.0.7
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...7 versions - Latest release: 9 months ago - 25 dependent packages - 8 dependent repositories - 70 thousand downloads last month - 194 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
44 versions - Latest release: almost 6 years ago - 4 dependent packages - 50 dependent repositories - 45.4 thousand downloads last month - 1,430 stars on GitHub - 3 maintainers
pycm 0.9.5 💰
Multi-class confusion matrix library in Python44 versions - Latest release: almost 6 years ago - 4 dependent packages - 50 dependent repositories - 45.4 thousand downloads last month - 1,430 stars on GitHub - 3 maintainers
Top 5.2% on pypi.org
6 versions - Latest release: 7 months ago - 1 dependent package - 11 dependent repositories - 31.6 thousand downloads last month - 248 stars on GitHub - 1 maintainer
pytrec-eval-terrier 0.5.6
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.6 versions - Latest release: 7 months ago - 1 dependent package - 11 dependent repositories - 31.6 thousand downloads last month - 248 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
25 versions - Latest release: 10 months ago - 3 dependent packages - 6 dependent repositories - 16.4 thousand downloads last month - 19 stars on GitHub - 1 maintainer
evalidate 2.0.2
Validation and secure evaluation of untrusted python expressions25 versions - Latest release: 10 months ago - 3 dependent packages - 6 dependent repositories - 16.4 thousand downloads last month - 19 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
45 versions - Latest release: 6 months ago - 4 dependent packages - 7 dependent repositories - 13.2 thousand downloads last month - 348 stars on GitHub - 1 maintainer
ranx 0.3.19
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion45 versions - Latest release: 6 months ago - 4 dependent packages - 7 dependent repositories - 13.2 thousand downloads last month - 348 stars on GitHub - 1 maintainer
fstring 1.7.4
Working with strings has never been prettier.11 versions - Latest release: almost 6 years ago - 5 dependent repositories - 12.5 thousand downloads last month - 8 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
28 versions - Latest release: over 1 year ago - 1 dependent repositories - 12.1 thousand downloads last month - 105 stars on GitHub - 2 maintainers
xlcalculator 0.5.0
Converts MS Excel formulas to Python and evaluates them.28 versions - Latest release: over 1 year ago - 1 dependent repositories - 12.1 thousand downloads last month - 105 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
479 versions - Latest release: 2 days ago - 2 dependent packages - 1 dependent repositories - 9.82 thousand downloads last month - 156 stars on GitHub - 1 maintainer
torcheval-nightly 2024.5.13
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...479 versions - Latest release: 2 days ago - 2 dependent packages - 1 dependent repositories - 9.82 thousand downloads last month - 156 stars on GitHub - 1 maintainer
synthesized-datasets 1.7
Publically available datasets for benchmarking and evaluation.15 versions - Latest release: 2 months ago - 3 dependent packages - 7.72 thousand downloads last month - 1 stars on GitHub - 1 maintainer
kolena 1.18.0
Client for Kolena's machine learning testing platform.65 versions - Latest release: 1 day ago - 1 dependent repositories - 7.63 thousand downloads last month - 38 stars on GitHub - 1 maintainer
alpaca-eval 0.6.2
AlpacaEval : An Automatic Evaluator of Instruction-following Models33 versions - Latest release: 26 days ago - 2 dependent packages - 7.19 thousand downloads last month - 1,062 stars on GitHub - 3 maintainers
Top 7.7% on pypi.org
23 versions - Latest release: about 1 month ago - 5 dependent repositories - 6.59 thousand downloads last month - 12 stars on GitHub - 1 maintainer
insight 1.0
A python library for monitoring, comparing and extracting insights from data.23 versions - Latest release: about 1 month ago - 5 dependent repositories - 6.59 thousand downloads last month - 12 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
9 versions - Latest release: almost 2 years ago - 6 dependent packages - 15 dependent repositories - 5.77 thousand downloads last month - 689 stars on GitHub - 1 maintainer
rliable 1.0.8
rliable: Reliable evaluation on reinforcement learning and machine learning benchmarks.9 versions - Latest release: almost 2 years ago - 6 dependent packages - 15 dependent repositories - 5.77 thousand downloads last month - 689 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
48 versions - Latest release: 12 days ago - 2 dependent packages - 1 dependent repositories - 5.39 thousand downloads last month - 2,015 stars on GitHub - 2 maintainers
uptrain 0.7.0
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, r...48 versions - Latest release: 12 days ago - 2 dependent packages - 1 dependent repositories - 5.39 thousand downloads last month - 2,015 stars on GitHub - 2 maintainers
replay-rec 0.16.0
RecSys Library17 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 3.8 thousand downloads last month - 122 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
655 versions - Latest release: 19 days ago - 2 dependent repositories - 3.58 thousand downloads last month - 3,951 stars on GitHub - 2 maintainers
coconut-develop 3.1.0.post0.dev11 💰
Simple, elegant, Pythonic functional programming.655 versions - Latest release: 19 days ago - 2 dependent repositories - 3.58 thousand downloads last month - 3,951 stars on GitHub - 2 maintainers
Top 6.7% on pypi.org
111 versions - Latest release: 1 day ago - 2 dependent repositories - 3.37 thousand downloads last month - 575 stars on GitHub - 1 maintainer
agenta 0.14.8
The SDK for agenta is an open-source LLMOps platform.111 versions - Latest release: 1 day ago - 2 dependent repositories - 3.37 thousand downloads last month - 575 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
19 versions - Latest release: 6 months ago - 13 dependent repositories - 3.08 thousand downloads last month - 402 stars on GitHub - 4 maintainers
errant 3.0.0
The ERRor ANnotation Toolkit (ERRANT). Automatically extract and classify edits in parallel sente...19 versions - Latest release: 6 months ago - 13 dependent repositories - 3.08 thousand downloads last month - 402 stars on GitHub - 4 maintainers
tmtoolkit 0.12.0
Text Mining and Topic Modeling Toolkit35 versions - Latest release: about 1 year ago - 2 dependent packages - 10 dependent repositories - 2.96 thousand downloads last month - 12 stars on GitHub - 2 maintainers
Top 3.6% on pypi.org
41 versions - Latest release: 2 months ago - 3 dependent packages - 22 dependent repositories - 2.81 thousand downloads last month - 3,951 stars on GitHub - 1 maintainer
coconut 3.1.0 💰
Simple, elegant, Pythonic functional programming.41 versions - Latest release: 2 months ago - 3 dependent packages - 22 dependent repositories - 2.81 thousand downloads last month - 3,951 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
9 versions - Latest release: about 5 years ago - 41 dependent repositories - 2.77 thousand downloads last month - 166 stars on GitHub - 1 maintainer
keras-metrics 1.1.0
Metrics for Keras model evaluation9 versions - Latest release: about 5 years ago - 41 dependent repositories - 2.77 thousand downloads last month - 166 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
2 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 2.67 thousand downloads last month - 687 stars on GitHub - 1 maintainer
tidecv 1.0.1
A General Toolbox for Identifying ObjectDetection Errors2 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 2.67 thousand downloads last month - 687 stars on GitHub - 1 maintainer
langcheck 0.7.1
Simple, Pythonic building blocks to evaluate LLM-based applications12 versions - Latest release: 7 days ago - 2.57 thousand downloads last month - 137 stars on GitHub - 3 maintainers
cer 1.2.0
Translation Edit Rate on the character level4 versions - Latest release: over 1 year ago - 4 dependent packages - 4 dependent repositories - 2.25 thousand downloads last month - 1 stars on GitHub - 1 maintainer
lighteval 0.3.0
A lightweight and configurable evaluation package8 versions - Latest release: about 2 months ago - 2.07 thousand downloads last month - 299 stars on GitHub - 3 maintainers
kolena-client 1.18.0
Client for Kolena's machine learning testing platform.70 versions - Latest release: 1 day ago - 1.91 thousand downloads last month - 38 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
13 versions - Latest release: about 9 hours ago - 3 dependent repositories - 1.85 thousand downloads last month - 31 stars on GitHub - 1 maintainer
codebleu 0.6.1
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.13 versions - Latest release: about 9 hours ago - 3 dependent repositories - 1.85 thousand downloads last month - 31 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
19 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 1.85 thousand downloads last month - 275 stars on GitHub - 5 maintainers
rexmex 0.1.3
A General Purpose Recommender Metrics Library for Fair Evaluation.19 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 1.85 thousand downloads last month - 275 stars on GitHub - 5 maintainers
Top 6.5% on pypi.org
31 versions - Latest release: 9 months ago - 3 dependent packages - 5 dependent repositories - 1.79 thousand downloads last month - 74 stars on GitHub - 1 maintainer
table-evaluator 1.6.1
A package to evaluate how close a synthetic data set is to real data.31 versions - Latest release: 9 months ago - 3 dependent packages - 5 dependent repositories - 1.79 thousand downloads last month - 74 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
83 versions - Latest release: 22 days ago - 1 dependent repositories - 1.79 thousand downloads last month - 155 stars on GitHub - 4 maintainers
acconeer-exptool 7.10.0
Acconeer Exploration Tool83 versions - Latest release: 22 days ago - 1 dependent repositories - 1.79 thousand downloads last month - 155 stars on GitHub - 4 maintainers
Top 2.2% on pypi.org
49 versions - Latest release: 11 months ago - 12 dependent packages - 38 dependent repositories - 1.61 thousand downloads last month - 8 maintainers
bob 12.0.0
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometri...49 versions - Latest release: 11 months ago - 12 dependent packages - 38 dependent repositories - 1.61 thousand downloads last month - 8 maintainers
audmetric 1.2.1
Evaluate machine-learning models11 versions - Latest release: 3 months ago - 2 dependent packages - 2 dependent repositories - 1.56 thousand downloads last month - 1 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
8 versions - Latest release: 8 months ago - 7 dependent repositories - 1.46 thousand downloads last month - 36 stars on GitHub - 1 maintainer
pyprg 0.1
Creates the Precision-Recall-Gain curve and calculates the area under the curve8 versions - Latest release: 8 months ago - 7 dependent repositories - 1.46 thousand downloads last month - 36 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
1 version - Latest release: about 4 years ago - 2 dependent packages - 17 dependent repositories - 1.37 thousand downloads last month - 234 stars on GitHub - 1 maintainer
prdc 0.2
Compute precision, recall, density, and coverage metrics for two sets of vectors.1 version - Latest release: about 4 years ago - 2 dependent packages - 17 dependent repositories - 1.37 thousand downloads last month - 234 stars on GitHub - 1 maintainer
dyff-schema 0.5.3
Data models for the Dyff AI auditing platform.22 versions - Latest release: 4 days ago - 4 dependent packages - 1.27 thousand downloads last month - 0 stars on GitLab.com - 5 maintainers
Top 4.6% on pypi.org
7 versions - Latest release: 3 months ago - 2 dependent packages - 10 dependent repositories - 1.21 thousand downloads last month - 1,680 stars on GitHub - 1 maintainer
avalanche-lib 0.5.0 💰
Avalanche: a Comprehensive Framework for Continual Learning Research7 versions - Latest release: 3 months ago - 2 dependent packages - 10 dependent repositories - 1.21 thousand downloads last month - 1,680 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
enoslib 9.2.0
226 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 1.18 thousand downloads last month - 3 maintainersathina 1.2.17
Python SDK to configure and run evaluations for your LLM-based application48 versions - Latest release: about 17 hours ago - 1.16 thousand downloads last month - 135 stars on GitHub - 1 maintainer
autorag 0.1.11
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.26 versions - Latest release: 1 day ago - 1.15 thousand downloads last month - 515 stars on GitHub - 1 maintainer
fiddler-auditor 0.0.5
Auditing large language models made easy.12 versions - Latest release: 6 months ago - 1 dependent repositories - 980 downloads last month - 138 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
2 versions - Latest release: about 4 years ago - 10 dependent repositories - 955 downloads last month - 185 stars on GitHub - 1 maintainer
moverscore 1.0.3
MoverScore: Evaluating text generation with contextualized embeddings and earth mover distance2 versions - Latest release: about 4 years ago - 10 dependent repositories - 955 downloads last month - 185 stars on GitHub - 1 maintainer
chainforge 0.2.6
A Visual Programming Environment for Prompt Engineering87 versions - Latest release: 9 months ago - 954 downloads last month - 2,004 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
12 versions - Latest release: 12 months ago - 7 dependent repositories - 885 downloads last month - 49 stars on GitHub - 3 maintainers
hydrotools.nwis-client 3.3.1
A convenient interface to the USGS NWIS Instantaneous Values (IV) REST Service API.12 versions - Latest release: 12 months ago - 7 dependent repositories - 885 downloads last month - 49 stars on GitHub - 3 maintainers
trajectopy-core 3.1.0
Trajectory Evaluation in Python46 versions - Latest release: 7 days ago - 2 dependent packages - 883 downloads last month - 1 stars on GitHub - 1 maintainer
factscorelite 1.3.0
FactScore (Fine-grained atomic evaluation of factual precision in long form text generation) comp...10 versions - Latest release: 23 days ago - 882 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
5 versions - Latest release: almost 4 years ago - 2 dependent packages - 14 dependent repositories - 871 downloads last month - 62 stars on GitHub - 2 maintainers
smatch 1.0.4
Smatch (semantic match) tool5 versions - Latest release: almost 4 years ago - 2 dependent packages - 14 dependent repositories - 871 downloads last month - 62 stars on GitHub - 2 maintainers
Top 10.0% on pypi.org
10 versions - Latest release: almost 2 years ago - 4 dependent repositories - 847 downloads last month - 49 stars on GitHub - 3 maintainers
hydrotools.metrics 1.3.3
Variety of standard model evaluation metrics.10 versions - Latest release: almost 2 years ago - 4 dependent repositories - 847 downloads last month - 49 stars on GitHub - 3 maintainers
trajectopy 2.0.14
Trajectory Evaluation in Python43 versions - Latest release: 7 days ago - 838 downloads last month - 21 stars on GitHub - 1 maintainer
jurity 2.0.1
fairness and evaluation library12 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 829 downloads last month - 35 stars on GitHub - 5 maintainers
picai-eval 1.4.6
PICAI Evaluation5 versions - Latest release: 9 days ago - 1 dependent package - 829 downloads last month - 15 stars on GitHub - 1 maintainer
dyff-client 0.5.0
Python client for the Dyff AI auditing platform.12 versions - Latest release: 4 days ago - 2 dependent packages - 769 downloads last month - 0 stars on GitLab.com - 5 maintainers
Top 6.3% on pypi.org
10 versions - Latest release: about 2 years ago - 9 dependent repositories - 745 downloads last month - 57 stars on GitHub - 2 maintainers
pymia 0.3.2
A Python package for data handling and evaluation in deep learning-based medical image analysis.10 versions - Latest release: about 2 years ago - 9 dependent repositories - 745 downloads last month - 57 stars on GitHub - 2 maintainers
pycyclops 0.2.7
Framework for healthcare ML implementation49 versions - Latest release: 9 days ago - 1 dependent package - 742 downloads last month - 62 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
6 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 734 downloads last month - 12 stars on GitHub - 1 maintainer
synthesized-insight 0.7
Synthesized data insights and evaluation framework.6 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 734 downloads last month - 12 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
20 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 666 downloads last month - 76 stars on GitHub - 1 maintainer
django-access 0.1.2b2
Django-Access - the application introducing dynamic evaluation-based instance-level (row-level) a...20 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 666 downloads last month - 76 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
22 versions - Latest release: 11 months ago - 1 dependent package - 2 dependent repositories - 604 downloads last month - 178 stars on GitHub - 1 maintainer
jury 2.2.4
Evaluation toolkit for neural language generation.22 versions - Latest release: 11 months ago - 1 dependent package - 2 dependent repositories - 604 downloads last month - 178 stars on GitHub - 1 maintainer
zenoml 0.6.4
Interactive Evaluation Framework for Machine Learning51 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 577 downloads last month - 208 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
1 version - Latest release: over 5 years ago - 12 dependent repositories - 567 downloads last month - 46 stars on GitHub - 1 maintainer
bcubed 1.5
Simple extended BCubed implementation in Python for clustering evaluation1 version - Latest release: over 5 years ago - 12 dependent repositories - 567 downloads last month - 46 stars on GitHub - 1 maintainer
llmuses 0.3.0
Eval-Scope: Lightweight LLMs Evaluation Framework8 versions - Latest release: about 1 month ago - 1 dependent package - 561 downloads last month - 63 stars on GitHub - 1 maintainer
pycond 2020.10.10
Lightweight Condition Parsing and Building of Evaluation Expressions31 versions - Latest release: over 3 years ago - 2 dependent packages - 3 dependent repositories - 556 downloads last month - 23 stars on GitHub - 1 maintainer
umbrela 0.0.7
A Package for generating query-passage relevance assessment labels.7 versions - Latest release: 13 days ago - 508 downloads last month - 0 stars on GitHub - 1 maintainer
redlite 0.2.0
LLM testing on steroids58 versions - Latest release: 5 days ago - 488 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
17 versions - Latest release: 2 months ago - 1 dependent package - 5 dependent repositories - 481 downloads last month - 81 stars on GitHub - 1 maintainer
verif 1.3.0
A verification program for meteorological forecasts and observations17 versions - Latest release: 2 months ago - 1 dependent package - 5 dependent repositories - 481 downloads last month - 81 stars on GitHub - 1 maintainer
rag-eval 0.1.3
A RAG evaluation framework4 versions - Latest release: about 2 months ago - 471 downloads last month - 1 maintainer
nereval 0.2.5
Evaluation script for named entity recognition systems based on F1 score.3 versions - Latest release: almost 6 years ago - 1 dependent repositories - 469 downloads last month - 66 stars on GitHub - 1 maintainer
pyieoe 0.1.1
pyIEOE: a Python package to facilitate interpretable OPE evaluation2 versions - Latest release: over 2 years ago - 1 dependent package - 4 dependent repositories - 444 downloads last month - 29 stars on GitHub - 1 maintainer
python-grid5000 1.2.4
A python wrapper for the GitLab API.45 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 416 downloads last month - 2,162 stars on GitHub - 2 maintainers
phasellm 0.0.21
Wrappers for common large language models (LLMs) with support for evaluation.21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 414 downloads last month - 1 maintainer
dyff 0.18.0
Meta-package to install the local SDK for the Dyff AI auditing platform.20 versions - Latest release: 4 days ago - 398 downloads last month - 5 maintainers
Top 9.4% on pypi.org
5 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 375 downloads last month - 59 stars on GitHub - 1 maintainer
mcdm 1.4
Python implementation of multiple-criteria decision-making algorithms5 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 375 downloads last month - 59 stars on GitHub - 1 maintainer
sed-scores-eval 0.0.3
(Threshold-Independent) Evaluation of Sound Event Detection Scores4 versions - Latest release: 24 days ago - 368 downloads last month - 22 stars on GitHub - 1 maintainer
rankereval 0.2.0
A fast implementation of ranking metrics for information retrieval and recommendation.3 versions - Latest release: over 2 years ago - 1 dependent repositories - 358 downloads last month - 28 stars on GitHub - 1 maintainer
dyff-audit 0.3.1
Audit tools for the Dyff AI auditing platform.12 versions - Latest release: 21 days ago - 1 dependent package - 355 downloads last month - 0 stars on GitLab.com - 5 maintainers
promptmodel 0.1.19
Prompt & model versioning on the cloud, built for developers.50 versions - Latest release: 5 days ago - 347 downloads last month - 11 stars on GitHub - 2 maintainers
psds-eval 0.5.3
A module to calculate Polyphonic Sound Detection Score8 versions - Latest release: over 1 year ago - 1 dependent repositories - 330 downloads last month - 2 stars on GitHub - 1 maintainer
panoptica 0.6.5
Panoptic Quality (PQ) computation for binary masks.53 versions - Latest release: 27 days ago - 1 dependent repositories - 324 downloads last month - 12 stars on GitHub - 1 maintainer
xturing 0.1.8
Fine-tuning, evaluation and data generation for LLMs19 versions - Latest release: 8 months ago - 321 downloads last month - 1 maintainer
Top 6.2% on pypi.org
5 versions - Latest release: over 2 years ago - 24 dependent repositories - 309 downloads last month - 1,053 stars on GitHub - 1 maintainer
xai 0.1.0
XAI - An industry-ready machine learning library that ensures explainable AI by design5 versions - Latest release: over 2 years ago - 24 dependent repositories - 309 downloads last month - 1,053 stars on GitHub - 1 maintainer
quotientai 0.0.4
CLI for evaluating large language models with Quotient4 versions - Latest release: 14 days ago - 300 downloads last month - 1 maintainer
sila2lib 0.2.5
sila2lib - a SiLA 2 python3 library4 versions - Latest release: over 3 years ago - 2 dependent repositories - 298 downloads last month - 9 stars on GitLab.com - 2 maintainers
vision-evaluation 0.2.14
Evaluation metric codes for various vision tasks.25 versions - Latest release: about 1 year ago - 2 dependent repositories - 288 downloads last month - 34 stars on GitHub - 2 maintainers
Top 7.2% on pypi.org
6 versions - Latest release: about 4 years ago - 2 dependent packages - 5 dependent repositories - 288 downloads last month - 115 stars on GitHub - 1 maintainer
neleval 3.1.1
Command-line evaluation tools for named entity linking and (cross-document) coreference resolution6 versions - Latest release: about 4 years ago - 2 dependent packages - 5 dependent repositories - 288 downloads last month - 115 stars on GitHub - 1 maintainer
charcut 1.1.1
Character-based MT evaluation and difference highlighting2 versions - Latest release: over 1 year ago - 2 dependent packages - 4 dependent repositories - 285 downloads last month - 1 stars on GitHub - 1 maintainer
ntqr 0.3.2
Tools for the logic of evaluation using unlabeled data5 versions - Latest release: 27 days ago - 254 downloads last month - 34 stars on GitHub - 1 maintainer
spiraleval 0.1.2 removed
Evaluation for characteristics3 versions - Latest release: 7 months ago - 249 downloads last month - 1 maintainer
opencompass 0.2.4
A comprehensive toolkit for large model evaluation10 versions - Latest release: 22 days ago - 248 downloads last month - 2,659 stars on GitHub - 1 maintainer
tidecv-light 1.0.1
A General Toolbox for Identifying ObjectDetection Errors1 version - Latest release: over 1 year ago - 236 downloads last month - 687 stars on GitHub - 1 maintainer
rank-eval 0.1.3
rank_eval: A Blazing Fast Python Library for Ranking Evaluation and Comparison5 versions - Latest release: over 2 years ago - 1 dependent repositories - 236 downloads last month - 352 stars on GitHub - 1 maintainer
Related Keywords
python
63
machine-learning
45
metrics
37
nlp
24
evaluation-metrics
22
ai
20
deep-learning
19
llm
19
bob
17
biometric recognition
16
data-science
15
benchmark
14
evaluation-framework
13
validation
13
large-language-models
12
pytorch
12
forecasting
12
simulation
12
ml
11
modeling
11
machine learning
11
verification
11
pandas
11
llmops
10
observations
10
noaa
10
hydrology
10
natural language processing
10
learning
10
NLP
9
testing
9
ranking
9
data
9
prompt-engineering
9
precision
8
classification
8
machine
8
monitoring
7
information retrieval
7
comparison
7
recall
7
expression
7
LLM
6
object-detection
6
langchain
6
detection
6
ML
6
computational linguistics
6
clustering
6
evaluate
6
framework
6
language
6
research
5
model
5
trec_eval
5
optimization
5
configuration
5
audit
5
information-retrieval
5
segmentation
5
metric
5
training
5
rag
5
natural-language-processing
5
python3
5
deep learning
5
data-analysis
5
django
5
statistics
5
hacktoberfest
5
llms
5
robots
5
lab automation
5
visualisation
5
openai
5
experiments
5
library
4
recommender-system
4
artificial-intelligence
4
computer-vision
4
lazy
4
rights-management
4
scikit-learn
4
assessment
4
dataset
4
llm-eval
4
access
4
analysis
4
rights
4
mathematics
4
runtime
4
permissions
4
evaluation-based
4
dynamic
4
guard
4
instance-level
4
authority
4
django-access
4
row-level
4
box
4