Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "evaluation" keyword

Top 1.2% on pypi.org
evaluate 0.4.2
HuggingFace community-driven open-source library of evaluation
15 versions - Latest release: 15 days ago - 222 dependent packages - 2,474 dependent repositories - 2.58 million downloads last month - 1,762 stars on GitHub - 3 maintainers
Top 1.3% on pypi.org
sacrebleu 2.4.2
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
69 versions - Latest release: about 1 month ago - 96 dependent packages - 4,263 dependent repositories - 1.88 million downloads last month - 972 stars on GitHub - 3 maintainers
Top 3.1% on pypi.org
langsmith 0.1.57
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.
166 versions - Latest release: 4 days ago - 86 dependent packages - 2,234 dependent repositories - 7.97 million downloads last month - 217 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
simpleeval 0.9.13 💰
A simple, safe single expression evaluator library.
18 versions - Latest release: about 1 year ago - 59 dependent packages - 290 dependent repositories - 1.22 million downloads last month - 424 stars on GitHub - 1 maintainer
Top 1.1% on pypi.org
configspace 0.7.2
Creation and manipulation of parameter configuration spaces for automated algorithm configuration...
43 versions - Latest release: 10 months ago - 31 dependent packages - 56 dependent repositories - 108 thousand downloads last month - 186 stars on GitHub - 2 maintainers
Top 4.9% on pypi.org
torcheval 0.0.7
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
7 versions - Latest release: 9 months ago - 25 dependent packages - 8 dependent repositories - 70 thousand downloads last month - 194 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
langfuse 2.31.0
A client library for accessing langfuse
321 versions - Latest release: about 8 hours ago - 17 dependent packages - 1 dependent repositories - 225 thousand downloads last month - 2,823 stars on GitHub - 1 maintainer
Top 1.8% on pypi.org
motmetrics 1.4.0
Metrics for multiple object tracker benchmarking.
9 versions - Latest release: over 1 year ago - 13 dependent packages - 398 dependent repositories - 127 thousand downloads last month - 1,326 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
bob 12.0.0
Bob is a free signal-processing and machine learning toolbox originally developed by the Biometri...
49 versions - Latest release: 11 months ago - 12 dependent packages - 38 dependent repositories - 1.61 thousand downloads last month - 8 maintainers
Top 2.2% on pypi.org
torch-fidelity 0.3.0
High-fidelity performance metrics for generative models in PyTorch
3 versions - Latest release: almost 3 years ago - 9 dependent packages - 773 dependent repositories - 282 thousand downloads last month - 870 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
pytrec-eval 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
3 versions - Latest release: over 3 years ago - 9 dependent packages - 36 dependent repositories - 104 thousand downloads last month - 249 stars on GitHub - 1 maintainer
Top 7.3% on pypi.org
bob.bio.base 8.0.0
Tools for running biometric recognition experiments
32 versions - Latest release: 11 months ago - 7 dependent packages - 5 dependent repositories - 220 downloads last month - 10 maintainers
Top 3.5% on pypi.org
rliable 1.0.8
rliable: Reliable evaluation on reinforcement learning and machine learning benchmarks.
9 versions - Latest release: almost 2 years ago - 6 dependent packages - 15 dependent repositories - 5.77 thousand downloads last month - 689 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
ranx 0.3.19
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion
45 versions - Latest release: 6 months ago - 4 dependent packages - 7 dependent repositories - 13.2 thousand downloads last month - 348 stars on GitHub - 1 maintainer
dyff-schema 0.5.3
Data models for the Dyff AI auditing platform.
22 versions - Latest release: 4 days ago - 4 dependent packages - 1.27 thousand downloads last month - 0 stars on GitLab.com - 5 maintainers
cer 1.2.0
Translation Edit Rate on the character level
4 versions - Latest release: over 1 year ago - 4 dependent packages - 4 dependent repositories - 2.25 thousand downloads last month - 1 stars on GitHub - 1 maintainer
Top 2.3% on pypi.org
pycm 0.9.5 💰
Multi-class confusion matrix library in Python
44 versions - Latest release: almost 6 years ago - 4 dependent packages - 50 dependent repositories - 45.4 thousand downloads last month - 1,430 stars on GitHub - 3 maintainers
hydrotools.-restclient 3.1.0
General REST api client with built in request caching and retries.
8 versions - Latest release: 7 months ago - 3 dependent packages - 49 stars on GitHub - 3 maintainers
Top 8.6% on pypi.org
evalidate 2.0.2
Validation and secure evaluation of untrusted python expressions
25 versions - Latest release: 10 months ago - 3 dependent packages - 6 dependent repositories - 16.4 thousand downloads last month - 19 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
table-evaluator 1.6.1
A package to evaluate how close a synthetic data set is to real data.
31 versions - Latest release: 9 months ago - 3 dependent packages - 5 dependent repositories - 1.79 thousand downloads last month - 74 stars on GitHub - 1 maintainer
bob.bio.face 8.0.0
Tools for running face recognition experiments
24 versions - Latest release: 11 months ago - 3 dependent packages - 2 dependent repositories - 56 downloads last month - 10 maintainers
Top 3.6% on pypi.org
coconut 3.1.0 💰
Simple, elegant, Pythonic functional programming.
41 versions - Latest release: 2 months ago - 3 dependent packages - 22 dependent repositories - 2.81 thousand downloads last month - 3,951 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
tidecv 1.0.1
A General Toolbox for Identifying ObjectDetection Errors
2 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 2.67 thousand downloads last month - 687 stars on GitHub - 1 maintainer
synthesized-datasets 1.7
Publically available datasets for benchmarking and evaluation.
15 versions - Latest release: 2 months ago - 3 dependent packages - 7.72 thousand downloads last month - 1 stars on GitHub - 1 maintainer
audmetric 1.2.1
Evaluate machine-learning models
11 versions - Latest release: 3 months ago - 2 dependent packages - 2 dependent repositories - 1.56 thousand downloads last month - 1 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
avalanche-lib 0.5.0 💰
Avalanche: a Comprehensive Framework for Continual Learning Research
7 versions - Latest release: 3 months ago - 2 dependent packages - 10 dependent repositories - 1.21 thousand downloads last month - 1,680 stars on GitHub - 1 maintainer
pycond 2020.10.10
Lightweight Condition Parsing and Building of Evaluation Expressions
31 versions - Latest release: over 3 years ago - 2 dependent packages - 3 dependent repositories - 556 downloads last month - 23 stars on GitHub - 1 maintainer
tmtoolkit 0.12.0
Text Mining and Topic Modeling Toolkit
35 versions - Latest release: about 1 year ago - 2 dependent packages - 10 dependent repositories - 2.96 thousand downloads last month - 12 stars on GitHub - 2 maintainers
alpaca-eval 0.6.2
AlpacaEval : An Automatic Evaluator of Instruction-following Models
33 versions - Latest release: 26 days ago - 2 dependent packages - 7.19 thousand downloads last month - 1,062 stars on GitHub - 3 maintainers
Top 8.2% on pypi.org
torcheval-nightly 2024.5.13
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
479 versions - Latest release: 2 days ago - 2 dependent packages - 1 dependent repositories - 9.82 thousand downloads last month - 156 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
smatch 1.0.4
Smatch (semantic match) tool
5 versions - Latest release: almost 4 years ago - 2 dependent packages - 14 dependent repositories - 871 downloads last month - 62 stars on GitHub - 2 maintainers
Top 9.4% on pypi.org
mcdm 1.4
Python implementation of multiple-criteria decision-making algorithms
5 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 375 downloads last month - 59 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
uptrain 0.7.0
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, r...
48 versions - Latest release: 12 days ago - 2 dependent packages - 1 dependent repositories - 5.39 thousand downloads last month - 2,015 stars on GitHub - 2 maintainers
dyff-client 0.5.0
Python client for the Dyff AI auditing platform.
12 versions - Latest release: 4 days ago - 2 dependent packages - 769 downloads last month - 0 stars on GitLab.com - 5 maintainers
charcut 1.1.1
Character-based MT evaluation and difference highlighting
2 versions - Latest release: over 1 year ago - 2 dependent packages - 4 dependent repositories - 285 downloads last month - 1 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
prdc 0.2
Compute precision, recall, density, and coverage metrics for two sets of vectors.
1 version - Latest release: about 4 years ago - 2 dependent packages - 17 dependent repositories - 1.37 thousand downloads last month - 234 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
neleval 3.1.1
Command-line evaluation tools for named entity linking and (cross-document) coreference resolution
6 versions - Latest release: about 4 years ago - 2 dependent packages - 5 dependent repositories - 288 downloads last month - 115 stars on GitHub - 1 maintainer
trajectopy-core 3.1.0
Trajectory Evaluation in Python
46 versions - Latest release: 7 days ago - 2 dependent packages - 883 downloads last month - 1 stars on GitHub - 1 maintainer
oasis 0.1.3
Optimal Asymptotic Sequential Importance Sampling
2 versions - Latest release: almost 3 years ago - 1 dependent package - 3 dependent repositories - 66 downloads last month - 14 stars on GitHub - 1 maintainer
sacrebleu-macrof 2.0.1
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 29 downloads last month - 907 stars on GitHub - 1 maintainer
dyff-audit 0.3.1
Audit tools for the Dyff AI auditing platform.
12 versions - Latest release: 21 days ago - 1 dependent package - 355 downloads last month - 0 stars on GitLab.com - 5 maintainers
phasellm 0.0.21
Wrappers for common large language models (LLMs) with support for evaluation.
21 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 414 downloads last month - 1 maintainer
picai-eval 1.4.6
PICAI Evaluation
5 versions - Latest release: 9 days ago - 1 dependent package - 829 downloads last month - 15 stars on GitHub - 1 maintainer
alexandra-ai-eval 0.1.0
Evaluation of finetuned models.
1 version - Latest release: about 1 year ago - 1 dependent package - 28 downloads last month - 8 stars on GitHub - 2 maintainers
pyieoe 0.1.1
pyIEOE: a Python package to facilitate interpretable OPE evaluation
2 versions - Latest release: over 2 years ago - 1 dependent package - 4 dependent repositories - 444 downloads last month - 29 stars on GitHub - 1 maintainer
er-evaluation 2.3.0 💰
An End-to-End Evaluation Framework for Entity Resolution Systems.
9 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 103 downloads last month - 9 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
django-access 0.1.2b2
Django-Access - the application introducing dynamic evaluation-based instance-level (row-level) a...
20 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 666 downloads last month - 76 stars on GitHub - 1 maintainer
python-grid5000 1.2.4
A python wrapper for the GitLab API.
45 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 416 downloads last month - 2,162 stars on GitHub - 2 maintainers
Top 5.2% on pypi.org
pytrec-eval-terrier 0.5.6
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
6 versions - Latest release: 7 months ago - 1 dependent package - 11 dependent repositories - 31.6 thousand downloads last month - 248 stars on GitHub - 1 maintainer
bob.bio.vein 5.0.0
Vein Recognition Library
16 versions - Latest release: 11 months ago - 1 dependent package - 76 downloads last month - 1 maintainer
myaml 1.0.1
M(ath)YAML: evaluate math expressions in YAML files.
4 versions - Latest release: over 3 years ago - 1 dependent package - 3 dependent repositories - 43 downloads last month - 2 stars on GitHub - 1 maintainer
omega-index-py3 0.3.2
Omega Index for evaluation of overlapping community structure
3 versions - Latest release: almost 3 years ago - 1 dependent package - 11 dependent repositories - 32 downloads last month - 1 maintainer
pycyclops 0.2.7
Framework for healthcare ML implementation
49 versions - Latest release: 9 days ago - 1 dependent package - 742 downloads last month - 62 stars on GitHub - 1 maintainer
llmuses 0.3.0
Eval-Scope: Lightweight LLMs Evaluation Framework
8 versions - Latest release: about 1 month ago - 1 dependent package - 561 downloads last month - 63 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
synthesized-insight 0.7
Synthesized data insights and evaluation framework.
6 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 734 downloads last month - 12 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
nf1 0.0.4
NF1: Normalized F1 score for community evaluation against ground truth
2 versions - Latest release: almost 3 years ago - 1 dependent package - 14 dependent repositories - 71.8 thousand downloads last month - 21 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
jury 2.2.4
Evaluation toolkit for neural language generation.
22 versions - Latest release: 11 months ago - 1 dependent package - 2 dependent repositories - 604 downloads last month - 178 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
enoslib 9.2.0
226 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 1.18 thousand downloads last month - 3 maintainers
jurity 2.0.1
fairness and evaluation library
12 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 829 downloads last month - 35 stars on GitHub - 5 maintainers
zenoml 0.6.4
Interactive Evaluation Framework for Machine Learning
51 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 577 downloads last month - 208 stars on GitHub - 1 maintainer
replay-rec 0.16.0
RecSys Library
17 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 4.05 thousand downloads last month - 125 stars on GitHub - 1 maintainer
bob.bio.spear 5.0.0
23 versions - Latest release: 11 months ago - 1 dependent package - 3 dependent repositories - 111 downloads last month - 10 maintainers
reseval 0.1.6
Reproducible Subjective Evaluation
9 versions - Latest release: 2 months ago - 1 dependent package - 1 dependent repositories - 78 downloads last month - 53 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
verif 1.3.0
A verification program for meteorological forecasts and observations
17 versions - Latest release: 2 months ago - 1 dependent package - 5 dependent repositories - 481 downloads last month - 81 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
rexmex 0.1.3
A General Purpose Recommender Metrics Library for Fair Evaluation.
19 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 1.85 thousand downloads last month - 275 stars on GitHub - 5 maintainers
Top 8.0% on pypi.org
pyprg 0.1
Creates the Precision-Recall-Gain curve and calculates the area under the curve
8 versions - Latest release: 8 months ago - 7 dependent repositories - 1.46 thousand downloads last month - 36 stars on GitHub - 1 maintainer
rke-score 0.0.7
Compute Renyi Kernel Entropy scores (RKE-MC and RRKE) for two sets of vectors.
5 versions - Latest release: 6 months ago - 34 downloads last month - 9 stars on GitHub - 1 maintainer
flare22-dsc-nsd-test 0.0.2
FLARE22_DSC_NSD_Evaluation
2 versions - Latest release: almost 2 years ago - 21 downloads last month - 52 stars on GitHub - 1 maintainer
take-ai-evaluation 0.2.3
Metrics and visualizations for evaluating chatbot's AI utilization.
5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 37 downloads last month - 1 maintainer
inspire 1.0.9
Helper library to participate in the INSPIRE challenge
12 versions - Latest release: about 9 years ago - 4 dependent repositories - 36 downloads last month - 2 stars on GitHub - 2 maintainers
disaggregators 0.1.2
HuggingFace community-driven open-source library for dataset disaggregation
3 versions - Latest release: over 1 year ago - 120 downloads last month - 66 stars on GitHub - 1 maintainer
spreadscript 0.0.3
spreadscript: Use a spreadsheet as a function.
3 versions - Latest release: about 6 years ago - 1 dependent repositories - 13 downloads last month - 1 stars on GitHub - 1 maintainer
coreference-eval 0.0.2
Common metrics and evaluation tools for coreference chains (jsonline format)
2 versions - Latest release: over 1 year ago - 1 dependent repositories - 85 downloads last month - 4 stars on GitHub - 1 maintainer
compare-qrels 0.0.3
Qualitatively compare the qrels results of two IR systems.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 6 downloads last month - 0 stars on GitHub - 1 maintainer
deepvision-toolkit 0.1.6
PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/impleme...
5 versions - Latest release: about 1 year ago - 27 downloads last month - 31 stars on GitHub - 1 maintainer
autorag 0.1.11
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
26 versions - Latest release: 1 day ago - 1.15 thousand downloads last month - 515 stars on GitHub - 1 maintainer
tsml-eval 0.3.0
A package for benchmarking time series machine learning tools.
7 versions - Latest release: 16 days ago - 179 downloads last month - 20 stars on GitHub - 1 maintainer
kolena-client 1.18.0
Client for Kolena's machine learning testing platform.
70 versions - Latest release: 1 day ago - 1.91 thousand downloads last month - 38 stars on GitHub - 1 maintainer
boolexp 0.1
Safe boolean expression evaluator
1 version - Latest release: over 10 years ago - 5 dependent repositories - 27 downloads last month - 1 maintainer
channelpack 0.7.0
Callable container of Numpy arrays with support for masking and slicing
4 versions - Latest release: about 3 years ago - 2 dependent repositories - 78 downloads last month - 0 stars on GitHub - 1 maintainer
evalmate 0.3.0
Evalmate is a set of tools for evaluating audio related machine learning tasks.
3 versions - Latest release: over 5 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 1 maintainer
tno.sdg.tabular.eval.utility-metrics 0.3.0
Utility metrics for tabular data
1 version - Latest release: 3 months ago - 8 downloads last month - 2 stars on GitHub - 1 maintainer
tsmetrics 0.1.0
Evaluation metrics for time series analysis
1 version - Latest release: over 6 years ago - 1 dependent repositories - 3 downloads last month - 4 stars on GitHub - 1 maintainer
bob.ip.tensorflow-extractor 0.0.7
Feature extractor using tensorflow CNNs
7 versions - Latest release: over 3 years ago - 11 downloads last month - 1 maintainer
langcheck 0.7.1
Simple, Pythonic building blocks to evaluate LLM-based applications
12 versions - Latest release: 7 days ago - 2.57 thousand downloads last month - 137 stars on GitHub - 3 maintainers
Top 8.9% on pypi.org
codebleu 0.6.1
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.
13 versions - Latest release: about 12 hours ago - 3 dependent repositories - 1.85 thousand downloads last month - 31 stars on GitHub - 1 maintainer
ragstack-ai-langsmith 0.0.1a1 removed
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.
1 version - Latest release: 6 months ago - 1 maintainer
json-criteria 0.2.0
Python library designed for evaluating data against serializable JSON criteria
6 versions - Latest release: 17 days ago - 25 downloads last month - 1 stars on GitHub - 1 maintainer
panoptica 0.6.5
Panoptic Quality (PQ) computation for binary masks.
53 versions - Latest release: 27 days ago - 1 dependent repositories - 324 downloads last month - 12 stars on GitHub - 1 maintainer
v-stream 0.1.2
STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models
8 versions - Latest release: 4 months ago - 47 downloads last month - 9 stars on GitHub - 1 maintainer
evaluation-framework 1.3
Evaluation Framework for testing and comparing graph embedding techniques
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 10 stars on GitHub - 1 maintainer
abed 0.1.3
A command line tool for easily managing benchmark experiments
7 versions - Latest release: about 2 years ago - 72 downloads last month - 4 stars on GitHub - 1 maintainer
bob.ip.caffe-extractor 2.0.3
Feature extractor using caffe CNNs
6 versions - Latest release: over 4 years ago - 28 downloads last month - 6 maintainers
calc4ap 1.0.1
Easy AP Calculator with Python
2 versions - Latest release: almost 3 years ago - 32 downloads last month - 0 stars on GitHub - 1 maintainer
costra 1.0
Tool for automatic evaluation of Czech sentence embeddings using Costra 1.1 dataset
1 version - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
gradgpad 2.1.0
gradgpad
12 versions - Latest release: over 1 year ago - 1 dependent repositories - 85 downloads last month - 13 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
moverscore 1.0.3
MoverScore: Evaluating text generation with contextualized embeddings and earth mover distance
2 versions - Latest release: about 4 years ago - 10 dependent repositories - 955 downloads last month - 185 stars on GitHub - 1 maintainer
ddplt 0.0.2
A package with code from my ML projects that has a potential of being reusable
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 18 downloads last month - 0 stars on GitHub - 1 maintainer
evalne 0.4.0
Open Source Network Embedding Evaluation toolkit
4 versions - Latest release: almost 2 years ago - 1 dependent repositories - 11 downloads last month - 102 stars on GitHub - 1 maintainer
bt4vt 1.0.1
Bias Tests for Voice Technologies
2 versions - Latest release: over 1 year ago - 25 downloads last month - 11 stars on GitHub - 2 maintainers