Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "evaluation" keyword

b3score 0.2
BCUBED extrinsic clustering metric
2 versions - Latest release: almost 8 years ago - 1 dependent repositories - 20 downloads last month - 4 stars on GitHub - 2 maintainers
Top 9.7% on pypi.org
uptrain 0.6.13
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, r...
47 versions - Latest release: 17 days ago - 1 dependent repositories - 5.9 thousand downloads last month - 1,994 stars on GitHub - 2 maintainers
llama-index-callbacks-uptrain 0.1.2
llama-index callbacks uptrain integration
2 versions - Latest release: about 2 months ago - 74 downloads last month - 1,994 stars on GitHub - 2 maintainers
promptbench 0.0.2
PromptBench is a powerful tool designed to scrutinize and analyze the interaction of large langua...
6 versions - Latest release: 5 months ago - 225 downloads last month - 2,058 stars on GitHub - 2 maintainers
sci-annot-eval 0.0.9
The evaluation component of the sci-annot framework
5 versions - Latest release: 9 months ago - 1 dependent repositories - 32 downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.2% on pypi.org
pytrec-eval-terrier 0.5.6
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
6 versions - Latest release: 7 months ago - 11 dependent repositories - 24.1 thousand downloads last month - 244 stars on GitHub - 2 maintainers
Top 3.3% on pypi.org
pytrec-eval 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
3 versions - Latest release: over 3 years ago - 7 dependent packages - 36 dependent repositories - 171 thousand downloads last month - 244 stars on GitHub - 2 maintainers
pytrec-eval-git 0.5
Provides Python bindings for popular Information Retrieval measures implemented within trec_eval.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 14 downloads last month - 235 stars on GitHub - 2 maintainers
boolexp 0.1
Safe boolean expression evaluator
1 version - Latest release: over 10 years ago - 5 dependent repositories - 27 downloads last month - 2 maintainers
mlrl-testbed 0.9.0
Provides utilities for the training and evaluation of multi-label rule learning algorithms
6 versions - Latest release: 10 months ago - 1 dependent repositories - 26 downloads last month - 19 stars on GitHub - 2 maintainers
evaldet 0.4.0
Evaluation for Detection and Tracking
13 versions - Latest release: 11 months ago - 1 dependent repositories - 103 downloads last month - 2 stars on GitHub - 2 maintainers
Top 2.3% on pypi.org
pycm 0.9.5 💰
Multi-class confusion matrix library in Python
44 versions - Latest release: almost 6 years ago - 4 dependent packages - 50 dependent repositories - 49.9 thousand downloads last month - 1,423 stars on GitHub - 3 maintainers
bob.thesis.tiago 1.0.1
Tools to reproduce the experiments of the Ph.D. thesis from Tiago de Freitas Pereira
2 versions - Latest release: over 5 years ago - 10 downloads last month - 2 maintainers
picai-eval 1.4.5
PICAI Evaluation
4 versions - Latest release: 8 months ago - 1 dependent package - 590 downloads last month - 15 stars on GitHub - 2 maintainers
pycond 2020.10.10
Lightweight Condition Parsing and Building of Evaluation Expressions
31 versions - Latest release: over 3 years ago - 1 dependent package - 3 dependent repositories - 556 downloads last month - 23 stars on GitHub - 2 maintainers
evalpm 0.1.2
A framework for creating and evaluating immission models for Particulate Matter
3 versions - Latest release: 7 months ago - 9 downloads last month - 2 stars on GitHub - 2 maintainers
drift-anomaly-evaluator 0.0.1
An initial evaluation of drift anomaly detection models
1 version - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 1 maintainer
topic999 1.0.1.dev1
Topic Model User Evaluation
1 version - Latest release: almost 7 years ago - 1 dependent repositories - 11 downloads last month - 2 maintainers
chainforge 0.2.6
A Visual Programming Environment for Prompt Engineering
86 versions - Latest release: 8 months ago - 850 downloads last month - 1,975 stars on GitHub - 2 maintainers
lara-django 0.2.6
LARA-django is a python django project of the Lab Automation Suite LARA - (lara.uni-greifswald.de...
1 version - Latest release: about 4 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitLab.com - 2 maintainers
Top 6.7% on pypi.org
agenta 0.13.8
The SDK for agenta is an open-source LLMOps platform.
99 versions - Latest release: about 18 hours ago - 2 dependent repositories - 2.71 thousand downloads last month - 575 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
evo 1.27.1
Python package for the evaluation of odometry and SLAM
98 versions - Latest release: 2 days ago - 18 dependent repositories - 86.5 thousand downloads last month - 3,023 stars on GitHub - 1 maintainer
pyntcireval 0.0.3
Python version of NTCIREVAL
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 18 downloads last month - 22 stars on GitHub - 2 maintainers
Top 8.2% on pypi.org
torcheval-nightly 2024.4.28
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
465 versions - Latest release: about 23 hours ago - 2 dependent packages - 1 dependent repositories - 6.33 thousand downloads last month - 156 stars on GitHub - 1 maintainer
autorag 0.1.7
Automatically Evaluate RAG pipelines with your own data. Find optimal structure for new RAG product.
22 versions - Latest release: about 24 hours ago - 935 downloads last month - 515 stars on GitHub - 2 maintainers
Top 5.6% on pypi.org
prdc 0.2
Compute precision, recall, density, and coverage metrics for two sets of vectors.
1 version - Latest release: about 4 years ago - 1 dependent package - 17 dependent repositories - 1.37 thousand downloads last month - 234 stars on GitHub - 2 maintainers
Top 1.1% on pypi.org
configspace 0.7.2
Creation and manipulation of parameter configuration spaces for automated algorithm configuration...
43 versions - Latest release: 9 months ago - 22 dependent packages - 56 dependent repositories - 110 thousand downloads last month - 186 stars on GitHub - 2 maintainers
deepvision-toolkit 0.1.6
PyTorch and TensorFlow/Keras image models with automatic weight conversions and equal API/impleme...
5 versions - Latest release: about 1 year ago - 27 downloads last month - 31 stars on GitHub - 1 maintainer
sacrebleu-macrof 2.0.1
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 29 downloads last month - 907 stars on GitHub - 1 maintainer
process-tracing 0.1.0a1
ptrace based process tracing utilities for python
11 versions - Latest release: 8 months ago - 1 dependent repositories - 191 downloads last month - 2 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
sacrebleu 2.4.2
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
69 versions - Latest release: 17 days ago - 62 dependent packages - 4,263 dependent repositories - 1.84 million downloads last month - 967 stars on GitHub - 6 maintainers
sprint-toolkit 0.0.3
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval
3 versions - Latest release: 10 months ago - 15 downloads last month - 36 stars on GitHub - 2 maintainers
evalmate 0.3.0
Evalmate is a set of tools for evaluating audio related machine learning tasks.
3 versions - Latest release: over 5 years ago - 1 dependent repositories - 35 downloads last month - 3 stars on GitHub - 2 maintainers
json-criteria 0.2.0
Python library designed for evaluating data against serializable JSON criteria
6 versions - Latest release: 1 day ago - 25 downloads last month - 1 stars on GitHub - 1 maintainer
clusters-features 1.0.3
The Clusters-Features package allows data science users to compute high-level linear algebra oper...
2 versions - Latest release: over 2 years ago - 45 downloads last month - 28 stars on GitHub - 2 maintainers
icdar21-mapseg-eval 1.0.4
Evaluation tools for ICDAR21 Competition on Historical Map Segmentation (MapSeg).
5 versions - Latest release: almost 3 years ago - 1 dependent repositories - 33 downloads last month - 2 stars on GitHub - 2 maintainers
Top 1.8% on pypi.org
motmetrics 1.4.0
Metrics for multiple object tracker benchmarking.
9 versions - Latest release: over 1 year ago - 11 dependent packages - 398 dependent repositories - 92.4 thousand downloads last month - 1,323 stars on GitHub - 1 maintainer
fiddler-auditor 0.0.5
Auditing large language models made easy.
12 versions - Latest release: 6 months ago - 1 dependent repositories - 980 downloads last month - 138 stars on GitHub - 2 maintainers
Top 1.8% on pypi.org
simpleeval 0.9.13 💰
A simple, safe single expression evaluator library.
18 versions - Latest release: about 1 year ago - 43 dependent packages - 290 dependent repositories - 1.17 million downloads last month - 421 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
torcheval 0.0.7
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
7 versions - Latest release: 8 months ago - 10 dependent packages - 8 dependent repositories - 61.7 thousand downloads last month - 186 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
xai 0.1.0
XAI - An industry-ready machine learning library that ensures explainable AI by design
5 versions - Latest release: over 2 years ago - 24 dependent repositories - 310 downloads last month - 1,032 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
torch-fidelity 0.3.0
High-fidelity performance metrics for generative models in PyTorch
3 versions - Latest release: almost 3 years ago - 6 dependent packages - 773 dependent repositories - 282 thousand downloads last month - 870 stars on GitHub - 2 maintainers
Top 7.2% on pypi.org
neleval 3.1.1
Command-line evaluation tools for named entity linking and (cross-document) coreference resolution
6 versions - Latest release: about 4 years ago - 2 dependent packages - 5 dependent repositories - 288 downloads last month - 115 stars on GitHub - 2 maintainers
antgo 0.1.24
machine learning experiment platform
46 versions - Latest release: 10 months ago - 1 dependent repositories - 155 downloads last month - 16 stars on GitHub - 2 maintainers
Top 3.1% on pypi.org
langsmith 0.1.51
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.
160 versions - Latest release: 4 days ago - 46 dependent packages - 2,234 dependent repositories - 7.75 million downloads last month - 217 stars on GitHub - 2 maintainers
pyieoe 0.1.1
pyIEOE: a Python package to facilitate interpretable OPE evaluation
2 versions - Latest release: over 2 years ago - 4 dependent repositories - 444 downloads last month - 29 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
django-access 0.1.2b2
Django-Access - the application introducing dynamic evaluation-based instance-level (row-level) a...
20 versions - Latest release: 3 months ago - 4 dependent repositories - 666 downloads last month - 76 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
langfuse 2.27.2
A client library for accessing langfuse
309 versions - Latest release: 3 days ago - 4 dependent packages - 1 dependent repositories - 184 thousand downloads last month - 2,823 stars on GitHub - 2 maintainers
trustllm 0.2.4
TrustLLM
6 versions - Latest release: 3 months ago - 89 downloads last month - 274 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
smatch 1.0.4
Smatch (semantic match) tool
5 versions - Latest release: almost 4 years ago - 1 dependent package - 14 dependent repositories - 771 downloads last month - 62 stars on GitHub - 4 maintainers
Top 6.3% on pypi.org
pymia 0.3.2
A Python package for data handling and evaluation in deep learning-based medical image analysis.
10 versions - Latest release: about 2 years ago - 9 dependent repositories - 818 downloads last month - 57 stars on GitHub - 4 maintainers
Top 1.2% on pypi.org
evaluate 0.4.1
HuggingFace community-driven open-source library of evaluation
14 versions - Latest release: 7 months ago - 128 dependent packages - 2,474 dependent repositories - 2.57 million downloads last month - 1,762 stars on GitHub - 4 maintainers
vision-evaluation 0.2.14
Evaluation metric codes for various vision tasks.
25 versions - Latest release: about 1 year ago - 2 dependent repositories - 282 downloads last month - 34 stars on GitHub - 2 maintainers
langcheck 0.6.0
Simple, Pythonic building blocks to evaluate LLM-based applications
9 versions - Latest release: 21 days ago - 2.68 thousand downloads last month - 136 stars on GitHub - 6 maintainers
rankereval 0.2.0
A fast implementation of ranking metrics for information retrieval and recommendation.
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 358 downloads last month - 28 stars on GitHub - 1 maintainer
ragrank 0.1.0
An evaluation library for RAG models
6 versions - Latest release: 2 months ago - 60 downloads last month - 3 stars on GitHub - 2 maintainers
fdatasets 1.12.1 removed
HuggingFace/Datasets is an open library of NLP datasets.
1 version - Latest release: about 2 years ago - 14,671 stars on GitHub
naeval 0.2.0
Comparing quality and performance of NLP systems for Russian language
1 version - Latest release: about 4 years ago - 2 dependent repositories - 15 downloads last month - 44 stars on GitHub - 2 maintainers
python-adc-eval 0.2.0
ADC Evaluation Library
8 versions - Latest release: 5 days ago - 96 downloads last month - 1 stars on GitHub - 2 maintainers
Top 6.6% on pypi.org
fore 0.1.7
fore ai packages
9 versions - Latest release: 27 days ago - 4 dependent repositories - 79.5 thousand downloads last month - 10 stars on GitHub - 2 maintainers
identitychain 0.0.1
Code Model Trust Evaluation
1 version - Latest release: 7 months ago - 4 downloads last month - 5 stars on GitHub - 2 maintainers
Top 7.7% on pypi.org
insight 1.0
A python library for monitoring, comparing and extracting insights from data.
23 versions - Latest release: 27 days ago - 5 dependent repositories - 6.83 thousand downloads last month - 12 stars on GitHub - 2 maintainers
dyff-client 0.3.2
Python client for the Dyff AI auditing platform.
10 versions - Latest release: 5 days ago - 473 downloads last month - 0 stars on GitLab.com - 6 maintainers
dyff 0.16.0
Meta-package to install the local SDK for the Dyff AI auditing platform.
18 versions - Latest release: 5 days ago - 257 downloads last month - 6 maintainers
evaluators 1.0.3
Various scene understanding and perception evaluation metrics.
3 versions - Latest release: 12 months ago - 35 downloads last month - 28,614 stars on GitHub - 2 maintainers
trajectopy 2.0.10
Trajectory Evaluation in Python
39 versions - Latest release: 6 days ago - 378 downloads last month - 20 stars on GitHub - 2 maintainers
Top 8.9% on pypi.org
codebleu 0.6.0
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.
12 versions - Latest release: about 2 months ago - 3 dependent repositories - 1.52 thousand downloads last month - 31 stars on GitHub - 2 maintainers
Top 3.6% on pypi.org
coconut 3.1.0 💰
Simple, elegant, Pythonic functional programming.
41 versions - Latest release: about 2 months ago - 2 dependent packages - 22 dependent repositories - 2.68 thousand downloads last month - 3,920 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
coconut-develop 3.1.0.post0.dev10 💰
Simple, elegant, Pythonic functional programming.
654 versions - Latest release: 9 days ago - 2 dependent repositories - 3.11 thousand downloads last month - 3,920 stars on GitHub - 2 maintainers
orbis-eval 2.3.5
An Extendable Evaluation Pipeline for Named Entity Drill-Down Analysis
21 versions - Latest release: about 2 years ago - 1 dependent repositories - 135 downloads last month - 8 stars on GitHub - 6 maintainers
dyff-audit 0.3.1
Audit tools for the Dyff AI auditing platform.
12 versions - Latest release: 5 days ago - 355 downloads last month - 0 stars on GitLab.com - 10 maintainers
ntqr 0.3.2
Tools for the logic of evaluation using unlabeled data
5 versions - Latest release: 11 days ago - 254 downloads last month - 34 stars on GitHub - 2 maintainers
Top 4.6% on pypi.org
avalanche-lib 0.5.0 💰
Avalanche: a Comprehensive Framework for Continual Learning Research
7 versions - Latest release: 2 months ago - 2 dependent packages - 10 dependent repositories - 1.29 thousand downloads last month - 1,663 stars on GitHub - 1 maintainer
kolena 1.15.0
Client for Kolena's machine learning testing platform.
62 versions - Latest release: 5 days ago - 1 dependent repositories - 7.38 thousand downloads last month - 38 stars on GitHub - 2 maintainers
kolena-client 1.15.0
Client for Kolena's machine learning testing platform.
67 versions - Latest release: 5 days ago - 2.24 thousand downloads last month - 38 stars on GitHub - 2 maintainers
Top 9.7% on pypi.org
acconeer-exptool 7.10.0
Acconeer Exploration Tool
83 versions - Latest release: 6 days ago - 1 dependent repositories - 1.79 thousand downloads last month - 155 stars on GitHub - 4 maintainers
tsml-eval 0.2.1
A package for benchmarking time series machine learning tools.
6 versions - Latest release: 6 days ago - 95 downloads last month - 20 stars on GitHub - 2 maintainers
lighthouz 0.0.5
Lighthouz AI Python SDK
3 versions - Latest release: 3 months ago - 54 downloads last month - 3 stars on GitHub - 2 maintainers
opencompass 0.2.4
A comprehensive toolkit for large model evaluation
10 versions - Latest release: 6 days ago - 176 downloads last month - 2,370 stars on GitHub - 2 maintainers
pyspark-easy 1.5
Makes pyspark dataframe exploration easy
10 versions - Latest release: about 3 years ago - 1 dependent repositories - 38 downloads last month - 0 stars on GitHub - 2 maintainers
waffle-hub 0.3.1
Waffle hub
31 versions - Latest release: 3 months ago - 192 downloads last month - 39 stars on GitHub - 2 maintainers
nereval 0.2.5
Evaluation script for named entity recognition systems based on F1 score.
3 versions - Latest release: almost 6 years ago - 1 dependent repositories - 469 downloads last month - 66 stars on GitHub - 2 maintainers
checkmarker 0.1.0 removed
A tool to automatically create and evaluate assessments.
1 version - Latest release: 10 months ago
trajectopy-core 2.7.0
Trajectory Evaluation in Python
43 versions - Latest release: 11 days ago - 2 dependent packages - 414 downloads last month - 1 stars on GitHub - 2 maintainers
factscorelite 1.3.0
FactScore (Fine-grained atomic evaluation of factual precision in long form text generation) comp...
10 versions - Latest release: 7 days ago - 882 downloads last month - 0 stars on GitHub - 1 maintainer
zenoml-image-segmentation 0.0.1
Image Segmentation for Zeno
1 version - Latest release: almost 2 years ago - 1 dependent repositories - 8 downloads last month - 208 stars on GitHub - 1 maintainer
llama-index-packs-rag-evaluator 0.1.3
llama-index packs rag_evaluator integration
5 versions - Latest release: 2 months ago - 65 downloads last month - 2 maintainers
alexandra-ai-eval 0.1.0
Evaluation of finetuned models.
1 version - Latest release: about 1 year ago - 1 dependent package - 28 downloads last month - 8 stars on GitHub - 4 maintainers
redlite 0.1.0
LLM testing on steroids
52 versions - Latest release: 11 days ago - 446 downloads last month - 0 stars on GitHub - 1 maintainer
sed-scores-eval 0.0.3
(Threshold-Independent) Evaluation of Sound Event Detection Scores
4 versions - Latest release: 8 days ago - 368 downloads last month - 22 stars on GitHub - 2 maintainers
sacreeos 1.0.2
SacreEOS is a signature generator and implementation helper for the Self Critical Sequence Training
1 version - Latest release: 11 months ago - 8 downloads last month - 0 stars on GitHub - 2 maintainers
bob.bio.htface 1.0.6
Tools for running heterogeneous face recognition experiments
6 versions - Latest release: over 3 years ago - 12 downloads last month - 2 maintainers
evalplatform 1.2.0
Evaluation Platform
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 23 downloads last month - 2 stars on GitHub - 2 maintainers
ward-metrics 0.9.5
Tools for event-based evaluation for activity recognition problems.
6 versions - Latest release: about 6 years ago - 2 dependent repositories - 15 downloads last month - 6 stars on GitHub - 2 maintainers
econll 0.2.5
Extended CoNLL Utilities for Shallow Parsing
8 versions - Latest release: 3 months ago - 52 downloads last month - 2 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
bcubed 1.5
Simple extended BCubed implementation in Python for clustering evaluation
1 version - Latest release: over 5 years ago - 12 dependent repositories - 567 downloads last month - 46 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
nf1 0.0.4
NF1: Normalized F1 score for community evaluation against ground truth
2 versions - Latest release: almost 3 years ago - 1 dependent package - 14 dependent repositories - 71.8 thousand downloads last month - 21 stars on GitHub - 2 maintainers
charcut 1.1.1
Character-based MT evaluation and difference highlighting
2 versions - Latest release: over 1 year ago - 1 dependent package - 4 dependent repositories - 285 downloads last month - 1 stars on GitHub - 2 maintainers
seqscore 0.5.0
SeqScore: Scoring for named entity recognition and other sequence labeling tasks
8 versions - Latest release: 9 months ago - 1 dependent repositories - 62 downloads last month - 19 stars on GitHub - 1 maintainer
phasellm 0.0.21
Wrappers for common large language models (LLMs) with support for evaluation.
21 versions - Latest release: 2 months ago - 1 dependent repositories - 414 downloads last month - 2 maintainers