Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata for many open source software ecosystems and registries.

pypi.org "evaluation" keyword

hydrotools.-restclient 3.1.0
General REST api client with built in request caching and retries.
8 versions - Latest release: 7 months ago - 3 dependent packages - 49 stars on GitHub - 3 maintainers
hydrotools.nwm-client-new 7.4.0
Retrieve National Water Model data from various sources.
3 versions - Latest release: 2 months ago - 16 downloads last month - 51 stars on GitHub - 1 maintainer
Top 8.2% on pypi.org
torcheval-nightly 2024.5.17
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
486 versions - Latest release: 14 days ago - 2 dependent packages - 1 dependent repository - 11.8 thousand downloads last month - 156 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
errant 3.0.0
The ERRor ANnotation Toolkit (ERRANT). Automatically extract and classify edits in parallel sente...
19 versions - Latest release: 7 months ago - 13 dependent repositories - 2.93 thousand downloads last month - 410 stars on GitHub - 4 maintainers
errant-prep 3.2.3
The ERRor ANnotation Toolkit (ERRANT). Automatically extract and classify edits in parall...
23 versions - Latest release: 4 months ago - 79 downloads last month - 410 stars on GitHub - 1 maintainer
tsmetrics 0.1.0
Evaluation metrics for time series analysis
1 version - Latest release: over 6 years ago - 1 dependent repository - 17 downloads last month - 4 stars on GitHub - 1 maintainer
rag-eval 0.1.3
A RAG evaluation framework
4 versions - Latest release: 2 months ago - 30 downloads last month - 0 stars on GitLab.com - 1 maintainer
tiger-eval 0.0.2
Text Generation Evaluation Toolkit
2 versions - Latest release: 5 months ago - 31 downloads last month - 1 maintainer
skflex 1.0.2
skflex provides a suite of flexible utility functions for use with the sklearn library
4 versions - Latest release: over 2 years ago - 1 dependent repository - 50 downloads last month - 0 stars on GitHub - 1 maintainer
fair-test 0.1.4
A library to define and publish FAIR metrics tests APIs complying with the FAIRMetrics working gr...
12 versions - Latest release: over 1 year ago - 1 dependent repository - 103 downloads last month - 7 stars on GitHub - 1 maintainer
alpaca-eval 0.6.2
AlpacaEval : An Automatic Evaluator of Instruction-following Models
33 versions - Latest release: about 1 month ago - 2 dependent packages - 6.38 thousand downloads last month - 1,062 stars on GitHub - 3 maintainers
phasellm 0.0.21
Wrappers for common large language models (LLMs) with support for evaluation.
22 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 375 downloads last month - 1 maintainer
topiceval 3.0.0.dev1
Topic Model User Evaluation
69 versions - Latest release: over 6 years ago - 1 dependent repository - 477 downloads last month - 1 maintainer
expression-parse-eval 0.13.0
Mathematical expression calculator in Python
4 versions - Latest release: about 2 years ago - 1 dependent repository - 25 downloads last month - 3 stars on GitHub - 1 maintainer
bob.bio.caffe-face 1.1.3
Face Feature extraction using caffe pre-trained models
5 versions - Latest release: over 4 years ago - 19 downloads last month - 6 maintainers
Top 9.7% on pypi.org
uptrain 0.7.1
UpTrain - tool to evaluate LLM applications on aspects like factual accuracy, response quality, r...
49 versions - Latest release: 17 days ago - 2 dependent packages - 1 dependent repository - 5.18 thousand downloads last month - 2,017 stars on GitHub - 2 maintainers
llama-index-callbacks-uptrain 0.2.0
UpTrain Callback for performing evaluations on the LlamaIndex pipeline
3 versions - Latest release: 17 days ago - 240 downloads last month - 2,017 stars on GitHub - 1 maintainer
clayrs 0.5.1
Complexly represent contents, build recommender systems, evaluate them. All in one place!
12 versions - Latest release: 11 months ago - 38 downloads last month - 32 stars on GitHub - 1 maintainer
jurity 2.0.1
fairness and evaluation library
12 versions - Latest release: 4 months ago - 1 dependent package - 4 dependent repositories - 971 downloads last month - 35 stars on GitHub - 5 maintainers
v-stream 0.1.2
STREAM: Spatio-TempoRal Evaluation and Analysis Metric for Video Generative Models
8 versions - Latest release: 4 months ago - 94 downloads last month - 14 stars on GitHub - 1 maintainer
embedding-evaluator 0.0.1
Embedding Evaluator
1 version - Latest release: almost 3 years ago - 1 dependent repository - 13 downloads last month - 1 maintainer
inspire 1.0.9
Helper library to participate in the INSPIRE challenge
12 versions - Latest release: over 9 years ago - 4 dependent repositories - 70 downloads last month - 2 stars on GitHub - 2 maintainers
metaquantus 0.0.5
MetaQuantus is a XAI performance tool for identifying reliable metrics.
5 versions - Latest release: 9 months ago - 72 downloads last month - 24 stars on GitHub - 1 maintainer
sila2comlib 0.2.0
sila2comlib - a SiLA 2 python3 communication library
1 version - Latest release: over 4 years ago - 1 dependent repository - 22 downloads last month - 9 stars on GitLab.com - 2 maintainers
sila2lib 0.2.5
sila2lib - a SiLA 2 python3 library
4 versions - Latest release: over 3 years ago - 2 dependent repositories - 72 downloads last month - 9 stars on GitLab.com - 2 maintainers
sila2codegenerator 0.2.0
SiLA2 code generator for Python3
3 versions - Latest release: over 4 years ago - 1 dependent repository - 40 downloads last month - 9 stars on GitLab.com - 2 maintainers
Top 4.8% on pypi.org
tidecv 1.0.1
A General Toolbox for Identifying Object Detection Errors
2 versions - Latest release: over 3 years ago - 3 dependent packages - 6 dependent repositories - 3.04 thousand downloads last month - 687 stars on GitHub - 1 maintainer
tidecv-light 1.0.1
A General Toolbox for Identifying Object Detection Errors
1 version - Latest release: over 1 year ago - 74 downloads last month - 687 stars on GitHub - 1 maintainer
meeteval 0.3.0
MeetEval - A meeting transcription evaluation toolkit
5 versions - Latest release: about 1 month ago - 245 downloads last month - 53 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
torch-fidelity 0.3.0
High-fidelity performance metrics for generative models in PyTorch
3 versions - Latest release: almost 3 years ago - 9 dependent packages - 773 dependent repositories - 291 thousand downloads last month - 870 stars on GitHub - 1 maintainer
recmetrics-sweep 0.0.0
Execute all desired rec metrics at once, quickly, including with different values of k.
1 version - Latest release: over 4 years ago - 1 dependent repository - 22 downloads last month - 0 stars on GitHub - 1 maintainer
bt4vt 1.0.1
Bias Tests for Voice Technologies
2 versions - Latest release: over 1 year ago - 30 downloads last month - 11 stars on GitHub - 2 maintainers
maihem 1.4.2
LLM evaluations and synthetic data generation with the MAIHEM models
8 versions - Latest release: about 1 month ago - 382 downloads last month - 12 stars on GitHub - 1 maintainer
ragrank 0.1.0
An evaluation library for RAG models
7 versions - Latest release: 3 months ago - 148 downloads last month - 20 stars on GitHub - 1 maintainer
inginious 0.8.7
An intelligent grader that allows secured and automated testing of code made by students.
17 versions - Latest release: about 1 year ago - 7 dependent repositories - 109 downloads last month - 187 stars on GitHub - 2 maintainers
alpaca-farm 0.2.0
An automatic evaluator for instruction-following language models. Human-validated, high-quality, ...
11 versions - Latest release: 3 months ago - 225 downloads last month - 1,062 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
verif 1.3.0
A verification program for meteorological forecasts and observations
17 versions - Latest release: 3 months ago - 1 dependent package - 5 dependent repositories - 499 downloads last month - 81 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
torcheval 0.0.7
A library for providing a simple interface to create new metrics and an easy-to-use toolkit for m...
7 versions - Latest release: 9 months ago - 25 dependent packages - 8 dependent repositories - 68.4 thousand downloads last month - 194 stars on GitHub - 1 maintainer
lusmu 0.2
A lazy/forced evaluation library
1 version - Latest release: over 10 years ago - 2 dependent repositories - 7 downloads last month - 52 stars on GitHub - 1 maintainer
sacrebleu-macrof 2.0.1
Hassle-free computation of shareable, comparable, and reproducible BLEU, chrF, and TER scores
1 version - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 31 downloads last month - 907 stars on GitHub - 1 maintainer
langcheck 0.7.1
Simple, Pythonic building blocks to evaluate LLM-based applications
12 versions - Latest release: 23 days ago - 2.71 thousand downloads last month - 140 stars on GitHub - 3 maintainers
pycyclops 0.2.8
Framework for healthcare ML implementation
50 versions - Latest release: 16 days ago - 1 dependent package - 742 downloads last month - 62 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
pyprg 0.1
Creates the Precision-Recall-Gain curve and calculates the area under the curve
8 versions - Latest release: 9 months ago - 7 dependent repositories - 2.01 thousand downloads last month - 36 stars on GitHub - 1 maintainer
dinglehopper 0.9.6
The OCR evaluation tool
7 versions - Latest release: 25 days ago - 157 downloads last month - 54 stars on GitHub - 1 maintainer
ml3m 0.0.20
Evaluating your LLM performance
20 versions - Latest release: 8 months ago - 163 downloads last month - 37,327 stars on GitHub - 1 maintainer
reseval 0.1.6
Reproducible Subjective Evaluation
9 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 78 downloads last month - 53 stars on GitHub - 1 maintainer
Top 6.5% on pypi.org
table-evaluator 1.6.1
A package to evaluate how close a synthetic data set is to real data.
31 versions - Latest release: 9 months ago - 3 dependent packages - 5 dependent repositories - 1.79 thousand downloads last month - 74 stars on GitHub - 1 maintainer
Top 6.0% on pypi.org
smatch 1.0.4
Smatch (semantic match) tool
5 versions - Latest release: about 4 years ago - 2 dependent packages - 14 dependent repositories - 871 downloads last month - 62 stars on GitHub - 2 maintainers
Top 5.3% on pypi.org
ranx 0.3.19
ranx: A Blazing-Fast Python Library for Ranking Evaluation, Comparison, and Fusion
45 versions - Latest release: 6 months ago - 4 dependent packages - 7 dependent repositories - 13.2 thousand downloads last month - 348 stars on GitHub - 1 maintainer
oasis 0.1.3
Optimal Asymptotic Sequential Importance Sampling
2 versions - Latest release: almost 3 years ago - 1 dependent package - 3 dependent repositories - 66 downloads last month - 14 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
xai 0.1.0
XAI - An industry-ready machine learning library that ensures explainable AI by design
5 versions - Latest release: over 2 years ago - 24 dependent repositories - 309 downloads last month - 1,053 stars on GitHub - 1 maintainer
coreference-eval 0.0.2
Common metrics and evaluation tools for coreference chains (jsonline format)
2 versions - Latest release: over 1 year ago - 1 dependent repository - 85 downloads last month - 4 stars on GitHub - 1 maintainer
python-grid5000 1.2.4
A python wrapper for the GitLab API.
45 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 416 downloads last month - 2,162 stars on GitHub - 2 maintainers
configspacenni 0.4.7.3
Creation and manipulation of parameter configuration spaces for automated algorithm configuration...
3 versions - Latest release: over 2 years ago - 124 downloads last month - 1 maintainer
Top 3.6% on pypi.org
coconut 3.1.0 💰
Simple, elegant, Pythonic functional programming.
41 versions - Latest release: 3 months ago - 3 dependent packages - 22 dependent repositories - 2.81 thousand downloads last month - 3,951 stars on GitHub - 1 maintainer
yunke_langfuse 2.7.6
A client library for accessing langfuse
2 versions - Latest release: 4 months ago - 23 downloads last month - 2,823 stars on GitHub - 1 maintainer
checkmarker 0.1.0 removed
A tool to automatically create and evaluate assessments.
1 version - Latest release: 11 months ago
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.
5 versions - Latest release: 4 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
costra 1.0
Tool for automatic evaluation of Czech sentence embeddings using Costra 1.1 dataset
1 version - Latest release: over 3 years ago - 1 dependent repository - 20 downloads last month - 0 stars on GitHub - 1 maintainer
disaggregators 0.1.2
HuggingFace community-driven open-source library for dataset disaggregation
3 versions - Latest release: over 1 year ago - 120 downloads last month - 66 stars on GitHub - 1 maintainer
panoptica 0.6.5
Panoptic Quality (PQ) computation for binary masks.
53 versions - Latest release: about 1 month ago - 1 dependent repository - 324 downloads last month - 12 stars on GitHub - 1 maintainer
multimedeval 0.1.1
A Python tool to evaluate the performance of VLM on the medical domain.
2 versions - Latest release: 2 months ago - 56 downloads last month - 18 stars on GitHub - 1 maintainer
umbrela 0.0.7
A Package for generating query-passage relevance assessment labels.
7 versions - Latest release: 29 days ago - 508 downloads last month - 0 stars on GitHub - 1 maintainer
frd-score 0.0.1
Package for calculating Fréchet Radiomics Distance (FRD)
1 version - Latest release: 30 days ago - 79 downloads last month - 0 stars on GitHub - 1 maintainer
booookscore 0.1.3
Official package for our ICLR 2024 paper, "BooookScore: A systematic exploration of book-length s...
4 versions - Latest release: about 2 months ago - 140 downloads last month - 53 stars on GitHub - 1 maintainer
factuality 1.0.14
Benchmarking long-form factuality in large language models. Original code for our paper "Long-for...
4 versions - Latest release: about 2 months ago - 108 downloads last month - 444 stars on GitHub - 1 maintainer
faster-etapr 0.1.2
Faster implementation of the enhanced time-aware precision and recall (eTaPR) for the evaluation ...
3 versions - Latest release: about 2 months ago - 110 downloads last month - 1 star on GitHub - 1 maintainer
easy-lm-eval 0.1.2
A library for easy evaluation of language models
3 versions - Latest release: 3 months ago - 20 downloads last month - 3 stars on GitHub - 1 maintainer
tno.sdg.tabular.eval.utility-metrics 0.3.0
Utility metrics for tabular data
1 version - Latest release: 3 months ago - 8 downloads last month - 2 stars on GitHub - 1 maintainer
llama-index-packs-llama-dataset-metadata 0.1.4
llama-index packs llama_dataset_metadata integration
5 versions - Latest release: about 2 months ago - 53 downloads last month - 1 maintainer
lighthouz 0.0.5
Lighthouz AI Python SDK
3 versions - Latest release: 4 months ago - 54 downloads last month - 3 stars on GitHub - 1 maintainer
mt-thresholds 0.0.4
Tool to check how metric deltas for machine translation reflect on system-level human accuracies.
4 versions - Latest release: 4 months ago - 40 downloads last month - 2 stars on GitHub - 1 maintainer
sensirion-uart-svm4x 2.0.3
SHDLC driver for the Sensirion SVM4X sensor family
1 version - Latest release: 7 months ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
ragstack-ai-langsmith 0.0.1a1 removed
Client library to connect to the LangSmith LLM Tracing and Evaluation Platform.
1 version - Latest release: 7 months ago - 1 maintainer
metric-eval 1.0.2
a python package for evaluating evaluation metrics
3 versions - Latest release: 7 months ago - 14 downloads last month - 6 stars on GitHub - 1 maintainer
synthesized-datasets 1.7
Publicly available datasets for benchmarking and evaluation.
15 versions - Latest release: 3 months ago - 3 dependent packages - 7.72 thousand downloads last month - 1 star on GitHub - 1 maintainer
rke-score 0.0.7
Compute Renyi Kernel Entropy scores (RKE-MC and RRKE) for two sets of vectors.
5 versions - Latest release: 7 months ago - 34 downloads last month - 9 stars on GitHub - 1 maintainer
lighteval 0.3.0
A lightweight and configurable evaluation package
8 versions - Latest release: 2 months ago - 2.07 thousand downloads last month - 299 stars on GitHub - 3 maintainers
identitychain 0.1.0
Evaluation Framework for Code Large Language Models (Code LLMs)
2 versions - Latest release: 27 days ago - 7 downloads last month - 6 stars on GitHub - 1 maintainer
spiraleval 0.1.2 removed
Evaluation for characteristics
3 versions - Latest release: 8 months ago - 249 downloads last month - 1 maintainer
trajectopy-core 3.1.0
Trajectory Evaluation in Python
46 versions - Latest release: 23 days ago - 2 dependent packages - 883 downloads last month - 1 star on GitHub - 1 maintainer
algomaster 0.1.3
The Regression class simplifies regression analysis by providing a convenient and flexible approa...
5 versions - Latest release: 8 months ago - 19 downloads last month - 1 maintainer
xtuning 0.0.0
Fine-tuning, evaluation and data generation for LLMs
1 version - Latest release: about 1 year ago - 3 downloads last month - 1 maintainer
trajectopy 2.0.14
Trajectory Evaluation in Python
43 versions - Latest release: 23 days ago - 838 downloads last month - 21 stars on GitHub - 1 maintainer
spaghettiwithbqn 0.1.2
BQN evaluation in python.
1 version - Latest release: over 1 year ago - 5 downloads last month - 0 stars on GitHub - 1 maintainer
id-marl-eval 0.0.4
A Python library for Multi-Agent Reinforcement Learning evaluation.
4 versions - Latest release: 3 months ago - 39 downloads last month - 44 stars on GitHub - 1 maintainer
daze 0.1.1
Better multi-class confusion matrix plots for Scikit-Learn, incorporating per-class and overall e...
3 versions - Latest release: about 3 years ago - 1 dependent repository - 29 downloads last month - 3 stars on GitHub - 1 maintainer
ner-evaluator 1.0.4
Evaluate Named Entity Recognition (NER) models
4 versions - Latest release: almost 2 years ago - 15 downloads last month - 1 star on GitHub - 1 maintainer
zenoml-audio-transcription 0.0.4
Audio Transcription for Zeno
4 versions - Latest release: over 1 year ago - 1 dependent repository - 14 downloads last month - 205 stars on GitHub - 1 maintainer
zenoml-text-classification 0.0.2
Text Classification for Zeno
2 versions - Latest release: over 1 year ago - 1 dependent repository - 24 downloads last month - 209 stars on GitHub - 1 maintainer
zenoml-image-classification 0.0.3
Image Classification for Zeno
3 versions - Latest release: over 1 year ago - 1 dependent repository - 32 downloads last month - 209 stars on GitHub - 1 maintainer
morphoeval 0.3.0
Evaluation for morphological analysis and segmentation
1 version - Latest release: almost 2 years ago - 19 downloads last month - 2 stars on GitHub - 1 maintainer
evalne-gui 0.1.0
Plotly Dash based GUI for EvalNE
1 version - Latest release: almost 2 years ago - 15 downloads last month - 3 stars on GitHub - 1 maintainer
picai-eval 1.4.6
PICAI Evaluation
5 versions - Latest release: 25 days ago - 1 dependent package - 829 downloads last month - 15 stars on GitHub - 1 maintainer
math-parser 0.1.2
This package evaluates mathematics expressions written in string safely.
1 version - Latest release: almost 2 years ago - 1 dependent repository - 42 downloads last month - 0 stars on GitHub - 1 maintainer
lambre 2.0.2
a tool to measure the grammatical well-formedness of multilingual texts
4 versions - Latest release: almost 2 years ago - 1 dependent repository - 33 downloads last month - 8 stars on GitHub - 1 maintainer
zpyshell 0.1.2.0
Command line shell with script languages, like python
2 versions - Latest release: almost 7 years ago - 1 dependent repository - 15 downloads last month - 7 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
xlcalculator 0.5.0
Converts MS Excel formulas to Python and evaluates them.
28 versions - Latest release: over 1 year ago - 1 dependent repository - 12.1 thousand downloads last month - 105 stars on GitHub - 2 maintainers
unimorph 0.0.4
Annotated morphology in the world's languages
4 versions - Latest release: about 4 years ago - 1 dependent repository - 139 downloads last month - 24 stars on GitHub - 1 maintainer
taqyeem 0.0.1
A python library for recording and reporting evaluation of ml models
1 version - Latest release: over 3 years ago - 1 dependent repository - 12 downloads last month - 1 maintainer