An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

npmjs.org "evaluation-framework" keyword

@agentv/eval 2.17.2
Evaluation SDK for AgentV - build custom code judges
35 versions - Latest release: about 5 hours ago - 2.2 thousand downloads last month - 10 stars on GitHub - 1 maintainer
@agentv/core 2.17.2
Primitive runtime components for AgentV
74 versions - Latest release: about 5 hours ago - 2.26 thousand downloads last month - 10 stars on GitHub - 1 maintainer
agentv 2.17.2
CLI entry point for AgentV
76 versions - Latest release: about 5 hours ago - 2.31 thousand downloads last month - 10 stars on GitHub - 1 maintainer
create-empiricalrun 0.1.1
Setup your empiricalrun project effortlessly.
2 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/types 0.11.0
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
17 versions - Latest release: almost 2 years ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers
genkitx-promptfoo 0.1.13
Genkit AI framework plugin for Promptfoo.
14 versions - Latest release: over 1 year ago - 8 downloads last month - 4 stars on GitHub - 1 maintainer
@alexcarol/promptfoo 0.119.11 💰
LLM eval & testing toolkit
1 version - Latest release: 3 months ago - 9 downloads last month - 10,529 stars on GitHub - 1 maintainer
promptfoo 0.120.26 💰
LLM eval & testing toolkit
398 versions - Latest release: 7 days ago - 1 dependent repositories - 416 thousand downloads last month - 3,018 stars on GitHub - 5 maintainers
@empiricalrun/scorer 0.4.0
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
18 versions - Latest release: almost 2 years ago - 15 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/core 0.8.2
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
20 versions - Latest release: almost 2 years ago - 31 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/playwright-utils 0.40.1
Playwright utils for test code repos of our customers
322 versions - Latest release: 15 days ago - 24.9 thousand downloads last month - 161 stars on GitHub - 1 maintainer
@empiricalrun/llm 0.25.2
Package to connect and trace LLM calls.
109 versions - Latest release: 2 months ago - 123 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/reporter 0.27.0
Utility package for parsing and analyzing Playwright test reports.
91 versions - Latest release: about 2 months ago - 2.58 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/fetch 0.3.1
This module provides a fetch instance wrapper designed to handle retries and timeouts seamlessly....
3 versions - Latest release: almost 2 years ago - 9 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/test-gen 0.79.4
## Usage
401 versions - Latest release: 17 days ago - 15.1 thousand downloads last month - 146 stars on GitHub - 1 maintainer
empiricalrun 0.15.3
Empirical CLI for authentication and test generation
9 versions - Latest release: 17 days ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers
@empiricalrun/ai 0.10.1
SDK for calling different LLM APIs using OpenAI format
15 versions - Latest release: almost 2 years ago - 33 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/r2-uploader 0.9.1
Node.js library to upload files to R2
24 versions - Latest release: about 1 month ago - 14.3 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/test-run 0.14.2
## 1. Introduction
56 versions - Latest release: about 1 month ago - 7.57 thousand downloads last month - 161 stars on GitHub - 1 maintainer
@empiricalrun/cli 0.12.1 deprecated
[![npm](https://img.shields.io/npm/v/@empiricalrun/cli)](https://npmjs.com/package/@empiricalrun/...
26 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 2 maintainers