npmjs.org "evaluation-framework" keyword

@agentv/eval 2.17.2

Evaluation SDK for AgentV - build custom code judges
35 versions - Latest release: about 5 hours ago - 2.2 thousand downloads last month - 10 stars on GitHub - 1 maintainer

@agentv/core 2.17.2

Primitive runtime components for AgentV
74 versions - Latest release: about 5 hours ago - 2.26 thousand downloads last month - 10 stars on GitHub - 1 maintainer

agentv 2.17.2

CLI entry point for AgentV
76 versions - Latest release: about 5 hours ago - 2.31 thousand downloads last month - 10 stars on GitHub - 1 maintainer

create-empiricalrun 0.1.1

Setup your empiricalrun project effortlessly.
2 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 1 maintainer

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
17 versions - Latest release: almost 2 years ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers

genkitx-promptfoo 0.1.13

Genkit AI framework plugin for Promptfoo.
14 versions - Latest release: over 1 year ago - 8 downloads last month - 4 stars on GitHub - 1 maintainer

@alexcarol/promptfoo 0.119.11 💰

LLM eval & testing toolkit
1 version - Latest release: 3 months ago - 9 downloads last month - 10,529 stars on GitHub - 1 maintainer

promptfoo 0.120.26 💰

LLM eval & testing toolkit
398 versions - Latest release: 7 days ago - 1 dependent repositories - 416 thousand downloads last month - 3,018 stars on GitHub - 5 maintainers

@empiricalrun/scorer 0.4.0

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
18 versions - Latest release: almost 2 years ago - 15 downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/core 0.8.2

Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap...
20 versions - Latest release: almost 2 years ago - 31 downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/playwright-utils 0.40.1

Playwright utils for test code repos of our customers
322 versions - Latest release: 15 days ago - 24.9 thousand downloads last month - 161 stars on GitHub - 1 maintainer

@empiricalrun/llm 0.25.2

Package to connect and trace LLM calls.
109 versions - Latest release: 2 months ago - 123 thousand downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/reporter 0.27.0

Utility package for parsing and analyzing Playwright test reports.
91 versions - Latest release: about 2 months ago - 2.58 thousand downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/fetch 0.3.1

This module provides a fetch instance wrapper designed to handle retries and timeouts seamlessly....
3 versions - Latest release: almost 2 years ago - 9 downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/test-gen 0.79.4

## Usage
401 versions - Latest release: 17 days ago - 15.1 thousand downloads last month - 146 stars on GitHub - 1 maintainer

empiricalrun 0.15.3

Empirical CLI for authentication and test generation
9 versions - Latest release: 17 days ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers

@empiricalrun/ai 0.10.1

SDK for calling different LLM APIs using OpenAI format
15 versions - Latest release: almost 2 years ago - 33 downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/r2-uploader 0.9.1

Node.js library to upload files to R2
24 versions - Latest release: about 1 month ago - 14.3 thousand downloads last month - 146 stars on GitHub - 1 maintainer

@empiricalrun/test-run 0.14.2

## 1. Introduction
56 versions - Latest release: about 1 month ago - 7.57 thousand downloads last month - 161 stars on GitHub - 1 maintainer

@empiricalrun/cli 0.12.1 deprecated

[![npm](https://img.shields.io/npm/v/@empiricalrun/cli)](https://npmjs.com/package/@empiricalrun/...
26 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 2 maintainers

Related Keywords

llm 17 llmops 17 testing 17 llm-inference 14 test-automation 14 testing-framework 14 agentic-ai 3 llms 3 vscode 3 prompt-testing 3 llm-evaluation-framework 3 llm-evaluation 3 llm-eval 3 prompts 3 evaluation 3 ci 2 ci-cd 2 cicd 2 prompt-engineering 2 rag 2 pentesting 1 red-teaming 1 vulnerability-scanners 1 prompt 1 plugin 1 genkitx 1 firebase 1 generative-ai 1 genai 1 ai 1 eval 1 promptfoo 1 genkit-plugin 1 genkit 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Packages