npmjs.org "evaluation-framework" keyword
@agentv/eval 2.17.2
Evaluation SDK for AgentV - build custom code judges - 35 versions - Latest release: about 5 hours ago - 2.2 thousand downloads last month - 10 stars on GitHub - 1 maintainer
@agentv/core 2.17.2
Primitive runtime components for AgentV - 74 versions - Latest release: about 5 hours ago - 2.26 thousand downloads last month - 10 stars on GitHub - 1 maintainer
agentv 2.17.2
CLI entry point for AgentV - 76 versions - Latest release: about 5 hours ago - 2.31 thousand downloads last month - 10 stars on GitHub - 1 maintainer
create-empiricalrun 0.1.1
Setup your empiricalrun project effortlessly. - 2 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/types 0.11.0
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap... - 17 versions - Latest release: almost 2 years ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers
genkitx-promptfoo 0.1.13
Genkit AI framework plugin for Promptfoo. - 14 versions - Latest release: over 1 year ago - 8 downloads last month - 4 stars on GitHub - 1 maintainer
@alexcarol/promptfoo 0.119.11 💰
LLM eval & testing toolkit - 1 version - Latest release: 3 months ago - 9 downloads last month - 10,529 stars on GitHub - 1 maintainer
promptfoo 0.120.26 💰
LLM eval & testing toolkit - 398 versions - Latest release: 7 days ago - 1 dependent repository - 416 thousand downloads last month - 3,018 stars on GitHub - 5 maintainers
@empiricalrun/scorer 0.4.0
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap... - 18 versions - Latest release: almost 2 years ago - 15 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/core 0.8.2
Test and evaluate LLMs and model configurations, across all the scenarios that matter for your ap... - 20 versions - Latest release: almost 2 years ago - 31 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/playwright-utils 0.40.1
Playwright utils for test code repos of our customers - 322 versions - Latest release: 15 days ago - 24.9 thousand downloads last month - 161 stars on GitHub - 1 maintainer
@empiricalrun/llm 0.25.2
Package to connect and trace LLM calls. - 109 versions - Latest release: 2 months ago - 123 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/reporter 0.27.0
Utility package for parsing and analyzing Playwright test reports. - 91 versions - Latest release: about 2 months ago - 2.58 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/fetch 0.3.1
This module provides a fetch instance wrapper designed to handle retries and timeouts seamlessly.... - 3 versions - Latest release: almost 2 years ago - 9 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/test-gen 0.79.4
## Usage - 401 versions - Latest release: 17 days ago - 15.1 thousand downloads last month - 146 stars on GitHub - 1 maintainer
empiricalrun 0.15.3
Empirical CLI for authentication and test generation - 9 versions - Latest release: 17 days ago - 23 downloads last month - 146 stars on GitHub - 2 maintainers
@empiricalrun/ai 0.10.1
SDK for calling different LLM APIs using OpenAI format - 15 versions - Latest release: almost 2 years ago - 33 downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/r2-uploader 0.9.1
Node.js library to upload files to R2 - 24 versions - Latest release: about 1 month ago - 14.3 thousand downloads last month - 146 stars on GitHub - 1 maintainer
@empiricalrun/test-run 0.14.2
## 1. Introduction - 56 versions - Latest release: about 1 month ago - 7.57 thousand downloads last month - 161 stars on GitHub - 1 maintainer
@empiricalrun/cli 0.12.1 deprecated
[](https://npmjs.com/package/@empiricalrun/... - 26 versions - Latest release: almost 2 years ago - 13 downloads last month - 146 stars on GitHub - 2 maintainers
Related Keywords
llm 17
llmops 17
testing 17
llm-inference 14
test-automation 14
testing-framework 14
agentic-ai 3
llms 3
vscode 3
prompt-testing 3
llm-evaluation-framework 3
llm-evaluation 3
llm-eval 3
prompts 3
evaluation 3
ci 2
ci-cd 2
cicd 2
prompt-engineering 2
rag 2
pentesting 1
red-teaming 1
vulnerability-scanners 1
prompt 1
plugin 1
genkitx 1
firebase 1
generative-ai 1
genai 1
ai 1
eval 1
promptfoo 1
genkit-plugin 1
genkit 1