npmjs.org "agent-testing" keyword
tenro 0.0.0
AI agent testing framework1 version - Latest release: 3 months ago - 3 downloads last month - 1 maintainer
@langwatch/scenario 0.4.7
A TypeScript library for testing AI agents using scenarios19 versions - Latest release: about 17 hours ago - 10.4 thousand downloads last month - 617 stars on GitHub - 3 maintainers
@agentforge/testing 0.15.3
Testing utilities for TypeScript AI agents, including mock LLMs, mock tools, state builders, fixt...65 versions - Latest release: 1 day ago - 1.98 thousand downloads last month - 1 stars on GitHub - 1 maintainer
agent-triage 0.2.0 💰
Diagnose your AI agents in production — extract behavioral policies from prompts, evaluate traces...3 versions - Latest release: 1 day ago - 1 maintainer
@sarthi/the-school 2.0.0
Shannon-style AI Agent Curriculum - Self-testing platform with 15 grades from basics to advanced ...1 version - Latest release: 6 days ago - 1 maintainer
@agtlantis/eval 0.1.2
LLM-as-Judge based AI Agent testing library with multi-turn conversations, AI simulated users, an...3 versions - Latest release: 4 days ago - 169 downloads last month - 1 maintainer
agentassay 0.1.0
Token-efficient regression testing for non-deterministic AI agent workflows. Part of Qualixar.1 version - Latest release: 8 days ago - 1 maintainer
newo 3.4.0
NEWO CLI: Professional command-line tool with modular architecture for NEWO AI Agent development....33 versions - Latest release: 3 months ago - 94 downloads last month - 0 stars on GitHub - 1 maintainer
nomos-sdk 1.0.0
TypeScript/JavaScript SDK for interacting with Nomos agents5 versions - Latest release: 9 months ago - 15 downloads last month - 83 stars on GitHub - 1 maintainer
@identro/eval 0.1.11
AI Agent Evaluation System - Test and evaluate AI agents across frameworks12 versions - Latest release: 3 months ago - 42 downloads last month - 0 stars on GitHub - 1 maintainer
agentrails 1.1.0
Safeguard your AI agents - keep them grounded and on the rails30 versions - Latest release: 5 months ago - 1 maintainer
cdp-agent-tester 1.6.1
Universal testing SDK for CDP AgentKit agents using AI-generated personalities with comprehensive...15 versions - Latest release: 5 months ago - 28 downloads last month - 1 maintainer
fireglobe-sdk-client 1.6.15
Universal testing SDK for blockchain agents using AI-generated personalities with comprehensive m...14 versions - Latest release: 5 months ago - 30 downloads last month - 1 maintainer
@basalt-ai/cobalt 0.2.0
Unit testing for AI Agents — test, evaluate, and track your AI experiments2 versions - Latest release: 26 days ago - 167 downloads last month - 7 stars on GitHub - 4 maintainers
@merchantguard/mystery-shopper 1.0.2
Probe AI agents before you trust them — 10 automated probes for security, reliability, ethics, an...2 versions - Latest release: about 1 month ago - 1 maintainer
@wundr.io/agent-eval 1.0.6
Agent evaluation framework with LLM-based grading for AI agent quality assessment2 versions - Latest release: 3 months ago - 1 maintainer
Related Keywords
testing
10
ai
10
llm
8
ai-testing
5
agent
4
typescript
4
langchain
3
evaluation
3
llm-testing
3
ai-agents
3
blockscout
2
real-time-analysis
2
agentkit
2
ai-evaluation
2
prompt-engineering
2
ai-agent
2
cli
2
gpt
2
conversation
2
transaction-analysis
2
openai
2
crypto
2
web3
2
coinbase
2
agents
2
defi
2
personality-testing
2
blockchain
2
triage
1
migration
1
verification
1
webhook-automation
1
nomos
1
chatbot
1
sdk
1
api-client
1
account-migration
1
skill-management
1
workspace
1
guidance
1
nsl
1
customer-attributes
1
project-attributes
1
connectors
1
webhooks
1
integrations
1
sandbox
1
conversations
1
personas
1
chat-history
1
benchmarking
1
continuous-improvement
1
feedback-loop
1
quality-assessment
1
llm-grading
1
agent-eval
1
moltbook
1
merchantguard
1
compliance
1
ethics
1
reliability
1
security
1
mystery-shopper
1
probing
1
llm-judge
1
benchmark
1
asi
1
cdp
1
grounding
1
validation
1
guardrails
1
safety
1
e2e
1
quality-assurance
1
crewai
1
step-guided-agent
1
flow-based-agent
1
agentic-ai
1
evaluation-framework
1
ai-observability
1
langsmith
1
opentelemetry
1
ai-diagnostics
1
ai-quality
1
chatbot-testing
1
conversation-analysis
1
policy-compliance
1
policy
1
prompt-testing
1
llm-ops
1
llm-evaluation
1
agent-observability
1
agent-monitoring
1
ai-agent-debugging
1
langchain-typescript
1
mock-tools
1
mock-llm
1
workflow-testing
1
fixtures
1
mocks
1