proxy.golang.org "llm-evaluation-framework" keyword
View the packages on the proxy.golang.org package registry that are tagged with the "llm-evaluation-framework" keyword.
Top 6.4% on proxy.golang.org
1 version - Latest release: 8 months ago - 8,334 stars on GitHub
github.com/promptfoo/promptfoo v0.103.14 💰
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for L...
Top 5.9% on proxy.golang.org
Latest release: 3 days ago - 8,334 stars on GitHub
github.com/promptfoo/promptfoo/examples/golang-provider 💰
Package main implements a promptfoo provider that uses OpenAI's API. It demonstrates a simple imp...
Top 5.8% on proxy.golang.org
24 versions - Latest release: 2 days ago - 198 stars on GitHub
github.com/cvs-health/langfair v0.7.1
LangFair is a Python library for conducting use-case level LLM bias and fairness assessments
Top 6.7% on proxy.golang.org
210 versions - 10,626 stars on GitHub
github.com/confident-ai/deepeval
The LLM Evaluation Framework
Top 6.4% on proxy.golang.org
1 version - Latest release: 8 months ago - 3,018 stars on GitHub
github.com/typpo/promptfoo v0.103.14 💰
Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for ...
Top 6.7% on proxy.golang.org
246 versions - Latest release: about 1 month ago - 1,929 stars on GitHub
github.com/mr-gpt/deepeval v3.3.5+incompatible
The LLM Evaluation Framework
Top 5.4% on proxy.golang.org
Latest release: 16 days ago - 1 star on GitHub
github.com/petmal/mindtrial/pkg/mistralai
MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/...
Top 5.7% on proxy.golang.org
15 versions - Latest release: 16 days ago - 1 star on GitHub
github.com/petmal/mindtrial v0.7.2
MindTrial: Evaluate and compare AI language models (LLMs) on text-based tasks with optional file/...
Top 5.6% on proxy.golang.org
12 versions - Latest release: 11 months ago - 4 stars on GitHub
github.com/yukinagae/genkitx-promptfoo v0.1.13
Community Plugin for Genkit to use Promptfoo
Related Keywords
llm-evaluation (7)
evaluation-framework (6)
llm (5)
testing (4)
evaluation (4)
llm-eval (4)
llmops (4)
prompts (4)
prompt-testing (4)
llm-evaluation-metrics (3)
rag (3)
prompt-engineering (3)
cicd (3)
ci-cd (3)
ci (3)
llm-comparison (2)
evaluation-metrics (2)
ai-benchmark (2)
ai-evaluation-tools (2)
ai-model-comparison (2)
ai-tool (2)
anthropic (2)
artificial-intelligence-projects (2)
csv-reports (2)
customizable (2)
deepseek (2)
golang-cli (2)
google-gemini-ai (2)
html-reports (2)
language-models-ai (2)
llm-benchmarking (2)
nlp (2)
openai (2)
opensource (2)
yaml-configuration (2)
ai (2)
vulnerability-scanners (2)
red-teaming (2)
pentesting (2)
firebase (1)
genkit (1)
genkit-plugin (1)
genkitx (1)
plugin (1)
prompt (1)
promptfoo (1)
responsible-ai (1)
python (1)
large-language-models (1)
fairness-testing (1)
fairness-ml (1)
fairness-ai (1)
fairness (1)
ethical-ai (1)
bias-detection (1)
bias (1)
artificial-intelligence (1)
ai-safety (1)