proxy.golang.org "evaluation-framework" keyword
Top 6.7% on proxy.golang.org
18 versions - Latest release: 3 months ago - 2,009 stars on GitHub
github.com/huggingface/lighteval v0.13.0
Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends
Top 6.4% on proxy.golang.org
1 version - Latest release: about 1 year ago - 3,018 stars on GitHub
github.com/typpo/promptfoo v0.103.14 💰
Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for ...
Top 6.4% on proxy.golang.org
1 version - Latest release: about 1 year ago - 8,724 stars on GitHub
github.com/promptfoo/promptfoo v0.103.14 💰
Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for L...
Top 7.5% on proxy.golang.org
20 versions - Latest release: 10 months ago - 29 stars on GitHub
github.com/symflower/eval-symflower-codegen-testing v1.1.0
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code...
Top 6.6% on proxy.golang.org
Latest release: about 1 month ago - 39 stars on GitHub
github.com/continuous-security/efda
Evaluation Framework for Dependency Analysis (EFDA)
Top 5.4% on proxy.golang.org
38 versions - Latest release: over 2 years ago - 216 stars on GitHub
github.com/zeno-ml/zeno v0.6.4
AI Data Management & Evaluation Platform
Top 5.9% on proxy.golang.org
20 versions - Latest release: 9 days ago - 8,798 stars on GitHub
github.com/promptfoo/promptfoo/examples/golang-provider v0.0.0-20260302231834-056e9c0640a5 💰
Package main implements a promptfoo provider that uses OpenAI's API. It demonstrates a simple imp...
Top 6.7% on proxy.golang.org
26 versions - Latest release: over 1 year ago - 208 stars on GitHub
github.com/TonicAI/tvalmetrics v6.1.1+incompatible
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applica...
Top 7.6% on proxy.golang.org
20 versions - Latest release: 10 months ago - 29 stars on GitHub
github.com/symflower/eval-codegen-testing v1.1.0
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code...
Top 9.1% on proxy.golang.org
Latest release: 3 months ago - 37 stars on GitHub
github.com/srcclr/efda/golang/modules/modules-basic
Evaluation Framework for Dependency Analysis (EFDA)
Top 6.7% on proxy.golang.org
276 versions - Latest release: 23 days ago - 1,929 stars on GitHub
github.com/mr-gpt/deepeval v3.8.5+incompatible
The LLM Evaluation Framework
Top 6.7% on proxy.golang.org
15 versions - Latest release: 26 days ago - 10,287 stars on GitHub
github.com/EleutherAI/lm-evaluation-harness v0.4.11
A framework for few-shot evaluation of language models.
Top 6.7% on proxy.golang.org
79 versions - Latest release: 10 months ago - 292 stars on GitHub
github.com/athina-ai/athina-evals v1.7.39
Python SDK for running evaluations on LLM generated responses
Top 9.1% on proxy.golang.org
Latest release: 5 months ago - 35 stars on GitHub
github.com/srcclr/efda
Evaluation Framework for Dependency Analysis (EFDA)
Top 6.9% on proxy.golang.org
20 versions - Latest release: 10 months ago - 29 stars on GitHub
github.com/symflower/eval-dev-quality v1.1.0
DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code...
Top 6.7% on proxy.golang.org
26 versions - Latest release: over 1 year ago - 208 stars on GitHub
github.com/tonicai/tvalmetrics v6.1.1+incompatible
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applica...
Top 6.7% on proxy.golang.org
210 versions - 12,542 stars on GitHub
github.com/confident-ai/deepeval
The LLM Evaluation Framework
Top 6.7% on proxy.golang.org
14 versions - Latest release: about 1 month ago - 10,287 stars on GitHub
github.com/eleutherai/lm-evaluation-harness v0.4.10
A framework for few-shot evaluation of language models.
Top 5.6% on proxy.golang.org
12 versions - Latest release: over 1 year ago - 4 stars on GitHub
github.com/yukinagae/genkitx-promptfoo v0.1.13
Community Plugin for Genkit to use Promptfoo
Related Keywords
evaluation (10)
llm-evaluation (7)
llmops (7)
evaluation-metrics (6)
llm (6)
llm-evaluation-framework (6)
llm-eval (5)
llms (5)
rag (5)
prompt-testing (4)
prompts (4)
testing (4)
languages (3)
dependency-analysis (3)
software-quality (3)
software-development (3)
prompt-engineering (3)
cicd (3)
ci-cd (3)
ci (3)
transformer (2)
language-model (2)
llm-evaluation-metrics (2)
retrieval-augmented-generation (2)
large-language-models (2)
ai (2)
vulnerability-scanners (2)
red-teaming (2)
pentesting (2)
promptfoo (1)
prompt (1)
plugin (1)
genkitx (1)
genkit-plugin (1)
genkit (1)
firebase (1)
llm-ops (1)
llm-evaluation-toolkit (1)
huggingface (1)
python (1)
machine-learning (1)
data-science (1)