pypi.org "llm-as-a-judge" keyword
View the packages on the pypi.org package registry that are tagged with the "llm-as-a-judge" keyword.
docling-sdg 0.1.3
Docling for Synthetic Data Generation (SDG) provides a set of tools to create artificial data fro...2 versions - Latest release: 23 days ago - 300 downloads last month - 6 stars on GitHub - 1 maintainer
root-signals 1.5.4
The Python SDK for API of Root Signals18 versions - Latest release: 10 days ago - 1.51 thousand downloads last month - 11 stars on GitHub - 2 maintainers
antibodies-rafaelsandroni 0.0.1
Antibodies for LLM hallucinations1 version - Latest release: 10 months ago - 45 downloads last month - 0 stars on GitHub - 1 maintainer
llm-antibodies 0.0.1
Antibodies for LLM hallucinations1 version - Latest release: 10 months ago - 37 downloads last month - 0 stars on GitHub - 1 maintainer
xfinder 0.2.6
An Robust and Pinpoint Answer Extractor for LLM Evaluation9 versions - Latest release: 3 months ago - 230 downloads last month - 152 stars on GitHub - 1 maintainer
prometheus-eval 0.1.20
A package for evaluating the performance of language models with Prometheus21 versions - Latest release: 8 months ago - 1.38 thousand downloads last month - 880 stars on GitHub - 1 maintainer
Related Keywords
llm
4
llm-as-evaluator
4
python
3
evaluation
3
nli
2
llms
2
hallucinations
2
hallucination-detection
2
chatglm
1
dataset
1
gpt
1
judge-model
1
key-answer-extraction
1
large-language-models
1
lm-evaluation
1
open-compass
1
phi
1
qwen
1
regex
1
reliability
1
reliable-evaluation
1
xfinder
1
gpt4
1
litellm
1
llmops
1
vllm
1
AI
1
artificial intelligence
1
docling
1
document understanding
1
large language models
1
prompt engineering
1
sdg
1
synthetic data generation
1
ai
1
documents
1
question-answering
1
evals
1
observability
1
LLM
1
NLP
1
answer extraction
1
reliable evaluation
1
benchmark
1