Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "llms-benchmarking" keyword
parea-ai 0.2.155
Parea python sdk185 versions - Latest release: 22 days ago - 2 dependent packages - 12 thousand downloads last month - 21 stars on GitHub - 2 maintainers
chem-bench 0.1.0
Benchmark chemistry performance of LLMs1 version - Latest release: 2 months ago - 54 downloads last month - 34 stars on GitHub - 1 maintainer
liah 0.1.6
Insert a Lie in a Haystack and evaluate the model's ability to detect it.3 versions - Latest release: about 2 months ago - 91 downloads last month - 0 stars on GitHub - 1 maintainer
open-llm-benchmark 0.1.0
Evaluate the capability of open-source LLMs in Agent, formatted output, instruction following, lo...1 version - Latest release: about 1 month ago - 99 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
llm
3
vllm
1
openai
1
llm-agent
1
llamacpp
1
large-language-models
1
huggingface
1
evaluation-framework
1
needle-in-haystack
1
long-context
1
needle in a haystack
1
safety
1
materials-science
1
machine-learning
1
llms
1
chemistry
1
benchmark
1
prompt-engineering
1
metrics
1
llmops
1
llm-tools
1
llm-evaluation-toolkit
1
llm-evaluation-framework
1
llm-evaluation
1
llm-eval
1
good-first-issue
1
generative-ai
1