An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "llm-testing" keyword

View the packages on the pypi.org package registry that are tagged with the "llm-testing" keyword.

ccheck 0.1.4
A human-friendly framework for testing and evaluating LLMs, RAGs, and chatbots.
5 versions - Latest release: 11 months ago - 69 downloads last month - 81 stars on GitHub - 1 maintainer
llamator 3.4.0
Framework for testing vulnerabilities of GenAI systems.
18 versions - Latest release: 8 days ago - 305 downloads last month - 152 stars on GitHub - 1 maintainer
rhesis-sdk 0.2.4
SDK for testing and validating LLM applications
15 versions - Latest release: 14 days ago - 254 downloads last month - 115 stars on GitHub - 1 maintainer
api-test-ninja 1.0.10
API Testing Framework to automate and simplify API testing using LLM Agents and tests defined in ...
11 versions - Latest release: 5 months ago - 165 downloads last month - 2 stars on GitHub - 1 maintainer
nlptest 1.5.0
John Snow Labs provides a library for delivering safe & effective NLP models.
41 versions - Latest release: over 2 years ago - 1 dependent repository - 230 downloads last month - 536 stars on GitHub - 2 maintainers
sentiebl 0.1.1
Systematic Elicitation of Non-Trivial and Insecure Emergent Behaviors in LLMs
2 versions - Latest release: about 1 month ago
agentneo 1.2.3
A powerful tracing library for monitoring and analyzing AI agents, LLM calls, and tool interactions.
34 versions - Latest release: 9 months ago - 249 downloads last month - 16,179 stars on GitHub - 1 maintainer
tinyqabenchmarkpp 1.2.3 💰
A tiny synthetic QA LLM benchmark dataset generator using LiteLLM.
7 versions - Latest release: 5 months ago - 39 downloads last month - 8 stars on GitHub - 1 maintainer
Related Keywords
llm (6), testing (3), responsible-ai (2), robustness (2), testing-framework (2), ai (2), ai-security (2), rag (2), open-source (2), llm-evaluation (2), large-language-models (2), llmops (2), ai-safety (2), ai-testing (2), nlp (2), ml-safety (1), ml-testing (1), llm-test (1), llm-evaluation-toolkit (1), mlops (1), model-assessment (1), trustworthy-ai (1), gpt-oss (1), tinybenchmarks (1), llm-as-evaluator (1), ethics-in-ai (1), benchmarks (1), benchmark-framework (1), artificial-intelligence (1), accuracy (1), representation (1), fairness (1), bias (1), NLP (1), testing-with-ai (1), restapi-test (1), pytest-api-test (1), synthetic-data (1), smoke-test (1), qa-dataset (1), litellm (1), huggingface-datasets (1), evaluation (1), dataset (1), benchmark (1), llm-tracing (1), ai-tool-interaction-monitoring (1), ai-performance-optimization (1), ai-evaluation-tools (1), ai-application-debugging (1), ai-agent-monitoring (1), agents (1), agentneo (1), agentic-ai-development (1), agentic-ai (1), llm-auditor (1), sentiebl (1), prompt-injection (1), vulnerability-analysis (1), ollama (1), openai (1), red-teaming (1), http-testing (1), owasp (1), misinformation (1), llm-security (1), llm-read-team (1), jailbreak (1), hallucinations (1), attack (1), agent (1), testing-tools (1), summarization-testing (1), rag-testing (1), prompt-test (1), llm-evaluation-framework (1), generative-ai-testing (1), ci (1), chatbot-testing (1), chatbot-framework (1), ai-testing-tool (1), ai-chat (1), Validation (1), Testing (1), Chatbot (1), RAG (1), LLM (1), automated-api-test (1), test automation (1), test api (1), rest api testing (1), pytest api testing (1), openapi testing (1), openai api testing (1), llm testing (1), integration testing (1), api-testing (1), api validation (1), validation (1), trustworthiness (1)