pypi.org "llm-testing" keyword
View the packages on the pypi.org package registry that are tagged with the "llm-testing" keyword.
ccheck 0.1.4
A human-friendly framework for testing and evaluating LLMs, RAGs, and chatbots.5 versions - Latest release: 11 months ago - 69 downloads last month - 81 stars on GitHub - 1 maintainer
llamator 3.4.0
Framework for testing vulnerabilities of GenAI systems.18 versions - Latest release: 8 days ago - 305 downloads last month - 152 stars on GitHub - 1 maintainer
rhesis-sdk 0.2.4
SDK for testing and validating LLM applications15 versions - Latest release: 14 days ago - 254 downloads last month - 115 stars on GitHub - 1 maintainer
api-test-ninja 1.0.10
API Testing Framework to automate and simplify API testing using LLM Agents and tests defined in ...11 versions - Latest release: 5 months ago - 165 downloads last month - 2 stars on GitHub - 1 maintainer
nlptest 1.5.0
John Snow Labs provides a library for delivering safe & effective NLP models.41 versions - Latest release: over 2 years ago - 1 dependent repositories - 230 downloads last month - 536 stars on GitHub - 2 maintainers
sentiebl 0.1.1
Systematic Elicitation of Non-Trivial and Insecure Emergent Behaviors in LLMs2 versions - Latest release: about 1 month ago
agentneo 1.2.3
A powerful tracing library for monitoring and analyzing AI agents, LLM calls, and tool interactions.34 versions - Latest release: 9 months ago - 249 downloads last month - 16,179 stars on GitHub - 1 maintainer
tinyqabenchmarkpp 1.2.3 💰
A tiny synthetic QA LLM benchmark dataset generator using LiteLLM.7 versions - Latest release: 5 months ago - 39 downloads last month - 8 stars on GitHub - 1 maintainer
Related Keywords
llm
6
testing
3
responsible-ai
2
robustness
2
testing-framework
2
ai
2
ai-security
2
rag
2
open-source
2
llm-evaluation
2
large-language-models
2
llmops
2
ai-safety
2
ai-testing
2
nlp
2
ml-safety
1
ml-testing
1
llm-test
1
llm-evaluation-toolkit
1
mlops
1
model-assessment
1
trustworthy-ai
1
gpt-oss
1
tinybenchmarks
1
llm-as-evaluator
1
ethics-in-ai
1
benchmarks
1
benchmark-framework
1
artificial-intelligence
1
accuracy
1
representation
1
fairness
1
bias
1
NLP
1
testing-with-ai
1
restapi-test
1
pytest-api-test
1
synthetic-data
1
smoke-test
1
qa-dataset
1
litellm
1
huggingface-datasets
1
evaluation
1
dataset
1
benchmark
1
llm-tracing
1
ai-tool-interaction-monitoring
1
ai-performance-optimization
1
ai-evaluation-tools
1
ai-application-debugging
1
ai-agent-monitoring
1
agents
1
agentneo
1
agentic-ai-development
1
agentic-ai
1
llm-auditor
1
sentiebl
1
prompt-injection
1
vulnerability-analysis
1
ollama
1
openai
1
red-teaming
1
http-testing
1
owasp
1
misinformation
1
llm-security
1
llm-read-team
1
jailbreak
1
hallucinations
1
attack
1
agent
1
testing-tools
1
summarization-testing
1
rag-testing
1
prompt-test
1
llm-evaluation-framework
1
generative-ai-testing
1
ci
1
chatbot-testing
1
chatbot-framework
1
ai-testing-tool
1
ai-chat
1
Validation
1
Testing
1
Chatbot
1
RAG
1
LLM
1
automated-api-test
1
test automation
1
test api
1
rest api testing
1
pytest api testing
1
openapi testing
1
openai api testing
1
llm testing
1
integration testing
1
api-testing
1
api validation
1
validation
1
trustworthiness
1