pypi.org "llm-testing" keyword
View the packages on the pypi.org package registry that are tagged with the "llm-testing" keyword.
llamator 3.1.0
Framework for testing vulnerabilities of large language models (LLM).15 versions - Latest release: about 6 hours ago - 571 downloads last month - 96 stars on GitHub - 1 maintainer
rhesis-sdk 0.1.7
SDK for testing and validating LLM applications8 versions - Latest release: 2 days ago - 230 downloads last month - 17 stars on GitHub - 1 maintainer
ccheck 0.1.4
A human-friendly framework for testing and evaluating LLMs, RAGs, and chatbots.5 versions - Latest release: 5 months ago - 154 downloads last month - 55 stars on GitHub - 1 maintainer
agentneo 1.2.3
A powerful tracing library for monitoring and analyzing AI agents, LLM calls, and tool interactions.34 versions - Latest release: 3 months ago - 2.44 thousand downloads last month - 15,636 stars on GitHub - 1 maintainer
nlptest 1.5.0
John Snow Labs provides a library for delivering safe & effective NLP models.41 versions - Latest release: almost 2 years ago - 1 dependent repositories - 637 downloads last month - 498 stars on GitHub - 2 maintainers
Related Keywords
llm
4
ai
2
large-language-models
2
open-source
2
llm-evaluation
2
responsible-ai
2
robustness
2
nlp
2
ai-testing
2
testing
2
rag
2
llm-tracing
1
ai-tool-interaction-monitoring
1
ai-performance-optimization
1
ai-evaluation-tools
1
ai-application-debugging
1
ai-agent-monitoring
1
agents
1
agentneo
1
agentic-ai-development
1
testing-tools
1
testing-framework
1
summarization-testing
1
rag-testing
1
prompt-test
1
application-insights
1
trustworthy-ai
1
model-assessment
1
mlops
1
ml-testing
1
ml-safety
1
llm-test
1
llm-evaluation-toolkit
1
llm-as-evaluator
1
ethics-in-ai
1
benchmarks
1
benchmark-framework
1
artificial-intelligence
1
ai-safety
1
accuracy
1
representation
1
fairness
1
bias
1
NLP
1
llmops
1
machine-learning
1
vulnerability-assessment
1
security-tools
1
red-teaming
1
red-team-tools
1
red-team
1
rag-evaluation
1
python
1
owasp
1
misinformation
1
llm-security
1
llm-read-team
1
jailbreak
1
hallucinations
1
attack
1
ai-security
1
llm-evaluation-framework
1
generative-ai-testing
1
ci
1
chatbot-testing
1
chatbot-framework
1
ai-testing-tool
1
ai-chat
1
Validation
1
Testing
1
Chatbot
1
RAG
1
LLM
1
validation
1
trustworthiness
1
reliability
1
quality-assessment
1
compliance
1