pypi.org "code-evaluation" keyword
ballerina-platform-codebleu 0.7.1
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.
1 version - Latest release: 9 months ago - 12 downloads last month - 113 stars on GitHub - 1 maintainer
ballerina-codebleu 0.7.1
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.
1 version - Latest release: 9 months ago - 13 downloads last month - 113 stars on GitHub - 1 maintainer
codeoptix 0.1.3
Agentic Code Optimization & Deep Evaluation for Superior Coding Agent Experience. Built by Supera...
4 versions - Latest release: 4 months ago - 31 downloads last month - 1 maintainer
llm-testlab 0.2.0
Comprehensive testing suite for LLM evaluation: hallucination detection, consistency, robustness,...
3 versions - Latest release: 6 months ago - 18 downloads last month - 6 stars on GitHub - 1 maintainer
codesafe 0.0.3
An open-source Python library for code encryption, decryption, and safe evaluation using Python's...
3 versions - Latest release: 7 months ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
codebleu 0.7.0
Top 8.9% on pypi.org
Unofficial CodeBLEU implementation that supports Linux, MacOS and Windows available on PyPI.
14 versions - Latest release: almost 2 years ago - 3 dependent repositories - 4.51 thousand downloads last month - 113 stars on GitHub - 1 maintainer
Related Keywords
nlp (4)
evaluation (4)
evaluation-metrics (3)
code-generation (3)
metrics (3)
code generation (3)
evaluate (3)
programming (3)
natural language processing (3)
bleu (3)
code (3)
codebleu (3)
ai (2)
huggingface (1)
language-models (1)
llama (1)
llm (1)
open-source (1)
openai (1)
prompt-injection (1)
security (1)
up-for-grabs (1)
code-obfuscation (1)
codesafe (1)
eval (1)
python (1)
safe-eval (1)
safe-evaluation (1)
sandboxing (1)
hallucination (1)
good-first-issue (1)
consistency (1)
machine-learning (1)
NLP (1)
semantic-similarity (1)
hallucination-detection (1)
AI (1)
testing (1)
LLM (1)
agentic-coding (1)
deep-evaluation (1)
agent-experience (1)
behavioral-optimization (1)
coding-agents (1)