pypi.org "speculative-decoding" keyword
View the packages on the pypi.org package registry that are tagged with the "speculative-decoding" keyword.
eagle-llm 1.2.1
Accelerating LLMs by 3x with No Quality Loss3 versions - Latest release: about 1 year ago - 154 downloads last month - 1,182 stars on GitHub - 1 maintainer
draftretriever 0.1.1
REST: Retrieval-Based Speculative Decoding, NAACL 20242 versions - Latest release: 5 months ago - 178 downloads last month - 198 stars on GitHub - 1 maintainer
tokenswift 0.1.1
Framework for Accelerating LLM Generation3 versions - Latest release: about 2 months ago - 126 downloads last month - 85 stars on GitHub - 1 maintainer
aphrodite-engine 0.6.5 💰
The inference engine for PygmalionAI models29 versions - Latest release: 4 months ago - 2.42 thousand downloads last month - 1,374 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
16 versions - Latest release: 11 months ago - 1 dependent package - 4 dependent repositories - 2.65 thousand downloads last month - 2,133 stars on GitHub - 1 maintainer
intel-extension-for-transformers 1.4.2
Repository of Intel® Intel Extension for Transformers16 versions - Latest release: 11 months ago - 1 dependent package - 4 dependent repositories - 2.65 thousand downloads last month - 2,133 stars on GitHub - 1 maintainer
Related Keywords
llm-inference
4
retrieval
2
large-language-models
1
post-training static quantization
1
post-training dynamic quantization
1
quantization-aware training
1
tuning strategy
1
4-bits
1
autoround
1
chatbot
1
chatpdf
1
gaudi3
1
habana
1
intel-optimized-llamacpp
1
large-language-model
1
llm-cpu
1
neural-chat
1
neural-chat-7b
1
rag
1
streamingllm
1
deepseek
1
inference
1
llm-serving
1
llms
1
qwen
1
transformer
1
api-rest
1
cuda
1
inference-engine
1
inferentia
1
intel
1
lora
1
machine-learning
1
rocm
1
tpu
1
quantization
1
auto-tuning
1