speculative-decoding | pypi.org keywords

pypi.org "speculative-decoding" keyword

View the packages on the pypi.org package registry that are tagged with the "speculative-decoding" keyword.

eagle-llm 1.2.1

Accelerating LLMs by 3x with No Quality Loss
3 versions - Latest release: about 1 year ago - 154 downloads last month - 1,182 stars on GitHub - 1 maintainer

draftretriever 0.1.1

REST: Retrieval-Based Speculative Decoding, NAACL 2024
2 versions - Latest release: 5 months ago - 178 downloads last month - 198 stars on GitHub - 1 maintainer

tokenswift 0.1.1

Framework for Accelerating LLM Generation
3 versions - Latest release: about 2 months ago - 126 downloads last month - 85 stars on GitHub - 1 maintainer

aphrodite-engine 0.6.5 💰

The inference engine for PygmalionAI models
29 versions - Latest release: 4 months ago - 2.42 thousand downloads last month - 1,374 stars on GitHub - 1 maintainer

Top 6.4% on pypi.org

intel-extension-for-transformers 1.4.2

Repository of Intel® Intel Extension for Transformers
16 versions - Latest release: 11 months ago - 1 dependent package - 4 dependent repositories - 2.65 thousand downloads last month - 2,133 stars on GitHub - 1 maintainer

Related Keywords

llm-inference 4 retrieval 2 large-language-models 1 post-training static quantization 1 post-training dynamic quantization 1 quantization-aware training 1 tuning strategy 1 4-bits 1 autoround 1 chatbot 1 chatpdf 1 gaudi3 1 habana 1 intel-optimized-llamacpp 1 large-language-model 1 llm-cpu 1 neural-chat 1 neural-chat-7b 1 rag 1 streamingllm 1 deepseek 1 inference 1 llm-serving 1 llms 1 qwen 1 transformer 1 api-rest 1 cuda 1 inference-engine 1 inferentia 1 intel 1 lora 1 machine-learning 1 rocm 1 tpu 1 quantization 1 auto-tuning 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Packages