pypi.org : flashinfer-python
FlashInfer: Kernel Library for LLM Serving
Registry
-
Source
- Documentation
- JSON
purl: pkg:pypi/flashinfer-python
Keywords:
attention
, cuda
, distributed-inference
, gpu
, jit
, large-large-models
, llm-inference
, moe
, nvidia
, pytorch
License: Apache-2.0
Latest release: 29 days ago
First release: 9 months ago
Downloads: 322,549 last month
Stars: 3,823 on GitHub
Forks: 522 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 1 day ago