pypi.org : flashinfer-python
FlashInfer: Kernel Library for LLM Serving
Registry - Source - Documentation - JSON - codemeta.json
purl: pkg:pypi/flashinfer-python
Keywords: attention, cuda, distributed-inference, gpu, jit, large-large-models, llm-inference, moe, nvidia, pytorch
License: Apache-2.0
Latest release: 14 days ago
First release: 10 months ago
Downloads: 317,252 last month
Stars: 3,942 on GitHub
Forks: 537 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 6 days ago