An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

spack.io "gpu" keyword

View the packages on the spack.io package registry that are tagged with the "gpu" keyword.

cutlass
CUDA Templates for Linear Algebra Subroutines
Latest release: 6 days ago - 8,604 stars on GitHub
Top 3.3% on spack.io
nccl 2.29.2-1
Optimized primitives for collective multi-GPU communication.
51 versions - Latest release: 6 days ago - 14 dependent packages - 4,124 stars on GitHub - 1 maintainer
omega-h 9.34.1
Omega_h is a C++11 library providing data structures and algorithms for adaptive discretizations....
13 versions - Latest release: almost 4 years ago - 2 dependent packages - 102 stars on GitHub - 1 maintainer
alpaka 0.8.0
Abstraction Library for Parallel Kernel Acceleration.
6 versions - Latest release: almost 4 years ago - 1 dependent package - 393 stars on GitHub - 1 maintainer
hipfort 7.1.1
Radeon Open Compute Parallel Primitives Library
47 versions - Latest release: 10 days ago - 81 stars on GitHub - 4 maintainers
nekrs 21.0
nekRS is an open-source Navier Stokes solver based on the spectral element method targeting class...
1 version - Latest release: almost 4 years ago - 1 dependent package - 356 stars on GitHub - 2 maintainers
care 0.3.0
CHAI and RAJA extensions (includes data structures and algorithms).
4 versions - Latest release: almost 4 years ago - 31 stars on GitHub - 2 maintainers
py-mpi4jax 0.3.11.post3
Zero-copy MPI communication of JAX arrays, for turbo-charged HPC applications in Python.
1 version - Latest release: almost 3 years ago - 496 stars on GitHub - 1 maintainer
arborx 1.2
ArborX is a performance-portable library for geometric search
5 versions - Latest release: almost 4 years ago - 4 dependent packages - 210 stars on GitHub - 1 maintainer
Top 8.9% on spack.io
umpire 6.0.0
An application-focused API for memory management on NUMA & GPU architectures
29 versions - Latest release: almost 4 years ago - 8 dependent packages - 382 stars on GitHub - 3 maintainers
hipsycl 0.9.1
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
3 versions - Latest release: almost 4 years ago - 586 stars on GitHub - 1 maintainer
py-fastfold 0.2.0
Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters.
1 version - Latest release: about 3 years ago - 453 stars on GitHub - 1 maintainer
py-transformer-engine
A library for accelerating Transformer models on NVIDIA GPUs, including fp8 precision on Hopper ...
Latest release: 20 days ago - 2,840 stars on GitHub - 1 maintainer
aluminum 1.0.0
Aluminum provides a generic interface to high-performance communication libraries, with a focus o...
12 versions - Latest release: almost 4 years ago - 3 dependent packages - 85 stars on GitHub - 2 maintainers
nvtop 3.0.1
Nvtop stands for Neat Videocard TOP, a (h)top like task monitor for AMD and NVIDIA GPUS. It can h...
10 versions - Latest release: about 3 years ago - 9,582 stars on GitHub - 1 maintainer
py-cuml 0.15.0
cuML is a suite of libraries that implement machine learning algorithms and mathematical primitiv...
1 version - Latest release: almost 4 years ago - 4,968 stars on GitHub - 1 maintainer
py-qiskit-aer 0.11.1
Aer is a high performance simulator for quantum circuits that includes noise models
2 versions - Latest release: about 3 years ago - 596 stars on GitHub - 1 maintainer
Top 9.8% on spack.io
rocprim 7.1.1
Radeon Open Compute Parallel Primitives Library
49 versions - Latest release: 23 days ago - 10 dependent packages - 134 stars on GitHub - 4 maintainers
tsne-cuda 3.0.1
tsne-cuda is an optimized CUDA version of FIt-SNE algorithm with associated python modules. Autho...
2 versions - Latest release: 3 months ago - 1,881 stars on GitHub - 1 maintainer
chai 2.4.0
Copy-hiding array interface for data migration between memory spaces
13 versions - Latest release: almost 4 years ago - 2 dependent packages - 109 stars on GitHub - 4 maintainers
rocm-tensile 7.1.1
Radeon Open Compute Tensile library
49 versions - Latest release: about 1 month ago - 173 stars on GitHub - 4 maintainers
Top 9.5% on spack.io
py-gpustat 0.6.0
An utility to monitor NVIDIA GPU status and usage.
2 versions - Latest release: almost 4 years ago - 1 dependent package - 4,200 stars on GitHub - 1 maintainer
Top 9.5% on spack.io
rocfft 7.1.0
Radeon Open Compute FFT library
48 versions - Latest release: about 1 month ago - 7 dependent packages - 133 stars on GitHub - 5 maintainers
py-heat 1.6.0
Heat is a flexible and seamless open-source software for high performance data analytics and mach...
8 versions - Latest release: about 1 month ago - 158 stars on GitHub - 3 maintainers
libceed 0.1
The CEED API Library: Code for Efficient Extensible Discretizations.
8 versions - Latest release: almost 4 years ago - 2 dependent packages - 236 stars on GitHub - 4 maintainers
py-stringzilla 4.2.1
Search, hash, sort, and process strings faster via SWAR and SIMD
1 version - Latest release: 4 months ago - 2,903 stars on GitHub
rocrand 7.1.0
The rocRAND project provides functions that generate pseudo-random and quasi-random numbers.
45 versions - Latest release: about 1 month ago - 8 dependent packages - 130 stars on GitHub - 4 maintainers
sirius 7.3.0
Domain specific library for electronic structure calculations
30 versions - Latest release: almost 4 years ago - 2 dependent packages - 91 stars on GitHub - 5 maintainers
celeritas 0.5.1
Celeritas is a new Monte Carlo transport code designed for high- performance (GPU-targeted) simul...
19 versions - Latest release: about 1 year ago - 88 stars on GitHub - 2 maintainers
mpibind 0.8.0
A portable runtime library that automatically maps parallel applications to heterogeneous hardwar...
4 versions - Latest release: almost 4 years ago - 46 stars on GitHub - 1 maintainer
cans 1.1.4
CaNS (Canonical Navier-Stokes) is a code for massively-parallel numerical simulations of fluid fl...
4 versions - Latest release: over 3 years ago - 250 stars on GitHub - 4 maintainers
elbencho 2.0-7
Elbencho storage benchmark
8 versions - Latest release: over 2 years ago - 234 stars on GitHub - 1 maintainer
babelstream
Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in ...
Latest release: about 2 months ago - 348 stars on GitHub - 3 maintainers
bohrium 0.9.1
Library for automatic acceleration of array operations
3 versions - Latest release: almost 4 years ago - 218 stars on GitHub - 1 maintainer
cuda-memtest master
Maintained and updated fork of cuda_memtest. original homepage: http://sourceforge.net/projects/c...
1 version - Latest release: almost 4 years ago - 134 stars on GitHub - 1 maintainer
Top 8.2% on spack.io
sycl
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
Latest release: about 2 months ago - 4 dependent packages - 673 stars on GitHub - 1 maintainer
neon
NeoN is a PDE solver for CFD frameworks.
Latest release: about 2 months ago - 75 stars on GitHub - 2 maintainers
tiled-mm
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to bo...
Latest release: about 2 months ago - 17 stars on GitHub - 3 maintainers
rpp 7.0.2
Radeon Performance Primitives (RPP) library is a comprehensive high- performance computer vision ...
26 versions - Latest release: 3 months ago - 1 dependent package - 66 stars on GitHub - 2 maintainers