Ecosyste.ms: Packages
An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.
spack.io "cuda" keyword
hip 5.0.2
HIP is a C++ Runtime API and Kernel Language that allows developers to create portable applicatio...
Top 2.2% on spack.io - 15 versions - Latest release: about 2 years ago - 82 dependent packages - 3,319 stars on GitHub - 3 maintainers
rocprim 6.1.0
Radeon Open Compute Parallel Primitives Library
Top 9.8% on spack.io - 32 versions - Latest release: 2 days ago - 10 dependent packages - 134 stars on GitHub - 3 maintainers
rocrand 5.3.0
The rocRAND project provides functions that generate pseudo-random and quasi-random numbers.
20 versions - Latest release: over 1 year ago - 7 dependent packages - 99 stars on GitHub - 3 maintainers
sycl
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
Top 8.2% on spack.io - Latest release: about 1 month ago - 4 dependent packages - 673 stars on GitHub - 1 maintainer
ascent 0.7.1
Ascent is an open source many-core capable lightweight in situ visualization and analysis infrast...
4 versions - Latest release: about 2 years ago - 4 dependent packages - 128 stars on GitHub - 1 maintainer
arborx 1.2
ArborX is a performance-portable library for geometric search
5 versions - Latest release: about 2 years ago - 4 dependent packages - 99 stars on GitHub - 1 maintainer
aluminum 1.0.0
Aluminum provides a generic interface to high-performance communication libraries, with a focus o...
12 versions - Latest release: about 2 years ago - 3 dependent packages - 57 stars on GitHub - 2 maintainers
omega-h 9.34.1
Omega_h is a C++11 library providing data structures and algorithms for adaptive discretizations....
13 versions - Latest release: about 2 years ago - 2 dependent packages - 102 stars on GitHub - 1 maintainer
spfft 1.0.6
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support.
15 versions - Latest release: about 2 years ago - 2 dependent packages - 41 stars on GitHub - 2 maintainers
librmm 0.15.0
RMM: RAPIDS Memory Manager. Achieving optimal performance in GPU-centric workflows frequently req...
Top 9.9% on spack.io - 2 versions - Latest release: about 2 years ago - 2 dependent packages - 309 stars on GitHub
libceed 0.1
The CEED API Library: Code for Efficient Extensible Discretizations.
8 versions - Latest release: about 2 years ago - 2 dependent packages - 141 stars on GitHub - 4 maintainers
sirius 7.3.0
Domain specific library for electronic structure calculations
30 versions - Latest release: about 2 years ago - 2 dependent packages - 91 stars on GitHub - 5 maintainers
hiop 0.5.4
HiOp is an optimization solver for solving certain mathematical optimization problems expressed a...
21 versions - Latest release: about 2 years ago - 1 dependent package - 169 stars on GitHub - 3 maintainers
kaldi 2015-10-07
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2...
Top 7.2% on spack.io - 6 versions - Latest release: about 2 years ago - 1 dependent package - 12,515 stars on GitHub
cosma 2.5.1
Distributed Communication-Optimal Matrix-Matrix Multiplication Library
8 versions - Latest release: about 2 years ago - 1 dependent package - 143 stars on GitHub - 5 maintainers
py-rmm 0.15.0
RMM: RAPIDS Memory Manager. Achieving optimal performance in GPU-centric workflows frequently req...
1 version - Latest release: about 2 years ago - 1 dependent package - 309 stars on GitHub - 1 maintainer
spla 1.5.4
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix...
14 versions - Latest release: about 2 years ago - 1 dependent package - 13 stars on GitHub - 2 maintainers
celeritas 0.4.3
Celeritas is a new Monte Carlo transport code designed for high-performance (GPU-targeted) simul...
16 versions - Latest release: 4 days ago - 30 stars on GitHub - 1 maintainer
cutlass
CUDA Templates for Linear Algebra Subroutines
Latest release: 4 days ago - 2,557 stars on GitHub
hipsycl 0.9.1
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
3 versions - Latest release: about 2 years ago - 586 stars on GitHub - 1 maintainer
cuda-memtest master
Maintained and updated fork of cuda_memtest. original homepage: http://sourceforge.net/projects/c...
1 version - Latest release: about 2 years ago - 85 stars on GitHub - 1 maintainer
dbcsr develop
Distributed Block Compressed Sparse Row matrix library.
1 version - Latest release: about 2 years ago - 90 stars on GitHub - 2 maintainers
rocalution 6.0.2
rocALUTION is a sparse linear algebra library with focus on exploring fine-grained parallelism on...
31 versions - Latest release: 3 months ago - 60 stars on GitHub - 3 maintainers
py-fastfold 0.2.0
Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters.
1 version - Latest release: over 1 year ago - 453 stars on GitHub - 1 maintainer
py-dace
DaCe is a fast parallel programming framework that takes code in Python/NumPy and other programmi...
Latest release: 2 days ago - 442 stars on GitHub - 1 maintainer
hipfort 6.0.2
Radeon Open Compute Parallel Primitives Library
29 versions - Latest release: 3 months ago - 57 stars on GitHub - 3 maintainers
py-transformer-engine
A library for accelerating Transformer models on NVIDIA GPUs, including fp8 precision on Hopper ...
Latest release: about 9 hours ago - 1,446 stars on GitHub - 1 maintainer
bohrium 0.9.1
Library for automatic acceleration of array operations
3 versions - Latest release: about 2 years ago - 218 stars on GitHub - 1 maintainer
babelstream
Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in ...
Latest release: 23 days ago - 219 stars on GitHub - 4 maintainers
nnvm 20170418
nnvm is a modular, decentralized and lightweight part to help build deep learning libraries.
2 versions - Latest release: about 2 years ago - 1,651 stars on GitHub
tiled-mm
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to bo...
Latest release: 15 days ago - 17 stars on GitHub - 3 maintainers
py-cuml 0.15.0
cuML is a suite of libraries that implement machine learning algorithms and mathematical primitiv...
1 version - Latest release: about 2 years ago - 3,201 stars on GitHub - 1 maintainer
py-pynvtx 0.3.3
A thin python wrapper for the nvToolsExt (NVTX) library, using pybind11. This wrapper is meant to...
1 version - Latest release: 8 months ago - 1 star on GitHub - 2 maintainers
Related Keywords
gpu (18)
mpi (11)
rocm (11)
hpc (10)
hip (6)
opencl (6)
high-performance-computing (5)
cpp (4)
gpgpu (4)
linear-algebra (4)
deep-learning (3)
parallel-computing (3)
nvidia (3)
openmp (3)
sycl (3)
solver (3)
gpu-acceleration (3)
gpu-computing (3)
matrix-multiplication (3)
parallel (3)
matmul (2)
parallelism (2)
blas (2)
machine-learning (2)
fortran (2)
fft (2)
kokkos (2)
pytorch (2)
memory-allocation (2)
amd (2)
memory-management (2)
c-plus-plus (2)
rapids (2)
optimization (2)
radiuss (2)
random (2)
sparse (2)
clang (2)
high-performance (2)
nvidia-cuda (2)
gemm (2)
api (1)
communication-optimal (1)
alphafold2 (1)
nvtx-markers (1)
cplusplus (1)
sparse-matrix (1)
openmp-parallelization (1)
cp2k (1)
validation (1)
research (1)
memtest (1)
error-monitoring (1)
devops-tools (1)
cloud-computing (1)
deep-learning-library (1)
monte-carlo (1)
pdgemm (1)
scalapack (1)
hep (1)
computational-physics (1)
nvtx (1)
machine-learning-algorithms (1)
rocblasxt (1)
rocblas (1)
cublasxt (1)
cublas (1)
tvm (1)
nnvm (1)
metal (1)
deployment (1)
computation-graph (1)
raja (1)
parallel-processing (1)
openacc (1)
memory-bandwidth (1)
benchmark (1)
numpy (1)
multi-core (1)
python (1)
jax (1)
fp8 (1)
interoperability (1)
vivado-hls (1)
programming-language (1)
high-level-synthesis (1)
fpga (1)
protein-structure (1)
protein-folding (1)
habana-gaudi (1)
evoformer (1)
fft-library (1)
triangulation (1)
snl-science-libs (1)
sandia-national-laboratories (1)
meshing (1)
mesh-generation (1)
mesh (1)
geometry (1)
cpp14 (1)