Ecosyste.ms: Packages
An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.
spack.io "cuda" keyword
cosma 2.5.1
Distributed Communication-Optimal Matrix-Matrix Multiplication Library
8 versions - Latest release: about 2 years ago - 1 dependent package - 143 stars on GitHub - 5 maintainers
sirius 7.3.0
Domain specific library for electronic structure calculations
30 versions - Latest release: about 2 years ago - 2 dependent packages - 91 stars on GitHub - 5 maintainers
libceed 0.1
The CEED API Library: Code for Efficient Extensible Discretizations.
8 versions - Latest release: about 2 years ago - 2 dependent packages - 141 stars on GitHub - 4 maintainers
babelstream
Measure memory transfer rates to/from global device memory on GPUs. This benchmark is similar in ...
Latest release: 23 days ago - 219 stars on GitHub - 4 maintainers
Top 9.8% on spack.io
rocprim 6.1.0
Radeon Open Compute Parallel Primitives Library
32 versions - Latest release: 2 days ago - 10 dependent packages - 134 stars on GitHub - 3 maintainers
rocrand 5.3.0
The rocRAND project provides functions that generate pseudo-random and quasi-random numbers.
20 versions - Latest release: over 1 year ago - 7 dependent packages - 99 stars on GitHub - 3 maintainers
tiled-mm
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to bo...
Latest release: 15 days ago - 17 stars on GitHub - 3 maintainers
hiop 0.5.4
HiOp is an optimization solver for solving certain mathematical optimization problems expressed a...
21 versions - Latest release: about 2 years ago - 1 dependent package - 169 stars on GitHub - 3 maintainers
Top 2.2% on spack.io
hip 5.0.2
HIP is a C++ Runtime API and Kernel Language that allows developers to create portable applicatio...
15 versions - Latest release: about 2 years ago - 82 dependent packages - 3,319 stars on GitHub - 3 maintainers
hipfort 6.0.2
Radeon Open Compute Parallel Primitives Library
29 versions - Latest release: 3 months ago - 57 stars on GitHub - 3 maintainers
rocalution 6.0.2
rocALUTION is a sparse linear algebra library with focus on exploring fine-grained parallelism on...
31 versions - Latest release: 3 months ago - 60 stars on GitHub - 3 maintainers
spfft 1.0.6
Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support.
15 versions - Latest release: about 2 years ago - 2 dependent packages - 41 stars on GitHub - 2 maintainers
spla 1.5.4
Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix...
14 versions - Latest release: about 2 years ago - 1 dependent package - 13 stars on GitHub - 2 maintainers
dbcsr develop
Distributed Block Compressed Sparse Row matrix library.
1 version - Latest release: about 2 years ago - 90 stars on GitHub - 2 maintainers
aluminum 1.0.0
Aluminum provides a generic interface to high-performance communication libraries, with a focus o...
12 versions - Latest release: about 2 years ago - 3 dependent packages - 57 stars on GitHub - 2 maintainers
py-pynvtx 0.3.3
A thin python wrapper for the nvToolsExt (NVTX) library, using pybind11. This wrapper is meant to...
1 version - Latest release: 8 months ago - 1 star on GitHub - 2 maintainers
celeritas 0.4.3
Celeritas is a new Monte Carlo transport code designed for high-performance (GPU-targeted) simul...
16 versions - Latest release: 4 days ago - 30 stars on GitHub - 1 maintainer
hipsycl 0.9.1
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
3 versions - Latest release: about 2 years ago - 586 stars on GitHub - 1 maintainer
cuda-memtest master
Maintained and updated fork of cuda_memtest. original homepage: http://sourceforge.net/projects/c...
1 version - Latest release: about 2 years ago - 85 stars on GitHub - 1 maintainer
py-rmm 0.15.0
RMM: RAPIDS Memory Manager. Achieving optimal performance in GPU-centric workflows frequently req...
1 version - Latest release: about 2 years ago - 1 dependent package - 309 stars on GitHub - 1 maintainer
py-fastfold 0.2.0
Optimizing Protein Structure Prediction Model Training and Inference on GPU Clusters.
1 version - Latest release: over 1 year ago - 453 stars on GitHub - 1 maintainer
py-dace
DaCe is a fast parallel programming framework that takes code in Python/NumPy and other programmi...
Latest release: 1 day ago - 442 stars on GitHub - 1 maintainer
py-transformer-engine
A library for accelerating Transformer models on NVIDIA GPUs, including fp8 precision on Hopper ...
Latest release: about 5 hours ago - 1,446 stars on GitHub - 1 maintainer
bohrium 0.9.1
Library for automatic acceleration of array operations
3 versions - Latest release: about 2 years ago - 218 stars on GitHub - 1 maintainer
Top 8.2% on spack.io
sycl
hipSYCL is an implementation of the SYCL standard programming model over NVIDIA CUDA/AMD HIP
Latest release: 30 days ago - 4 dependent packages - 673 stars on GitHub - 1 maintainer
ascent 0.7.1
Ascent is an open source many-core capable lightweight in situ visualization and analysis infrast...
4 versions - Latest release: about 2 years ago - 4 dependent packages - 128 stars on GitHub - 1 maintainer
arborx 1.2
ArborX is a performance-portable library for geometric search
5 versions - Latest release: about 2 years ago - 4 dependent packages - 99 stars on GitHub - 1 maintainer
omega-h 9.34.1
Omega_h is a C++11 library providing data structures and algorithms for adaptive discretizations....
13 versions - Latest release: about 2 years ago - 2 dependent packages - 102 stars on GitHub - 1 maintainer
py-cuml 0.15.0
cuML is a suite of libraries that implement machine learning algorithms and mathematical primitiv...
1 version - Latest release: about 2 years ago - 3,201 stars on GitHub - 1 maintainer
Top 9.9% on spack.io
librmm 0.15.0
RMM: RAPIDS Memory Manager. Achieving optimal performance in GPU-centric workflows frequently req...
2 versions - Latest release: about 2 years ago - 2 dependent packages - 309 stars on GitHub
cutlass
CUDA Templates for Linear Algebra Subroutines
Latest release: 4 days ago - 2,557 stars on GitHub
Top 7.2% on spack.io
kaldi 2015-10-07
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2...
6 versions - Latest release: about 2 years ago - 1 dependent package - 12,515 stars on GitHub
nnvm 20170418
nnvm is a modular, decentralized and lightweight part to help build deep learning libraries.
2 versions - Latest release: about 2 years ago - 1,651 stars on GitHub
Related Keywords
gpu (18)
mpi (11)
rocm (11)
hpc (10)
opencl (6)
hip (6)
high-performance-computing (5)
linear-algebra (4)
gpgpu (4)
cpp (4)
openmp (3)
sycl (3)
parallel (3)
deep-learning (3)
nvidia (3)
gpu-computing (3)
solver (3)
matrix-multiplication (3)
parallel-computing (3)
gpu-acceleration (3)
fortran (2)
sparse (2)
random (2)
fft (2)
blas (2)
machine-learning (2)
c-plus-plus (2)
radiuss (2)
amd (2)
gemm (2)
optimization (2)
memory-allocation (2)
matmul (2)
memory-management (2)
rapids (2)
nvidia-cuda (2)
high-performance (2)
clang (2)
parallelism (2)
pytorch (2)
kokkos (2)
vivado-hls (1)
hipsycl (1)
programming-language (1)
high-level-synthesis (1)
numpy (1)
multi-core (1)
alphafold2 (1)
python (1)
evoformer (1)
habana-gaudi (1)
fpga (1)
protein-structure (1)
jax (1)
protein-folding (1)
fp8 (1)
opensycl (1)
sandia-national-laboratories (1)
snl-science-libs (1)
triangulation (1)
machine-learning-algorithms (1)
deep-learning-library (1)
kaldi (1)
shell (1)
speaker-id (1)
speaker-verification (1)
speech (1)
speech-recognition (1)
speech-to-text (1)
computation-graph (1)
deployment (1)
metal (1)
nnvm (1)
tvm (1)
analysis (1)
data-viz (1)
rendering (1)
scientific-computing (1)
bounding-volume-hierarchy (1)
clustering (1)
dbscan (1)
distributed (1)
hdbscan (1)
knn-search (1)
nearest-neighbors (1)
cmake (1)
cpp14 (1)
geometry (1)
mesh (1)
mesh-generation (1)
meshing (1)
validation (1)
constrained-optimization (1)
bfgs (1)
acopf (1)
rocblasxt (1)
rocblas (1)
cublasxt (1)
cublas (1)
rng (1)