Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
spack.io "matrix-multiplication" keyword
blas 2023.0.0
AMD Optimized BLIS. BLIS is a portable software framework for instantiating high-performance BLAS...195 versions - Latest release: over 1 year ago - 174 dependent packages - 1,738 stars on GitHub - 2 maintainers
Top 6.4% on spack.io
12 versions - Latest release: about 2 years ago - 3 dependent packages - 1,488 stars on GitHub
blis 0.8.1
BLIS is a portable software framework for instantiating high-performance BLAS-like dense linear a...12 versions - Latest release: about 2 years ago - 3 dependent packages - 1,488 stars on GitHub
cosma 2.5.1
Distributed Communication-Optimal Matrix-Matrix Multiplication Library8 versions - Latest release: about 2 years ago - 1 dependent package - 143 stars on GitHub - 5 maintainers
dbcsr develop
Distributed Block Compressed Sparse Row matrix library.1 version - Latest release: about 2 years ago - 90 stars on GitHub - 2 maintainers
hipblaslt 6.0.2
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and ext...2 versions - Latest release: 19 days ago - 33 stars on GitHub - 3 maintainers
nnpack 2018-04-05
Acceleration package for neural networks on multi-core CPUs.7 versions - Latest release: about 2 years ago - 1,595 stars on GitHub
py-blis 0.9.1
Cython BLIS: Fast BLAS-like operations from Python and Cython, without the tears3 versions - Latest release: 10 months ago - 2 dependent packages - 199 stars on GitHub - 1 maintainer
rocm-tensile 6.0.2
Radeon Open Compute Tensile library31 versions - Latest release: 3 months ago - 173 stars on GitHub - 3 maintainers
tiled-mm
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to bo...Latest release: 23 days ago - 17 stars on GitHub - 3 maintainers
xnnpack 2020-02-24
High-efficiency floating-point neural network inference operators for mobile, server, and Web4 versions - Latest release: about 2 years ago - 1,344 stars on GitHub
Related Keywords
blas
6
linear-algebra
5
neural-networks
4
gemm
3
amd
3
rocm
3
cuda
3
neural-network
3
hpc
3
high-performance-computing
3
high-performance
3
blis
3
blas-libraries
3
simd
2
gpu
2
multithreading
2
inference
2
cpu
2
radeon-open-compute
2
machine-learning
2
hip
2
gpu-computing
2
assembly
2
linear-algebra-library
2
matrix
2
matrix-calculations
2
matrix-functions
2
matrix-library
2
optimization
2
gpu-acceleration
2
matmul
2
mpi
2
rocblas
1
nvidia
1
cublasxt
1
rocblasxt
1
convolutional-neural-network
1
convolutional-neural-networks
1
inference-optimization
1
cublas
1
tensors
1
tensor-contraction
1
radeon
1
python
1
opencl
1
mobile-inference
1
dnn
1
auto-tuning
1
openblas
1
numpy
1
cython
1
winograd-transform
1
fast-fourier-transform
1
communication-optimal
1
convolutional-layers
1
pdgemm
1
scalapack
1
cp2k
1
sparse-matrix
1
openmp-parallelization
1