Abstract: The performance of sparse matrix vector multiplication (SpMV) is important to computational scientists. Compressed sparse row (CSR) is the most frequently used format to store sparse ...
This repository contains the artifact for the SC '25 paper submission "KAMI: Communication-Avoiding General Matrix Multiplication within a Single GPU." The NVIDIA GH200 is installed with Ubuntu 22.04 ...
Quantum-inspired adaptive tiling for high-performance matrix multiplication. Uses WKB tunneling physics with the golden ratio to derive optimal tile sizes from real-time CPU state. 15%+ gains on ...