Implementation of Incomplete-Cholesky preconditioned conjugate-gradient for symmetric PSD systems using CUDA 12.0.
The Matrix-Vector multiplication has been implemented in CSR format with one warp taking care of one row of matrix. The sparse lower and upper triangular solver have been implemented using cuSPARSE library. Considering the IC decomposition is deprecated in the new version of cuSPARSE, it has been calculated manually on the host side.