Skip to content

batch bicgstab with batch csr structure problem on gpu #1630

Answered by pratikvn
AbdelhadiKara asked this question in Q&A
Discussion options

You must be logged in to vote

It seems the issue indeed might be due to a limitation of V100. It seems to be work fine on a A100.

This is Ginkgo 1.9.0 (develop)
    running with core module 1.9.0 (develop)
    the reference module is  1.9.0 (develop)
    the OpenMP    module is  not compiled
    the CUDA      module is  1.9.0 (develop)
    the HIP       module is  not compiled
    the DPCPP     module is  not compiled
Residual norm sqrt(r^T r):
 Exec: cuda
 System no. 0: residual norm = 4.17165e-11, implicit residual norm = 4.17166e-11, iterations = 122
Solver type: batch::bicgstab
Matrix size: (4099, 4099)
Num batch entries: 1
Entire solve took: 0.0568908 seconds.

Additionally, for these relatively large matrices, I …

Replies: 2 comments 10 replies

Comment options

You must be logged in to vote
7 replies
@AbdelhadiKara
Comment options

@pratikvn
Comment options

@AbdelhadiKara
Comment options

@pratikvn
Comment options

@AbdelhadiKara
Comment options

Comment options

You must be logged in to vote
3 replies
@AbdelhadiKara
Comment options

@blegouix
Comment options

@pratikvn
Comment options

Answer selected by pratikvn
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
type:batched-functionality This is related to the batched functionality in Ginkgo
4 participants