Add CUDA-Enabled Matrix Multiplication #22

tf-mac · 2024-03-08T17:01:54Z

This adds a couple of changes to this project

Enables CUDA in the project, under the __CUDACC__ define
Adds a CUDA-aware MPI distribution scheme
Adds GALATIC to the includes, allowing access to its methods
Combines all of these to add the method Mult_AnXBn_DoubleBuff_CUDA

Initial results are below:

- Begin to add optimizations (remove excessive copies, reverse multiplication order, etc.) - Add some debug statements to find seg fault, to be removed later - Need to offload some kernels (tupling, merging, decompressing) to GPU, and expand to CSC

richardlett · 2024-03-10T06:06:55Z

tf-mac added 27 commits June 29, 2023 15:01

Add initial CUDA work

4cbc1fb

Fix gitignore

768af83

Couple minor changes

8759a96

Fix parfriends.h

a8be3bb

Make working CUDA

a658d8f

Add GALATIC and etc.

8d6a493

Update to newer ideas

d91f5fd

Remove GALATIC submodule

bd09742

And move GALATIC back:

897088f

Add galatic to stage 1

853c985

Move CUDA calls to proper CUDA doublebuffer

03b63cf

Finish proof of concept

5eb0ae4

Add first optimizations

f92d58b

- Begin to add optimizations (remove excessive copies, reverse multiplication order, etc.) - Add some debug statements to find seg fault, to be removed later - Need to offload some kernels (tupling, merging, decompressing) to GPU, and expand to CSC

Mild bug fix

53ca5b5

Various additions

5c7a99a

Whatever is different

c438756

Add new dist scheme

2b47264

Add up to now work

ff20355

Add a wrapper + maybe fix dist

207cfa4

Fix various bugs

2d706e0

Working bar a memory leak

b273e27

Various fixes

15cacf2

Close to ready for pull request, just a bit more cleanup

447d629

Clean up the makefiles

cad8b79

Merge branch 'master' of https://github.com/PASSIONLab/CombBLAS

4fe1c22

Cleanup CMake

4eaca67

Move to Wrap_SR

56132a8

Fix a bunch

7810b3c

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add CUDA-Enabled Matrix Multiplication #22

Add CUDA-Enabled Matrix Multiplication #22

tf-mac commented Mar 8, 2024 •

edited

Loading

richardlett commented Mar 10, 2024

Add CUDA-Enabled Matrix Multiplication #22

Are you sure you want to change the base?

Add CUDA-Enabled Matrix Multiplication #22

Conversation

tf-mac commented Mar 8, 2024 • edited Loading

richardlett commented Mar 10, 2024

tf-mac commented Mar 8, 2024 •

edited

Loading