Introducing @reduce for group level reduction #379
Conversation
lib/CUDAKernels/src/CUDAKernels.jl
Outdated
threadIdx = KernelAbstractions.@index(Local)

# shared mem for a complete reduction
shared = KernelAbstractions.@localmem(T, 1024)
Maybe this is the moment we need dynamic shared memory support?
x-ref: #11
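For context, a minimal sketch of what dynamic shared memory looks like on the CUDA.jl side, where the buffer size is supplied at launch instead of being hard-coded to 1024 elements. The kernel and names below are illustrative only, not part of this PR:

using CUDA

# Illustrative only: a dynamically sized shared buffer in place of a fixed
# KernelAbstractions.@localmem(T, 1024). The size comes from the launch.
function shared_copy_kernel!(out, x)
    tid = threadIdx().x
    shared = CuDynamicSharedArray(eltype(x), blockDim().x)  # sized via the `shmem` launch keyword
    shared[tid] = x[tid]
    sync_threads()
    out[tid] = shared[tid]
    return
end

# x = CUDA.rand(Float32, 256); out = similar(x)
# @cuda threads=length(x) shmem=length(x)*sizeof(Float32) shared_copy_kernel!(out, x)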
lib/CUDAKernels/src/CUDAKernels.jl
Outdated
# perform the reduction
d = 1
while d < threads
    KernelAbstractions.@synchronize()
You are inside CUDAKernels here, and as such you can use CUDA.jl functionality directly.
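For illustration, a hedged sketch of what that could look like with CUDA.jl intrinsics used directly; the names shared, threads, threadIdx, and op are taken from the surrounding snippet and assumed to be in scope:

# Sketch only: the same interleaved tree reduction, using CUDA.sync_threads()
# directly instead of KernelAbstractions.@synchronize().
d = 1
while d < threads
    sync_threads()                       # CUDA.jl barrier, usable inside CUDAKernels
    index = 2 * d * (threadIdx - 1) + 1  # left element of each pair at this stride
    if index + d <= threads
        shared[index] = op(shared[index], shared[index + d])
    end
    d *= 2
end
sync_threads()
# shared[1] now holds the group-level result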
That's correct! But an implementation with KA.jl macros would allow for a single implementation that can run on all supported back-ends. Because of this, I am not sure what the best place for this implementation's code is.
Also, the main difference between the back-ends would be the size of local memory, but using dynamic memory would be a solution to this.
Looks like a great start! Will have to add it to
To make a more generalized @reduce operation, I would work with a Config struct. An example of this can be found in the GemmKernels.jl Config. Based on this struct, the reduction could use atomics and lane/warp reductions.
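A rough sketch of what such a configuration could look like; the field names and defaults below are assumptions for illustration, not an existing API in this PR or in GemmKernels.jl:

# Hypothetical configuration for a generalized group-level reduction.
Base.@kwdef struct ReduceConfig
    groupsize::Int = 256            # work-items per group
    items_per_workitem::Int = 1     # elements each work-item reduces serially first
    use_warp_shuffle::Bool = false  # use lane/warp-level reductions where the back-end supports them
    use_atomics::Bool = false       # combine per-group partials with atomics instead of a second pass
end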
The @reduce macro performs a group-level reduction.
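A hedged usage sketch, assuming a call form like @reduce(op, val); the exact macro signature and the kernel below are illustrative, not taken from this PR:

using KernelAbstractions

# Illustrative kernel: reduce one value per work-item down to one value per group.
# Assumes @reduce(op, val) returns the group-level result; the real signature may differ.
@kernel function group_sum!(out, @Const(x))
    i = @index(Global)
    val = x[i]
    total = @reduce(+, val)
    if @index(Local) == 1
        out[@index(Group)] = total  # one partial result per work-group
    end
end

# Possible launch on the CUDA back-end of that era (CUDAKernels.jl):
# kernel = group_sum!(CUDADevice(), 256)
# wait(kernel(out, x; ndrange=length(x)))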
TODOs: