Disable CUDNN_SOFTMAX_FAST or use a separate math mode variable for softmax #506
Since #455 is merged, I want to point out that `CUDNN_SOFTMAX_FAST` can easily cause problems for the attention operation. In the masking scenario, we usually set the masked values to `-Inf` or some very small value, like `-1e9`. But if we want to use `CUDA.math_mode!(CUDA.FAST_MATH)` to accelerate the `gemm`, `softmax` would actually introduce many `NaN`s, presumably because `CUDNN_SOFTMAX_FAST` skips the max-subtraction step that `CUDNN_SOFTMAX_ACCURATE` performs before exponentiation. MWE:
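(The original MWE was not captured here; the following is a hypothetical sketch, not the author's code. It assumes `softmax` on a `CuArray` dispatches to the cuDNN softmax, and uses `-Inf32` masking of an attention-score matrix to illustrate where the `NaN`s can appear.)

```julia
# Hypothetical sketch: masked attention scores + fast math mode.
using CUDA, NNlib

CUDA.math_mode!(CUDA.FAST_MATH)            # fast math, e.g. to speed up gemm

scores = CUDA.randn(Float32, 8, 8)         # attention scores
mask   = CUDA.rand(Float32, 8, 8) .> 0.5f0 # keep roughly half the positions
masked = ifelse.(mask, scores, -Inf32)     # masked positions set to -Inf

y = softmax(masked; dims=1)
any(isnan, y)                              # with CUDNN_SOFTMAX_FAST this can be true
```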