Add Support for ROCM (AMD GPUs) #99

jredmondson · 2024-12-20T15:43:04Z

Summary

Nvidia GPUs are ubiquitous in the market and darknet supports them well. However, it would be nice to have options in the AMD GPU market, which tends to be less expensive and often has more VRAM available for darknet training and usage than many of their Nvidia counterparts.

Though there are two options for non-Nvidia support (OpenCL and HIP/ROCM), the mapping between HIP and CUDA calls should be more of a one-to-one, and easier to support. Various papers/research surveys have also noted that OpenCL is generally slower (often by multiples like 2-8x, depending on operations).

In a hackathon, a group submitted a pull request to the old darknet repo with support for HIP. The integration was crude with #defines used to replace CUDA calls with direct HIP calls. It would be appreciated to review this pull request and/or tools that exist for CUDA->HIP porting to support modern AMD GPU optimizations so that the .weights models can be directly loaded onto the GPU without conversions to onnx.

Goals

Implement HIP/ROCM support in darknet for training and running weights models
Integrate HIP/ROCM compilation into the cmake build process

stephanecharette · 2024-12-21T03:44:26Z

This will require the purchase of some AMD hardware, since I only have NVIDIA-based systems.

jredmondson added the enhancement New feature or request label Dec 20, 2024

jredmondson assigned stephanecharette Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Support for ROCM (AMD GPUs) #99

Add Support for ROCM (AMD GPUs) #99

jredmondson commented Dec 20, 2024 •

edited

Loading

stephanecharette commented Dec 21, 2024

Add Support for ROCM (AMD GPUs) #99

Add Support for ROCM (AMD GPUs) #99

Comments

jredmondson commented Dec 20, 2024 • edited Loading

Summary

Goals

stephanecharette commented Dec 21, 2024

jredmondson commented Dec 20, 2024 •

edited

Loading