Skip to content

AMD ROCm™ Software

AMD ROCm software is AMD's Open Source stack for GPU computation.

To learn more about ROCm, check out our Documentation, Examples, and Developer Hub.

If you have questions or need help, reach out to us on GitHub.

Popular repositories Loading

  1. ROCm ROCm Public

    AMD ROCm™ Software - GitHub Home

    Shell 4.8k 392

  2. HIP HIP Public

    HIP: C++ Heterogeneous-Compute Interface for Portability

    C++ 3.8k 541

  3. MIOpen MIOpen Public

    AMD's Machine Intelligence Library

    Assembly 1.1k 232

  4. tensorflow-upstream tensorflow-upstream Public

    Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    C++ 689 95

  5. HIPIFY HIPIFY Public

    HIPIFY: Convert CUDA to Portable C++ Code

    C++ 534 76

  6. ROCm-docker ROCm-docker Public

    Dockerfiles for the various software layers defined in the ROCm software platform

    Shell 440 67

Repositories

Showing 10 of 295 repositories
  • composable_kernel Public

    Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators

    ROCm/composable_kernel’s past year of commit activity
    C++ 329 137 27 (1 issue needs help) 47 Updated Dec 30, 2024
  • triton Public Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    ROCm/triton’s past year of commit activity
    C++ 101 MIT 1,710 9 43 Updated Dec 30, 2024
  • pytorch Public Forked from pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    ROCm/pytorch’s past year of commit activity
    Python 220 23,547 77 39 Updated Dec 30, 2024
  • llvm-project Public Forked from llvm/llvm-project

    This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.

    ROCm/llvm-project’s past year of commit activity
    LLVM 126 12,531 34 17 Updated Dec 30, 2024
  • tensorflow-upstream Public Forked from tensorflow/tensorflow

    TensorFlow ROCm port

    ROCm/tensorflow-upstream’s past year of commit activity
    C++ 689 Apache-2.0 90,799 69 62 Updated Dec 30, 2024
  • xla Public Forked from openxla/xla

    A machine learning compiler for GPUs, CPUs, and ML accelerators

    ROCm/xla’s past year of commit activity
    C++ 3 Apache-2.0 460 0 18 Updated Dec 29, 2024
  • aomp Public

    AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releases, issues, documentation, packaging, and examples.

    ROCm/aomp’s past year of commit activity
    Fortran 210 Apache-2.0 48 1 43 Updated Dec 29, 2024
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    ROCm/flash-attention’s past year of commit activity
    Python 149 BSD-3-Clause 1,406 24 11 Updated Dec 29, 2024
  • rocWMMA Public

    rocWMMA

    ROCm/rocWMMA’s past year of commit activity
    C++ 96 MIT 26 2 1 Updated Dec 29, 2024
  • rocJPEG Public

    rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.

    ROCm/rocJPEG’s past year of commit activity
    C++ 3 MIT 6 1 1 Updated Dec 29, 2024