Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use GEMM kernel for KleidiAI to accelerate FP16Benchmark #3440

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

milpuz01
Copy link

@milpuz01 milpuz01 commented Dec 3, 2024

The following PR shows how to use kernels from KleidiAI to accelerate FP16Benchmark.

There is Makefile.FP16Benchmark.aarch64 that can be used to compile FP16Benchmark and FP16Test on AArch64 platforms with FBGEMM_ENABLE_KLEIDIAI enabled. It assumes that KleidiAI is in the external directory with branch f32_f32_f16p (https://gitlab.arm.com/kleidi/kleidiai/-/tree/f32_f32_f16p?ref_type=heads) in order to be able to access kernels that are implemented in this file KleidiAIFP16UKernelsNeon.cc (https://gitlab.arm.com/kleidi/kleidiai/-/blob/f32_f32_f16p/kai/ukernels/matmul/matmul_f32_f32_f16p/KleidiAIFP16UKernelsNeon.cc?ref_type=heads)

Copy link

netlify bot commented Dec 3, 2024

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 667ce9b
🔍 Latest deploy log https://app.netlify.com/sites/pytorch-fbgemm-docs/deploys/674f3f917af367000807cff3
😎 Deploy Preview https://deploy-preview-3440--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants