[ARM] Support fp16 data type in JIT Reorder kernel #2185

Open
dmitry-gorokhov opened this issue Oct 28, 2024 · 0 comments
Labels
enhancement A feature or an optimization request help wanted platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64

Comments

@dmitry-gorokhov
Contributor

Summary

The request is to support the fp16 data type in the jit_uni_reorder kernel on aarch64 HW.

Problem statement

Currently, only the fp32 and bf16 floating-point data types are supported in the optimized reorder implementation on aarch64 HW. An attempt to reorder memory with the fp16 data type falls back to the reference implementation, which might be several times slower than the jitted code.
Several frameworks use fp16 as the default execution type on ARM HW. This creates demand for a highly optimized fp16 reorder to speed up model compilation/preparation time (mostly by optimizing Conv/Matmul weights reorders to blocked formats) and inference time (most models are mixed precision and require multiple fp32<->fp16 and fp16<->u8/i8 conversions).

Preferred solution

Extend the jit_uni_reorder kernel with the fp16 data type to support fp32<->fp16 and fp16<->u8/i8 conversions.
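
To make the fp32->fp16 weights case concrete, here is a minimal scalar sketch of the semantics a jitted kernel would presumably have to match: convert each element to IEEE 754 binary16 (round to nearest even) while moving it from a plain row-major O x I layout into a layout blocked over the output-channel dimension. This is not oneDNN code. The function names `f32_to_f16` and `reorder_f32_plain_to_f16_blocked`, the `[O/B][I][B]` layout, and the zero padding of the tail block are all assumptions made for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: convert one fp32 value to IEEE 754 binary16 bits,
// rounding to nearest even. Not taken from oneDNN.
static uint16_t f32_to_f16(float f) {
    uint32_t x;
    std::memcpy(&x, &f, sizeof(x));
    const uint32_t sign = (x >> 16) & 0x8000u;
    const uint32_t abs = x & 0x7FFFFFFFu;

    if (abs > 0x7F800000u) // NaN -> quiet NaN
        return static_cast<uint16_t>(sign | 0x7E00u);
    if (abs >= 0x477FF000u) // >= 65520 (or inf) rounds to infinity
        return static_cast<uint16_t>(sign | 0x7C00u);
    if (abs < 0x33000000u) // < 2^-25 rounds to zero
        return static_cast<uint16_t>(sign);

    if (abs < 0x38800000u) { // result is an fp16 subnormal
        const uint32_t e = abs >> 23; // biased fp32 exponent, 102..112
        const uint32_t mant = (abs & 0x007FFFFFu) | 0x00800000u;
        const uint32_t shift = 126u - e; // 14..24
        uint32_t q = mant >> shift;
        const uint32_t rem = mant & ((1u << shift) - 1u);
        const uint32_t half = 1u << (shift - 1u);
        if (rem > half || (rem == half && (q & 1u))) ++q;
        return static_cast<uint16_t>(sign | q);
    }

    // Normal range: rebias the exponent (127 -> 15), then round the
    // 13 dropped mantissa bits to nearest even.
    uint32_t r = abs - 0x38000000u;
    r += 0x0FFFu + ((r >> 13) & 1u);
    return static_cast<uint16_t>(sign | (r >> 13));
}

// Hypothetical reference reorder: plain row-major [O][I] fp32 weights to
// [O/B][I][B] fp16, with the tail block zero-padded.
static std::vector<uint16_t> reorder_f32_plain_to_f16_blocked(
        const std::vector<float> &src, size_t O, size_t I, size_t B) {
    assert(B > 0 && src.size() == O * I);
    const size_t OB = (O + B - 1) / B; // number of output-channel blocks
    std::vector<uint16_t> dst(OB * I * B, 0); // 0x0000 is +0.0 in fp16
    for (size_t o = 0; o < O; ++o)
        for (size_t i = 0; i < I; ++i)
            dst[((o / B) * I + i) * B + (o % B)] = f32_to_f16(src[o * I + i]);
    return dst;
}
```

A vectorized kernel would replace the per-element conversion with hardware convert instructions and handle the tail block with masking or predication, but its output should agree with a scalar loop of this shape.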

@dmitry-gorokhov dmitry-gorokhov added the enhancement A feature or an optimization request label Oct 28, 2024
@theComputeKid theComputeKid added the platform:cpu-aarch64 Codeowner: @oneapi-src/onednn-cpu-aarch64 label Oct 28, 2024