Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FOR DRAFT-PR ONLY] Enable -ffp-mode=fast for armclang #2755

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

AdrianLundell
Copy link
Contributor

-ffp-mode=fast enables extra compiler optimizations for floating point operators which increases performance, previously set to -ffp-mode=std as it is uncompatible with std::numeric_limits::quiet_NaN/infinity. See https://developer.arm.com/documentation/dui0774/latest/Compiler-Command-line-Options/-ffp-mode for more info.

This pull-request puts all incompatible code inside the TFLITE_EMULATE_FLOAT flag, which is not defined on arm targets.
It would probably need to be upstreamed to the tensorflow repo, so this is a proof of concept only.
Relevant github issue: #2518

-ffp-mode=fast enables extra compiler optimizations for floating point operators which increases performance,
previously set to -ffp-mode=std as it is uncompatible with std::numeric_limits::quiet_NaN/infinity.
See https://developer.arm.com/documentation/dui0774/latest/Compiler-Command-line-Options/-ffp-mode for more info.

This pull-request puts all incompatible code inside the TFLITE_EMULATE_FLOAT flag, which is not defined on arm targets.

Change-Id: Ic8fab0f11497ef4fd834a3a731a8a5625913486e
Copy link
Contributor

"This PR is being marked as stale due to inactivity. Remove label or comment to prevent closure in 5 days."

@github-actions github-actions bot added the Stale label Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant