Add --enable-unsafe-fp-math flag when -O3 and NNPA is used #2963

chentong319 · 2024-10-02T12:41:31Z

Since NNPA uses less precise data, it is reasonable to enable aggressive optimization.
Add the flag only for O3, leaving O0 for debugging.

Signed-off-by: chentong319 <chentong@us.ibm.com>

AlexandreEichenberger

LGTM, will check if it makes a difference for the test that I was looking at.

If it makes a difference, I almost wonder if we should not always turn it on at O3.

AlexandreEichenberger · 2024-10-02T16:19:06Z

src/Compiler/CompilerOptions.cpp

+  // Enable aggressive optimization for NNPA with -O3
+  if (OptimizationLevel == OptLevel::O3 &&
+      getTargetAccel().find("NNPA") != std::string::npos &&
+      getLLVMOption().find("enable-unsafe-fp-math") == std::string::npos) {


I see, the idea is that you just check if that opt is already there. If it's there, either its true and there is no need to add it again, or false and then we don't add it also. Clever.

jenkins-droid · 2024-10-02T17:18:40Z

Jenkins Linux amd64 Build #15754 [push] implement (#2963) Signe... started at 12:18

jenkins-droid · 2024-10-02T17:18:40Z

Jenkins Linux s390x Build #15757 [push] implement (#2963) Signe... started at 13:18

jenkins-droid · 2024-10-02T17:18:40Z

Jenkins Linux ppc64le Build #14784 [push] implement (#2963) Signe... started at 13:30

jenkins-droid · 2024-10-02T18:43:24Z

Jenkins Linux amd64 Build #15754 [push] implement (#2963) Signe... passed after 1 hr 24 min

jenkins-droid · 2024-10-02T19:20:01Z

Jenkins Linux s390x Build #15757 [push] implement (#2963) Signe... passed after 2 hr 1 min

jenkins-droid · 2024-10-02T19:53:51Z

Jenkins Linux ppc64le Build #14784 [push] implement (#2963) Signe... passed after 2 hr 35 min

chentong319 added 2 commits October 2, 2024 08:29

implement

a7c39a4

Signed-off-by: chentong319 <chentong@us.ibm.com>

Merge remote-tracking branch 'upstream/main' into fp-flag

4d260f3

chentong319 requested a review from AlexandreEichenberger October 2, 2024 15:17

AlexandreEichenberger approved these changes Oct 2, 2024

View reviewed changes

chentong319 merged commit ccf9552 into onnx:main Oct 2, 2024
7 checks passed

chentong319 deleted the fp-flag branch October 2, 2024 17:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add --enable-unsafe-fp-math flag when -O3 and NNPA is used #2963

Add --enable-unsafe-fp-math flag when -O3 and NNPA is used #2963

chentong319 commented Oct 2, 2024

AlexandreEichenberger left a comment

AlexandreEichenberger Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

Add --enable-unsafe-fp-math flag when -O3 and NNPA is used #2963

Add --enable-unsafe-fp-math flag when -O3 and NNPA is used #2963

Conversation

chentong319 commented Oct 2, 2024

AlexandreEichenberger left a comment

Choose a reason for hiding this comment

AlexandreEichenberger Oct 2, 2024

Choose a reason for hiding this comment

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024

jenkins-droid commented Oct 2, 2024