This repository has been archived by the owner on Jan 3, 2023. It is now read-only.

Sarkars/batchnorm update #318

Open · sayantan-nervana wants to merge 4 commits into master
Conversation

sayantan-nervana (Contributor):
Opening a PR for future ngraph BN API change: https://github.com/NervanaSystems/ngraph/pull/2046/files

@sayantan-nervana (Contributor, Author) left a comment:

Future-proofing for this change: NervanaSystems/ngraph#2046

@@ -912,6 +912,81 @@ TEST(NNOps, Conv2DBackpropInputNHWCWithDilation) {
}
} // end of op Conv2DBackpropInputNHWCWithDilation

// FusedBatchNorm : Forward pass, training = true
// TODO fix this test
TEST(NNOps, DISABLED_FusedBatchNormNHWCTrainTrue) {
sayantan-nervana (Contributor, Author) commented:
This test does not pass. Sample output:

[ RUN      ] NNOps.DISABLE_FusedBatchNormNHWCTrainTrue
2018-11-19 01:03:39.177831: I tensorflow/core/common_runtime/process_util.cc:69] Creating new thread pool with default inter op setting: 2. Tune using inter_op_parallelism_threads for best performance.
/localdisk/sarkars/workspace1/tf_ngtf_7_mkl_1_12/ngraph-tf/test/test_utilities.h:126: Failure  
Value of: rt
  Actual: false
Expected: true
 TF output 20.955995559692383
 NG output 20.606725692749023
/localdisk/sarkars/workspace1/tf_ngtf_7_mkl_1_12/ngraph-tf/test/test_utilities.h:126: Failure  
Value of: rt
  Actual: false
Expected: true
 TF output 21.971120834350586
 NG output 21.604936599731445
[  FAILED  ] NNOps.DISABLE_FusedBatchNormNHWCTrainTrue (125 ms)
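
For reference, the boolean rt that fails above comes from a comparison helper in test/test_utilities.h, whose implementation is not shown in this PR. A minimal sketch of the kind of element-wise check it appears to perform, assuming a relative tolerance (the name CompareOutputs and the default rtol are illustrative, not the repo's actual API):

#include <algorithm>
#include <cmath>
#include <cstddef>
#include <iostream>
#include <vector>

// Hypothetical sketch of an element-wise comparison like the one failing
// above: it returns false and prints each mismatching pair, matching the
// " TF output ... / NG output ..." lines in the log.
bool CompareOutputs(const std::vector<float>& tf_out,
                    const std::vector<float>& ng_out, float rtol = 1e-5f) {
  if (tf_out.size() != ng_out.size()) return false;
  bool all_close = true;
  for (std::size_t i = 0; i < tf_out.size(); ++i) {
    float denom = std::max(std::abs(tf_out[i]), std::abs(ng_out[i]));
    if (denom > 0.0f && std::abs(tf_out[i] - ng_out[i]) / denom > rtol) {
      std::cout << " TF output " << tf_out[i] << "\n NG output " << ng_out[i] << "\n";
      all_close = false;
    }
  }
  return all_close;
}

Whatever the exact tolerance used, the mismatches above (e.g. 20.956 vs 20.607) differ by roughly 1.7%, far beyond float rounding noise, which points at a genuine computation difference between the TF and ngraph paths rather than an overly tight threshold.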

ng_y = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 0);
ng_mean = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 1);
ng_variance = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 2);
shared_ptr<ngraph::Node> ng_y_out, ng_mean_out, ng_variance_out;
Reviewer (Contributor):
I must be misunderstanding the training op output order. In ngraph, shouldn't the output order be {gamma, beta, input}? Could you please explain this a little bit?

sayantan-nervana (Contributor, Author) replied:
So this PR was opened to sync with the ngraph PR that reorders batch norm: NervanaSystems/ngraph#2046 (comment)

But apparently that PR has been closed and the change will come later, so I suppose we don't have to do anything for now.
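
For context on the question, a recap of the output indexing in the diff above, as a sketch: training-mode batch norm in ngraph is a multi-output node, and GetOutputElement selects one result by index. As I read NervanaSystems/ngraph#2046, the {gamma, beta, input} ordering in the question is the constructor argument order of the old API (which is what that PR reorders); the output order is {y, mean, variance}, which is what the indices below refer to.

// ng_batch_norm is the training-mode batch norm node built earlier in the
// function; its construction is outside this diff, and its old-API argument
// order {gamma, beta, input} is the part ngraph#2046 proposed to change.
auto ng_y        = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 0);  // normalized output y
auto ng_mean     = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 1);  // batch mean
auto ng_variance = make_shared<ng::op::GetOutputElement>(ng_batch_norm, 2);  // batch variance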

@sayantan-nervana (Contributor, Author):
TESTNOW

1 similar comment

@sayantan-nervana (Contributor, Author):

Failure message:

self = <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>

    def test_tanhgrad_2d(self):
        y = constant_op.constant(
            self.generate_random_numbers(30, 1.0, 10.0), shape=[10, 3])
        y_delta = constant_op.constant(
            self.generate_random_numbers(30, 0.0, 10.0), shape=[10, 3])
    
        out = tanh_grad(y, y_delta)
    
        def run_test(sess):
            return sess.run(out)
    
>       assert np.allclose(
            self.with_ngraph(run_test), self.without_ngraph(run_test))
E       assert False
E        +  where False = <function allclose at 0x7f736018d488>(array([[-5.6790155e+02, -1.0593641e+02, -5.1666357e+02],\n       [-9.5448242e+0...e+02],\n       [-4.1972357e+02, -3.4175458e+02, -1.0048141e+02]], dtype=float32), array([[-5.6790155e+02, -1.0593641e+02, -5.1666357e+02],\n       [-9.5448242e+0...e+02],\n       [-4.1972357e+02, -3.4175458e+02, -1.0048141e+02]], dtype=float32))
E        +    where <function allclose at 0x7f736018d488> = np.allclose
E        +    and   array([[-5.6790155e+02, -1.0593641e+02, -5.1666357e+02],\n       [-9.5448242e+0...e+02],\n       [-4.1972357e+02, -3.4175458e+02, -1.0048141e+02]], dtype=float32) = <bound method TestTanhGradOp.with_ngraph of <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>>(<function run_test at 0x7f72300a0a28>)
E        +      where <bound method TestTanhGradOp.with_ngraph of <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>> = <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>.with_ngraph
E        +    and   array([[-5.6790155e+02, -1.0593641e+02, -5.1666357e+02],\n       [-9.5448242e+0...e+02],\n       [-4.1972357e+02, -3.4175458e+02, -1.0048141e+02]], dtype=float32) = <bound method TestTanhGradOp.without_ngraph of <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>>(<function run_test at 0x7f72300a0a28>)
E        +      where <bound method TestTanhGradOp.without_ngraph of <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>> = <test_tanhgrad.TestTanhGradOp object at 0x7f72180e2b50>.without_ngraph

test_tanhgrad.py:45: AssertionError
=============== 1 failed, 79 passed, 51 skipped in 4.62 seconds ================
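
(Note: at the precision shown, the truncated reprs of the with-ngraph and without-ngraph arrays are identical, so the elements that trip np.allclose presumably differ only beyond the printed digits or within the rows elided by the "...". That suggests a small numeric mismatch rather than a structural one.)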

@jianyinglang (Contributor):
One more comment: ngraph core requires the input to be at least 2-dimensional. I think it would be good to add this as a confirmation constraint; a sketch of such a check follows.
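
A minimal sketch of such a constraint, assuming the bridge can refuse to claim the op when the input rank is known to be too small; the helper name CheckBatchNormInputRank and the way the rank would be obtained are illustrative, not an existing function in this repo:

#include "tensorflow/core/lib/core/errors.h"
#include "tensorflow/core/lib/core/status.h"

// Hypothetical confirmation check: ngraph core requires the batch-norm
// input to have rank >= 2, so the bridge could decline to place
// FusedBatchNorm in an ngraph cluster when the known rank is smaller.
tensorflow::Status CheckBatchNormInputRank(int input_rank) {
  if (input_rank < 2) {
    return tensorflow::errors::InvalidArgument(
        "FusedBatchNorm input must have rank >= 2 for ngraph, got rank ",
        input_rank);
  }
  return tensorflow::Status::OK();
}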
