[pnorm] fix bug in fp16 & optimize memory #39011
Conversation
Thanks for your contribution!
struct UnsignedPowFunctor {
  HOSTDEVICE explicit inline UnsignedPowFunctor(float porder) {
    this->porder = porder;
  }
  HOSTDEVICE inline Ty operator()(const Tx x) const {
Were the two types here originally meant to support the fp16 scenario? Why are they not used anymore?
The previous implementation was buggy: the fp16 type is converted to float for computation inside the inline function, so there is no need to do the conversion externally.
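A minimal sketch of the fix described above, not the PR's exact code: it assumes Paddle's HOSTDEVICE macro, powf/fabsf stand in for whatever device math helpers the real kernel uses, and a single template parameter T replaces the old Tx/Ty pair.

template <typename T>
struct UnsignedPowFunctor {
  HOSTDEVICE explicit inline UnsignedPowFunctor(float porder)
      : porder(porder) {}
  HOSTDEVICE inline T operator()(const T x) const {
    // Promote to float inside the functor: for T = float16 this is exactly
    // the in-function conversion described above, so callers never need a
    // separate fp16 -> float pass or a second output type.
    return static_cast<T>(powf(fabsf(static_cast<float>(x)), porder));
  }
  float porder;
};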
TensorReduceFunctorImpl<T, T, kps::AddFunctor, UnsignedPowFunctor<T>>(
    *in_x, out_norm, UnsignedPowFunctor<T>(porder), reduce_axis, stream);

const framework::Tensor* tmp_norm = out_norm;
tmp_norm should no longer be needed here, right? out_norm could be used directly.
The type of ins here is std::vector<const framework::Tensor*>, so the pointer needs to be converted to const.
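A self-contained sketch of the pattern in question (Tensor here is a stand-in for framework::Tensor, for illustration only):

#include <vector>

struct Tensor {};  // stand-in for framework::Tensor

int main() {
  Tensor t;
  Tensor* out_norm = &t;              // mutable pointer, as in the kernel
  std::vector<const Tensor*> ins;     // element type is pointer-to-const
  const Tensor* tmp_norm = out_norm;  // makes the Tensor* -> const Tensor*
  ins.push_back(tmp_norm);            // qualification conversion explicit
  return 0;
}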
paddle/fluid/operators/p_norm_op.cu
Outdated
dx->device(place) = dy->broadcast(dim) * equals.select(ones, zeros) *
                    positives.select(ones, negs);
dx->device(place) = dy->broadcast(dim) * (*x).sign() *
                    ((*x).abs() == y->broadcast(dim)).select(ones, zeros);
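For reference, this hunk appears to implement the infinity-norm backward branch: in scalar form, with $y = \max_i |x_i|$,

$\frac{\partial L}{\partial x_i} = \frac{\partial L}{\partial y}\,\operatorname{sign}(x_i)\,\mathbf{1}[\,|x_i| = y\,],$

so gradient flows only to the entries that attain the maximum.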
dx.device(*place) =
    (x.abs() == norm.broadcast(bcast)).template cast<T>() * x.sign() *
    norm_dy.broadcast(bcast);
cast can also be used here.
Do you mean that, written this way, cast alone is enough to obtain the non-zero values?
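A standalone Eigen sketch (illustrative, not Paddle code) answering this question: casting a boolean comparison to a numeric type yields exactly 1 where the condition holds and 0 elsewhere, which is why select(ones, zeros) and the ones/negs/zeros constants in the next hunk become unnecessary.

#include <Eigen/Dense>
#include <iostream>

int main() {
  Eigen::Array3f x(1.f, -2.f, 3.f);
  Eigen::Array3f norm(1.f, 1.f, 3.f);
  // (x.abs() == norm) is a boolean expression; cast<float>() maps
  // true -> 1.0f and false -> 0.0f, replacing select(ones, zeros).
  Eigen::Array3f mask = (x.abs() == norm).cast<float>();
  std::cout << mask.transpose() << std::endl;  // prints: 1 0 1
  return 0;
}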
paddle/fluid/operators/p_norm_op.cu
Outdated
template <typename DeviceContext, typename X, typename Y, typename DX,
          typename DY, typename Dim>
void operator()(const DeviceContext& place, X* x, Y* y, DX* dx, DY* dy,
                const Dim& dim, int size) {
  auto ones = dx->constant(static_cast<T>(1.));
  auto negs = dx->constant(static_cast<T>(-1.));
  auto zeros = dx->constant(static_cast<T>(0.));
auto ones = dx->constant(static_cast<T>(1.));
auto negs = dx->constant(static_cast<T>(-1.));
auto zeros = dx->constant(static_cast<T>(0.));
None of these should be needed anymore, right?
LGTM
LGTM
LGTM
LGTM
LGTM for the change of atol in the float16 unittest (changing it to 1e-3 is acceptable for float16).
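(For context: float16 has 10 explicit mantissa bits, so its machine epsilon is 2^-10 ≈ 9.77e-4; an atol of 1e-3 is therefore roughly one ulp at magnitude 1, about as tight as float16 results can reasonably be checked.)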
PR types
Bug fixes
PR changes
OPs
Describe