【Operator Mechanism】add fused_stack_transpose_quant api #73228

ggggxm · 2025-06-10T07:22:05Z

PR Category

Operator Mechanism

PR Types

New features

Description

新增fused_stack_transpose_quant和fused_stack_quant支持

paddle-bot · 2025-06-10T07:22:10Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

zhangbo9674 · 2025-06-11T13:20:10Z

paddle/phi/ops/yaml/ops.yaml

@@ -2382,6 +2382,24 @@
  backward: fused_softmax_mask_upper_triangle_grad
  interfaces : paddle::dialect::InferSymbolicShapeInterface

+- op : fused_stack_quant


有一个专门存放融合算子的：paddle/phi/ops/yaml/fused_ops.yaml

zhangbo9674 · 2025-06-11T13:20:32Z

paddle/phi/kernels/funcs/quant_utils.h

+#include <iostream>
+#include <limits>
+
+// #include "paddle/extension.h"


delete unused code

zhangbo9674 · 2025-06-11T13:20:45Z

paddle/phi/kernels/funcs/quant_utils.h

+    }                                            \
+  }
+
+// 对二维坐标进行swizzle变换，提供相对offset,避免bank conflict


中文注释清理一下

zyfncg · 2025-06-13T09:41:45Z

paddle/phi/infermeta/multiary.cc

@@ -6245,6 +6245,108 @@ void FullWithTensorInferMeta(const IntArray& shape,
  out->set_dtype(dtype);
 }

+std::tuple<int64_t, int64_t, int64_t> FusedStackQuantCommonCheck(


放到fusion.h/.cc中

zyfncg · 2025-06-13T09:42:46Z

paddle/phi/infermeta/multiary.h

+std::tuple<int64_t, int64_t, int64_t> FusedStackQuantCommonCheck(
+    const std::vector<const MetaTensor*>& x);


非InferMeta函数不用把声明加到.h

zyfncg · 2025-06-13T09:43:32Z

test/legacy_test/test_fused_stack_transpose_quant_op.py

@@ -0,0 +1,84 @@
+#  Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.


2023 -> 2025

zyfncg · 2025-06-13T09:43:48Z

python/paddle/nn/functional/fp8.py

+
+
+    """
+    if in_dynamic_mode():


dynamic or pir mode

zyfncg · 2025-06-13T09:43:57Z

python/paddle/nn/functional/fp8.py

+        else:
+            return _C_ops.fused_stack_quant(x)
+
+    else:


else分支去掉

lshpku · 2025-06-13T10:56:27Z

paddle/phi/kernels/fusion/gpu/fused_stack_quant_kernel.cu

+    dev_ctx.template Alloc<phi::dtype::float8_e4m3fn>(out);
+    dev_ctx.template Alloc<float>(scale);
+    auto out_dims = out->dims();
+    out->Resize(out_dims);
+    auto scale_dims = scale->dims();
+    scale->Resize(scale_dims);


一般是先resize才alloc：

Paddle/paddle/fluid/framework/operator.h

Lines 594 to 597 in c999558

phi::DenseTensor tmp;

tmp.Resize(dim);

dev_ctx.template Alloc<T>(&tmp);

return tmp;

还有为啥要把一个tensor resize成自己的dims？这样会产生什么变化吗

zhangbo9674

LGTM

This reverts commit 06ca104.

…#73228) * add fused_stack_transpose_quant api * modify platform check * Revert "modify platform check" This reverts commit 06ca104. * split op test * refine op test

paddle-bot bot added the contributor External developers label Jun 10, 2025

lshpku changed the title ~~【Operator Mechanism 】add fused_stack_transpose_quant api~~ 【Operator Mechanism】add fused_stack_transpose_quant api Jun 10, 2025

ggggxm force-pushed the stack_tranpose_quant branch from b8eefa1 to daff73d Compare June 11, 2025 06:52

zhangbo9674 reviewed Jun 11, 2025

View reviewed changes

ggggxm force-pushed the stack_tranpose_quant branch from daff73d to 149c8c1 Compare June 13, 2025 06:08

zyfncg reviewed Jun 13, 2025

View reviewed changes

lshpku reviewed Jun 13, 2025

View reviewed changes

add fused_stack_transpose_quant api

4ddde16

ggggxm force-pushed the stack_tranpose_quant branch from f6798e5 to 4ddde16 Compare June 13, 2025 12:14

zyfncg previously approved these changes Jun 13, 2025

View reviewed changes

modify platform check

06ca104

ggggxm dismissed zyfncg’s stale review via 06ca104 June 13, 2025 16:36

zhangbo9674 previously approved these changes Jun 14, 2025

View reviewed changes

Revert "modify platform check"

557adf7

This reverts commit 06ca104.

ggggxm dismissed zhangbo9674’s stale review via f3e5a1c June 14, 2025 03:49

ggggxm added 2 commits June 14, 2025 11:49

split op test

f3e5a1c

refine op test

6b1dbee

lshpku approved these changes Jun 14, 2025

View reviewed changes

zhangbo9674 approved these changes Jun 14, 2025

View reviewed changes

zyfncg approved these changes Jun 14, 2025

View reviewed changes

swgu98 added the skip-ci: approval label Jun 14, 2025

zhangbo9674 merged commit 36f76e1 into PaddlePaddle:develop Jun 14, 2025
47 of 52 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

【Operator Mechanism】add fused_stack_transpose_quant api #73228

【Operator Mechanism】add fused_stack_transpose_quant api #73228

Uh oh!

ggggxm commented Jun 10, 2025

Uh oh!

paddle-bot bot commented Jun 10, 2025

Uh oh!

zhangbo9674 Jun 11, 2025

Uh oh!

zhangbo9674 Jun 11, 2025

Uh oh!

zhangbo9674 Jun 11, 2025

Uh oh!

zyfncg Jun 13, 2025

Uh oh!

zyfncg Jun 13, 2025

Uh oh!

zyfncg Jun 13, 2025

Uh oh!

zyfncg Jun 13, 2025

Uh oh!

zyfncg Jun 13, 2025

Uh oh!

lshpku Jun 13, 2025 •

edited

Loading

Uh oh!

zhangbo9674 left a comment

Uh oh!

Uh oh!

Uh oh!

		std::tuple<int64_t, int64_t, int64_t> FusedStackQuantCommonCheck(
		const std::vector<const MetaTensor*>& x);

		@@ -0,0 +1,84 @@
		# Copyright (c) 2023 PaddlePaddle Authors. All Rights Reserved.

	phi::DenseTensor tmp;
	tmp.Resize(dim);
	dev_ctx.template Alloc<T>(&tmp);
	return tmp;

【Operator Mechanism】add fused_stack_transpose_quant api #73228

【Operator Mechanism】add fused_stack_transpose_quant api #73228

Uh oh!

Conversation

ggggxm commented Jun 10, 2025

PR Category

PR Types

Description

Uh oh!

paddle-bot bot commented Jun 10, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lshpku Jun 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

zhangbo9674 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

lshpku Jun 13, 2025 •

edited

Loading