【UnitTestFix No.14】fix test_matmul_v2_op.py #75909

Merged
luotao1 merged 1 commit into PaddlePaddle:develop from scyyh11:fix/test_matmul_v2_op
Oct 21, 2025

Conversation

@scyyh11 (Contributor) commented Oct 17, 2025

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

This PR fixes an issue where the matmul_v2 gradient operation produces inconsistent gradient shapes between eager mode and static compilation mode when the inputs are 1-D tensors.

Problem

  • Eager mode correctly returns 1-D gradients (n,) for 1-D inputs.
  • Static compilation mode (with check_prim_pir=True) incorrectly returns 2-D gradients (n, 1) or (1, n) for 1-D inputs.
  • This inconsistency causes test failures with shape mismatch errors like:
  AssertionError: Not equal to tolerance rtol=1e-15, atol=1e-15
  Check static comp grad out failed. Mismatch between static comp and eager on Place(gpu:0)
  static comp grad out tensor: [[-0.01653409 -0.33412735 ...]]  # shape (1, 100)
  eager grad out tensor: [-0.01653409 -0.33412735 ...]        # shape (100,)
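The mismatch can be reproduced in plain NumPy terms (a minimal sketch, independent of Paddle): for z = x·y with 1-D x and y, the analytic gradient with respect to x is y with shape (n,), but the same computation routed through unsqueezed 2-D matrices yields a (1, n) gradient.

```python
import numpy as np

n = 4
x = np.random.rand(n)      # 1-D input, shape (n,)
y = np.random.rand(n)      # 1-D input, shape (n,)

# Eager-style gradient of z = x . y w.r.t. x: dz/dx = y, shape (n,)
eager_grad = y

# Static/composite-style path: promote both operands to 2-D, compute the
# gradient in matrix form, and never squeeze back. Result keeps shape (1, n).
x2 = x.reshape(1, n)       # unsqueeze lhs to (1, n)
y2 = y.reshape(n, 1)       # unsqueeze rhs to (n, 1)
static_grad = y2.T         # dz/dx2 = y2^T, shape (1, n)

print(eager_grad.shape)    # (4,)
print(static_grad.shape)   # (1, 4) -- same values, different rank
```

The values agree element-wise; only the rank differs, which is exactly what the `assert_allclose` shape check in the unit test rejects.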

Root Cause

The shape inconsistency occurs in two places:

  1. Composite implementation (paddle/fluid/primitive/decomp_rule/decomp_vjp/details.h):
    When check_prim_pir=True, the composite matmul_grad function unsqueezes 1-D inputs to 2-D for computation but does not convert the resulting 2-D gradients back to 1-D.

  2. Kernel implementation (paddle/phi/kernels/impl/matmul_grad_kernel_impl.h):
    Internal matrix operations produce 2-D gradients that are not reshaped back to 1-D for 1-D inputs.
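The missing step in both places can be sketched in hypothetical Python pseudocode (this mirrors the structure of the composite rule; it is not Paddle's actual C++ code, and `composite_matmul_grad_buggy` is an illustrative name):

```python
import numpy as np

def composite_matmul_grad_buggy(x, y, grad_out):
    """Illustrative composite VJP for z = matmul(x, y). Both 1-D operands are
    unsqueezed to 2-D so the gradient can be computed with matrix products,
    but the results are returned without squeezing back: this is the bug."""
    x2 = x.reshape(1, -1) if x.ndim == 1 else x   # 1-D lhs -> (1, n)
    y2 = y.reshape(-1, 1) if y.ndim == 1 else y   # 1-D rhs -> (n, 1)
    g = np.atleast_2d(grad_out)                   # scalar output grad -> (1, 1)
    grad_x = g @ y2.T                             # shape (1, n) for 1-D x
    grad_y = x2.T @ g                             # shape (n, 1) for 1-D y
    return grad_x, grad_y                         # bug: still 2-D
```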

Fix

  • In composite implementation, added shape handling logic to squeeze 2-D gradients back to 1-D if the original input was 1-D.
  • In kernel implementation, applied the same logic for consistency.
  • Both paths now check whether the original input was 1-D and squeeze the appropriate dimension to ensure consistent gradient shapes.
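The fix amounts to recording whether each input was originally 1-D and squeezing the matrix-form gradients back before returning them. A hedged Python sketch of that logic (the real change lives in the C++ files named above; the function name here is hypothetical):

```python
import numpy as np

def composite_matmul_grad_fixed(x, y, grad_out):
    """Illustrative fixed VJP: compute gradients in 2-D as before, then
    squeeze them back to 1-D when the corresponding input was 1-D."""
    x_was_1d, y_was_1d = x.ndim == 1, y.ndim == 1
    x2 = x.reshape(1, -1) if x_was_1d else x
    y2 = y.reshape(-1, 1) if y_was_1d else y
    g = np.atleast_2d(grad_out)
    grad_x = g @ y2.T
    grad_y = x2.T @ g
    if x_was_1d:
        grad_x = grad_x.reshape(-1)   # (1, n) -> (n,), matching eager mode
    if y_was_1d:
        grad_y = grad_y.reshape(-1)   # (n, 1) -> (n,)
    return grad_x, grad_y
```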

Changes

  1. Modified matmul_grad in details.h to handle 1-D gradient shapes.
  2. Modified MatmulGradKernel in matmul_grad_kernel_impl.h for consistency.

This fix ensures consistent behavior between eager and static compilation modes, resolving gradient shape mismatch errors in matmul_v2 backpropagation.


[Screenshot: 2025-10-16 10:03 PM]

@luotao1 @YqGe585

…t shape for 1-D tensors in both forward and backward passes. This includes adjustments in the `details.h` and `matmul_grad_kernel_impl.h` files to handle reshaping of gradients appropriately.
@paddle-bot
paddle-bot bot commented Oct 17, 2025

Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI first. See the Paddle CI Manual for details.

@paddle-bot paddle-bot bot added the contributor External developers label Oct 17, 2025
@luotao1 luotao1 added the HappyOpenSource 快乐开源活动issue与PR label Oct 17, 2025
@scyyh11 scyyh11 marked this pull request as ready for review October 17, 2025 08:55
@scyyh11 (Contributor, Author) commented Oct 17, 2025

/re-run all-failed

@A-nnonymous (Contributor) left a comment


LGTM in squeezing logics

@luotao1 luotao1 merged commit 8f6b9df into PaddlePaddle:develop Oct 21, 2025
69 of 71 checks passed
@scyyh11 scyyh11 deleted the fix/test_matmul_v2_op branch October 21, 2025 02:33