Implement IEEE 754 rounding conditions for fp32 to fp16 conversion in host #74044

DongBaiYue · 2025-07-15T06:40:16Z

PR Category

Operator Mechanism

PR Types

Bug fixes

Description

Problem Summary

When converting fp32 tensors to fp16, current implementation lacks IEEE 754-compliant rounding logic, leading to Precision Error.

Example

import numpy as np
import paddle
import torch

input_data = 1e-05
numpy_tensor = np.array(input_data).astype("float16")
torch_x = torch.tensor(numpy_tensor)
paddle_x = paddle.to_tensor(numpy_tensor)
print("numpy == 1e-5:", numpy_tensor == 1e-5 )              # True
print("torch == 1e-5:", (torch_x == 1e-5).cpu().numpy() )   # True
print("paddle == 1e-5:", (paddle_x == 1e-5 ) )              # False

Solution Approach

Implemented IEEE 754 round-to-nearest-even standard

// Rounding: round to nearest, ties to even
// https://en.wikipedia.org/wiki/Rounding#Rounding_half_to_even
const uint32_t lsb =
    (v.ui >> shift) & 0x1;               // Least significant retained bit
v.ui += (0xFFF + lsb) & -(v.ui < infN);  // Round with overflow protection

Validation

Input (fp32)	Previous Output	New Output
1e-5f	0x00a7	0x00a8
0.333f	0x3553	0x3554
6.10056e-5f	0x03ff	0x0400
5e-8f	0x0000	0x0001

Others

These codes are very difficult to understand, so I have added some comments to clarify them.

Pcard-67164

paddle-bot · 2025-07-15T06:40:27Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

DongBaiYue · 2025-07-15T07:54:54Z

/re-run coverage build

DongBaiYue · 2025-07-15T09:26:37Z

/re-run Infer

DongBaiYue · 2025-07-21T06:20:03Z

/re-run inference build

DongBaiYue · 2025-07-21T12:17:06Z

/re-run all-failed

wanghuancoder

LGTM

DongBaiYue · 2025-07-24T15:55:49Z

/re-run all-failed

DongBaiYue · 2025-07-25T00:25:34Z

/re-run all-failed

DongBaiYue · 2025-07-25T02:26:12Z

/re-run all-failed

DongBaiYue · 2025-07-27T09:29:22Z

/re-run all-failed

wanghuancoder

LGTM

DongBaiYue added 6 commits July 9, 2025 11:33

Implement IEEE 754 rounding conditions for fp32 to fp16 conversion

2ac55ad

Merge remote-tracking branch 'dongbaiyue/develop' into develop

a7ded17

fix indent

b7e42a6

fix codestyle

253fde4

Merge remote-tracking branch 'origin/develop' into develop

4ccd519

Merge remote-tracking branch 'origin/develop' into fp32tofp16

1bdd369

paddle-bot bot added the contributor External developers label Jul 15, 2025

[fp32tofp16] fix subnormal round to normal error

ea6bcac

wanghuancoder previously approved these changes Jul 22, 2025

View reviewed changes

lshpku previously approved these changes Jul 22, 2025

View reviewed changes

fp32tofp16，all test pass!

f6030f4

DongBaiYue dismissed stale reviews from lshpku and wanghuancoder via f6030f4 July 24, 2025 06:23

DongBaiYue added 2 commits July 24, 2025 06:49

fix

519317b

Merge remote-tracking branch 'origin/develop' into fp32tofp16

261a743

DongBaiYue added 2 commits July 28, 2025 04:30

fix int to uint

4cb193f

fix arm bug

507f29a

wanghuancoder approved these changes Jul 31, 2025

View reviewed changes

lshpku approved these changes Jul 31, 2025

View reviewed changes

lshpku merged commit a8bb594 into PaddlePaddle:develop Jul 31, 2025
51 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement IEEE 754 rounding conditions for fp32 to fp16 conversion in host #74044

Implement IEEE 754 rounding conditions for fp32 to fp16 conversion in host #74044

Uh oh!

DongBaiYue commented Jul 15, 2025 •

edited

Loading

Uh oh!

paddle-bot bot commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 21, 2025

Uh oh!

DongBaiYue commented Jul 21, 2025

Uh oh!

wanghuancoder left a comment

Uh oh!

DongBaiYue commented Jul 24, 2025

Uh oh!

DongBaiYue commented Jul 25, 2025

Uh oh!

DongBaiYue commented Jul 25, 2025

Uh oh!

DongBaiYue commented Jul 27, 2025

Uh oh!

wanghuancoder left a comment

Uh oh!

Uh oh!

Uh oh!

Implement IEEE 754 rounding conditions for fp32 to fp16 conversion in host #74044

Implement IEEE 754 rounding conditions for fp32 to fp16 conversion in host #74044

Uh oh!

Conversation

DongBaiYue commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Category

PR Types

Description

Problem Summary

Example

Solution Approach

Validation

Others

Uh oh!

paddle-bot bot commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 15, 2025

Uh oh!

DongBaiYue commented Jul 21, 2025

Uh oh!

DongBaiYue commented Jul 21, 2025

Uh oh!

wanghuancoder left a comment

Choose a reason for hiding this comment

Uh oh!

DongBaiYue commented Jul 24, 2025

Uh oh!

DongBaiYue commented Jul 25, 2025

Uh oh!

DongBaiYue commented Jul 25, 2025

Uh oh!

DongBaiYue commented Jul 27, 2025

Uh oh!

wanghuancoder left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

DongBaiYue commented Jul 15, 2025 •

edited

Loading