
fix(pytorch): Rename layer_scale parameter to avoid quantization error #2172


Merged: 18 commits merged into intel:master on Apr 23, 2025

Conversation

ved1beta
Contributor

Here's the formatted PR description based on our changes:

Type of Change

  • Bug fix
  • No API changes (parameter rename only)

Description

This PR fixes an issue where PyTorch models using layer_scale parameters fail during quantization due to tensor conversion errors. The problem occurs when the quantization process attempts to convert tensor-type scale parameters to Python scalars.

Key changes:

  1. Renamed problematic layer_scale parameter to layer_gamma to avoid conflicts with scale detection
  2. Enhanced _get_module_scale_zeropoint method to better handle scale parameters
  3. Added improved logging for scale parameter detection

The fix maintains backward compatibility while resolving the quantization failure.
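The rename can be illustrated with a pure-Python sketch (no torch; `ConvBlock` and the list stand-in for `nn.Parameter` are illustrative, not the actual model code). The quantizer's name-based heuristic treats any parameter whose name contains "scale" as a quantization scale, so a learnable multiplier named `layer_scale` was picked up by accident:

```python
# Pure-Python sketch of why the rename matters. A name containing "scale"
# is treated as a quantization scale and passed to float(), which fails
# for multi-element tensors.

class ConvBlock:
    """Stand-in for the affected module; a list stands in for nn.Parameter."""

    def __init__(self, dim):
        # Before the fix this attribute was called layer_scale, so the
        # naive check `"scale" in name` matched it by accident.
        self.layer_gamma = [1e-6] * dim


def scale_like_params(module):
    """Parameter names a naive detector would classify as quant scales."""
    return [name for name in vars(module) if "scale" in name]


block = ConvBlock(dim=4)
print(scale_like_params(block))  # after the rename: no false matches
```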

Expected Behavior & Potential Risk

Expected Behavior:

  • Models using the renamed layer_gamma parameter will successfully complete quantization
  • Existing scale detection logic works correctly with the new parameter name
  • Debug logs provide better visibility into scale parameter detection

Potential Risks:

  • Models explicitly looking for layer_scale parameter name might need updates
  • Need to communicate the parameter rename in documentation
  • May need to provide migration guide for existing models

How has this PR been tested?

Tests have been implemented in test/adaptor/test_pytorch_layer_scale.py with two test cases:

  1. test_layer_scale_error: Verifies the original issue by confirming that models with layer_scale parameter fail quantization with the expected error
  2. test_layer_gamma_success: Validates that models using the new layer_gamma parameter successfully complete quantization

Test Environment:

  • Python 3
  • PyTorch latest version
  • Neural Compressor latest version
  • Test models: ConvEncoder architecture with both original and fixed parameter names
  • Input tensor shape: (1, 64, 32, 32)

To reproduce:

cd neural-compressor
python3 test/adaptor/test_pytorch_layer_scale.py

Dependency Change?

No new dependencies were introduced or removed.

  • Uses existing PyTorch and Neural Compressor infrastructure
  • All changes are internal to the codebase

@thuang6 thuang6 requested review from xin3he, XuehaoSun and Copilot and removed request for xin3he April 14, 2025 14:13
Contributor

@Copilot Copilot AI left a comment


Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

Comments suppressed due to low confidence (1)

neural_compressor/adaptor/pytorch.py:4175

  • The current filter condition excludes any parameter containing 'gamma', which might unintentionally filter out valid parameters. Consider checking explicitly for 'layer_scale' and 'layer_gamma' instead to avoid potential misclassification.
if "scale" in node.target and not any(exclude in node.target for exclude in ["layer_scale", "gamma"]):
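Copilot's suggestion amounts to replacing the broad "gamma" substring with an explicit exclusion list, so unrelated parameters that merely contain "gamma" are not misclassified. A sketch of that check (the target names below are illustrative):

```python
# Explicit exclusions rather than a broad "gamma" substring match, per the
# Copilot review suggestion. Only these two known parameter names are
# filtered out of scale detection.
EXCLUDED_NAMES = ("layer_scale", "layer_gamma")


def looks_like_quant_scale(target: str) -> bool:
    """Name-based check resembling the filter in the PR's diff."""
    return "scale" in target and not any(ex in target for ex in EXCLUDED_NAMES)


print(looks_like_quant_scale("conv1.weight_scale"))  # True: a real quant scale
print(looks_like_quant_scale("block.layer_scale"))   # False: excluded explicitly
print(looks_like_quant_scale("bn.gamma"))            # False: no "scale" substring
```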

@xin3he
Contributor

xin3he commented Apr 15, 2025

Hi @ved1beta, thanks for your commit.
I notice that the unexpected case is when the scale is a tensor list, not a single value. If so, we can fix it in a better way.

Suggestion:
float(getattr(model, node.target)) -> getattr(model, node.target).tolist()  # covers both a single value and a list of values

BTW, currently our CI is blocked by #2171. To resolve the CI issue, please update your branch after #2171 is merged.
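The `tolist()` suggestion above can be illustrated without torch using a toy stand-in (`FakeTensor` is not `torch.Tensor`; it only mimics the relevant conversion behavior):

```python
# Toy illustration of the suggested fix: float() only works for one-element
# tensors, while tolist() handles both the scalar and the per-channel
# (list-valued) case.

class FakeTensor:
    def __init__(self, values):
        self.values = list(values)

    def __float__(self):
        if len(self.values) != 1:
            # Mirrors torch's error that broke quantization on
            # multi-element layer_scale parameters.
            raise ValueError(
                "only one element tensors can be converted to Python scalars"
            )
        return float(self.values[0])

    def tolist(self):
        # torch's tolist() returns a scalar for 0-dim tensors and a nested
        # list otherwise; this stand-in always returns a flat list.
        return list(self.values)


scalar = FakeTensor([0.5])
per_channel = FakeTensor([0.5, 0.25, 0.125])

print(float(scalar))         # 0.5
print(per_channel.tolist())  # [0.5, 0.25, 0.125]
# float(per_channel) would raise the original conversion error
```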

pre-commit-ci bot and others added 4 commits April 17, 2025 10:01
* [pre-commit.ci] pre-commit autoupdate

updates:
- [github.com/pycqa/isort: 5.13.2 → 6.0.1](PyCQA/isort@5.13.2...6.0.1)
- [github.com/psf/black.git: 24.10.0 → 25.1.0](https://github.com/psf/black.git/compare/24.10.0...25.1.0)
- [github.com/codespell-project/codespell: v2.3.0 → v2.4.1](codespell-project/codespell@v2.3.0...v2.4.1)
- [github.com/astral-sh/ruff-pre-commit: v0.8.6 → v0.11.4](astral-sh/ruff-pre-commit@v0.8.6...v0.11.4)

Signed-off-by: Sun, Xuehao <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: changwangss <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: xin3he <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
@ved1beta
Contributor Author

Hey, I made the required changes but ran into a merge conflict; fixing it now.

@xin3he xin3he merged commit f0812a1 into intel:master Apr 23, 2025
31 checks passed
@xin3he
Contributor

xin3he commented Apr 23, 2025

Thanks, @ved1beta, merged
