
Conversation

Contributor

@mengniwang95 mengniwang95 commented Dec 18, 2025

  • LLMC uses the quantize_block API to quantize a block, where iters may be 0; add handling for dump_info in _quantize_block (see the sketch after this list)
  • LLMC passes a decoding layer into quantize_block, which doesn't have a config attribute
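
A minimal sketch of the kind of guards described above, assuming a hypothetical surrounding structure. Only the names `quantize_block`, `_quantize_block`, `iters`, `dump_info`, and the `config` attribute come from this PR; the function signature, loop body, and fallback logic below are illustrative, not the actual AutoRound/LLMC implementation:

```python
# Illustrative sketch only; not the actual AutoRound/LLMC code.

def _quantize_block(block, iters=0):
    """Quantize one block, tolerating iters == 0 and layers without a config."""
    best_loss = None  # stays None when no tuning iterations run (iters == 0)

    for _ in range(iters):
        # ... tuning loop that would normally update best_loss ...
        best_loss = 0.0  # placeholder for real loss tracking

    # Guard the summary so dumping info does not fail when iters == 0.
    if best_loss is not None:
        dump_info = f"quantized block, best loss: {best_loss:.6f}"
    else:
        dump_info = "quantized block (no tuning iterations were run)"
    print(dump_info)

    # A decoding layer passed in by LLMC may lack a `config` attribute,
    # so fall back to None instead of reading block.config directly.
    block_config = getattr(block, "config", None)
    return block, block_config
```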

Contributor

yiliu30 commented Dec 19, 2025

@XuehaoSun @chensuyue Is it possible to cherry‑pick this into 0.9.3, so we can use it when we upgrade AutoRound on the LLMC side?
cc @thuang6

@yiliu30 yiliu30 requested review from n1ck-guo and yiliu30 December 19, 2025 00:33
Contributor

@yiliu30 yiliu30 left a comment


code change LGTM

@mengniwang95
Contributor Author

@n1ck-guo please take a look at the template part of the code change

@chensuyue chensuyue added this to the 0.9.3 milestone Dec 22, 2025
@chensuyue chensuyue merged commit 3c88b3b into main Dec 23, 2025
28 checks passed
@chensuyue chensuyue deleted the mengni/llmc_llama4 branch December 23, 2025 06:16
chensuyue pushed a commit that referenced this pull request Dec 23, 2025
Signed-off-by: Mengni Wang <[email protected]>
(cherry picked from commit 3c88b3b)
