Migrate export_llama to new ao quantize API #8422

jackzhxng · 2025-02-12T18:01:18Z

🚀 The feature, motivation and pitch

Int8DynActInt4WeightQuantizer, which backs -qmode 8da4w, is no longer being developed by ao and doesn't support bias. Migrate to the new quantize_ API, which can take in int8_dynamic_activation_int4_weight.
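
For illustration, here is a minimal sketch of what the migrated call might look like. It is not the code from export_llama. It assumes a torchao release that exports quantize_ and int8_dynamic_activation_int4_weight from torchao.quantization (newer releases may expose the same configuration under a different name). The helper names quantize_linear_8da4w and demo, the group size of 32, and the toy model are hypothetical and chosen only for the example.

```python
# Hypothetical default; export_llama takes the group size from its own flags.
DEFAULT_GROUP_SIZE = 32


def quantize_linear_8da4w(model, group_size=DEFAULT_GROUP_SIZE):
    """Quantize the linear layers of `model` in place with 8da4w.

    8da4w = int8 dynamic activations + int4 weights.
    """
    # Imported lazily so this file can be imported without torchao installed.
    from torchao.quantization import (
        int8_dynamic_activation_int4_weight,
        quantize_,
    )

    # quantize_ mutates the model in place, swapping linear weights for
    # quantized tensor subclasses; it does not return a new module.
    quantize_(model, int8_dynamic_activation_int4_weight(group_size=group_size))
    return model


def demo():
    """Toy usage; requires torch and torchao."""
    import torch

    # in_features (64) is a multiple of the group size (32).
    # nn.Linear has a bias by default, which is the case the old
    # quantizer did not handle according to the issue text.
    model = torch.nn.Sequential(torch.nn.Linear(64, 32))
    quantize_linear_8da4w(model)
    return model(torch.randn(2, 64))
```

Calling demo() runs a forward pass through the quantized toy model. Whether the result matches the old Int8DynActInt4WeightQuantizer numerically is not something this sketch checks.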

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @mergennachin @iseeyuan @lucylq @helunwencser @tarun292 @kimishpatel @cccclai


mergennachin · 2025-02-12T21:59:55Z

@jackzhxng

see #5106 and D62223540 for some historical context

jackzhxng added the "module: examples" (Issues related to demos under examples/) and "module: llm" (Issues related to LLM examples and apps, and to the extensions/llm/ code) labels Feb 12, 2025

jackzhxng self-assigned this Feb 12, 2025

github-project-automation bot added this to ExecuTorch Core Feb 12, 2025

github-project-automation bot moved this to To triage in ExecuTorch Core Feb 12, 2025

iseeyuan added this to etLLM: LLMs via ExecuTorch Feb 13, 2025

lucylq moved this from To triage to In progress in ExecuTorch Core Feb 13, 2025

jackzhxng added the "triaged" (This issue has been looked at by a team member, and triaged and prioritized into an appropriate module) label Feb 14, 2025

jackzhxng mentioned this issue Feb 24, 2025

Switch to new ao quant api for 8da4w #8501

Merged

jackzhxng closed this as completed in #8501 Feb 25, 2025

github-project-automation bot moved this to Done in etLLM: LLMs via ExecuTorch Feb 25, 2025

github-project-automation bot moved this from In progress to Done in ExecuTorch Core Feb 25, 2025

jackzhxng reopened this Mar 3, 2025

github-project-automation bot moved this from Done to Backlog in ExecuTorch Core Mar 3, 2025

jackzhxng moved this from Backlog to In progress in ExecuTorch Core Mar 3, 2025

jackzhxng moved this from In progress to Backlog in ExecuTorch Core Mar 11, 2025

jackzhxng moved this from Backlog to In progress in ExecuTorch Core Mar 20, 2025

tarun292 moved this from In progress to Done in ExecuTorch Core Mar 27, 2025

tarun292 closed this as completed by moving to Done in ExecuTorch Core Mar 27, 2025
