
INT8 inference issues w8a8 #4

@meven3000

Description


Hi, I am trying to use the ZenTorch modules with int8 (W8A8) starting from an fp32 model. So far I have tried Mistral 7B v0.2 quantized with Quark, and now also a TorchAO-compiled W8A8 model.

The PyTorch W8A8 model works as expected without ZenTorch.

Tested with PyTorch and the associated ZenTorch versions 2.4 through 2.8.

PyTorch version 2 7 0 (w8a8).txt

basic in8 script.txt

See the attachments for the event log and the associated basic test script.

Note: I again assume this may be due to AVX512 being enforced for int8 (which probably should not be required in this case), but there is certainly no mention of it in the log this time.
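For reference, here is a minimal stdlib-only sketch to check whether the CPU actually advertises the AVX512 features that int8 kernels typically use. This is an assumption on my part (that ZenTorch's int8 path keys off `avx512f`/`avx512_vnni`); it only reads `/proc/cpuinfo`, so it is Linux-specific:

```python
# Hedged sketch: report whether the CPU exposes the AVX512 flags that
# int8 inference kernels commonly rely on. Assumptions: Linux, and that
# "avx512f" (foundation) plus "avx512_vnni" (int8 dot-product) are the
# relevant flags for ZenTorch's int8 path.
import platform


def cpu_flags():
    """Return the set of CPU feature flags from /proc/cpuinfo (empty if unreadable)."""
    flags = set()
    try:
        with open("/proc/cpuinfo") as f:
            for line in f:
                if line.startswith("flags"):
                    flags.update(line.split(":", 1)[1].split())
                    break
    except OSError:
        pass  # non-Linux or restricted environment
    return flags


def avx512_support(flags=None):
    """Return (has_avx512f, has_avx512_vnni) for the given flag set."""
    if flags is None:
        flags = cpu_flags()
    return ("avx512f" in flags, "avx512_vnni" in flags)


if __name__ == "__main__":
    f, vnni = avx512_support()
    print(f"avx512f={f} avx512_vnni={vnni} on {platform.machine()}")
```

If this reports `avx512f=False`, that would support the theory that an enforced AVX512 requirement (rather than the quantization format itself) is what breaks the int8 path here.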
