
Align _choose_scale_float8's use of block_size with other code #3324


Description

@jerryzh168

Recently I found that `_choose_scale_float8` handles per-tensor quantization with a `len(block_size) == 0` check:

```python
if len(block_size) == 0:
```

and uses `keepdim=True` for the other granularities:

```python
max_abs = tensor_reshaped.abs().amax(dim=reduction_dims, keepdim=True)
```
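As a minimal standalone sketch (the shapes and reduction dims here are made up for illustration, not taken from the torchao source), `keepdim=True` keeps the reduced dims around as size-1 dims:

```python
import torch

x = torch.randn(4, 8)
# per-row reduction: the reduced dim is kept as a size-1 dim
max_abs = x.abs().amax(dim=[1], keepdim=True)
print(max_abs.shape)  # torch.Size([4, 1])
```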

Meanwhile `_choose_qparams_affine` (the int op) uses `keepdim=False`:

```python
min_val = torch.amin(input, dim=reduction_dims, keepdim=False)
max_val = torch.amax(input, dim=reduction_dims, keepdim=False)
```

which makes per-tensor quantization produce a scalar scale automatically.
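Again as a standalone sketch with made-up shapes, `keepdim=False` drops the reduced dims, so reducing over every dim yields a 0-dim scalar with no special casing:

```python
import torch

x = torch.randn(4, 8)
# per-row: the reduced dim is dropped instead of kept as size 1
per_row = torch.amax(x, dim=[1], keepdim=False)
print(per_row.shape)  # torch.Size([4])
# per-tensor: reducing over every dim yields a 0-dim scalar tensor
per_tensor = torch.amax(x, dim=[0, 1], keepdim=False)
print(per_tensor.shape)  # torch.Size([])
```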

I think we should fix this discrepancy to reduce confusion by aligning `_choose_scale_float8` with the int op, i.e. change `keepdim=True` to `keepdim=False` and make sure the tests still pass.
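Concretely, the aligned reduction would behave roughly like this (a sketch reusing the variable names from the float8 snippet above; the actual code will differ):

```python
import torch

# illustrative shapes, not the real torchao code
tensor_reshaped = torch.randn(4, 8)
reduction_dims = [0, 1]  # per-tensor: reduce over every dim
max_abs = tensor_reshaped.abs().amax(dim=reduction_dims, keepdim=False)
assert max_abs.dim() == 0  # scalar scale, matching _choose_qparams_affine
```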

We can also add some docs to both ops afterwards to clarify the expected behavior.
