F8 scaling #450

aacostadiaz · 2025-06-30T06:44:22Z

This PR adds support for scaling for FP8 GEMM.

# Conflicts: # include/cutlass/gemm/dispatch_policy.hpp

…8-scale

sanchitintel · 2025-06-30T16:40:38Z

Hi! It seems Float8 dequantization typically doesn't use zero-points. Is there some workload in which it might? Thanks!

include/cutlass/gemm/collective/xe_mma_fp8_scaling.hpp

aacostadiaz · 2025-06-30T18:37:24Z

Hi! It seems Float8 dequantization typically doesn't use zero-points. Is there some workload in which it might? Thanks!

Correct, zero point is mostly for integer quantisation. I'll remove it from the example although I'll keep the support in the pipeline for a future integration with mixed dtype.

aacostadiaz added 8 commits June 27, 2025 09:08

WIP

fb117ee

Za inf

5305b8a

remove shfl_sync

9d6c5cc

remove shfl_sync

938e91b

Prepare example

dd7cee6

Merge branch 'sycl-develop' into aacosta/f8-scale

dcffe4e

# Conflicts: # include/cutlass/gemm/dispatch_policy.hpp

static check

cb6b49f

fix print

269c270

aacostadiaz added the release label Jun 30, 2025

aacostadiaz added 5 commits June 30, 2025 11:02

Merge branch 'sycl-develop' into aacosta/f8-scale

0ffe3a0

more fixes

59130ba

Merge remote-tracking branch 'origin/aacosta/f8-scale' into aacosta/f…

4841498

…8-scale

Use 2d copy for scale and zeros

748d577

fix build

1f400be

aacostadiaz marked this pull request as ready for review June 30, 2025 10:16

aacostadiaz added 4 commits June 30, 2025 12:34

load fp8 as uint8

4423465

load fp8 as uint8

50ce669

add tests cases in the example

0ba0135

workaround IGC issue

eac0174

sanchitintel reviewed Jun 30, 2025

View reviewed changes

include/cutlass/gemm/collective/xe_mma_fp8_scaling.hpp Show resolved Hide resolved

mehdi-goli approved these changes Jun 30, 2025

View reviewed changes

aacostadiaz added 2 commits June 30, 2025 19:39

Remove zero-point

8559aa5

Merge branch 'sycl-develop' into aacosta/f8-scale

dd2428d

aacostadiaz merged commit 3da91e1 into codeplaysoftware:sycl-develop Jun 30, 2025
12 of 14 checks passed

t4c1 mentioned this pull request Jul 2, 2025

First version of FP8 scaled_mm. #428

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

F8 scaling #450

F8 scaling #450

Uh oh!

aacostadiaz commented Jun 30, 2025 •

edited

Loading

Uh oh!

sanchitintel commented Jun 30, 2025

Uh oh!

Uh oh!

aacostadiaz commented Jun 30, 2025

Uh oh!

Uh oh!

Uh oh!

F8 scaling #450

F8 scaling #450

Uh oh!

Conversation

aacostadiaz commented Jun 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sanchitintel commented Jun 30, 2025

Uh oh!

Uh oh!

aacostadiaz commented Jun 30, 2025

Uh oh!

Uh oh!

Uh oh!

aacostadiaz commented Jun 30, 2025 •

edited

Loading