
Commit a2e866a

Dhruva Kaushal authored and facebook-github-bot committed
turning on prefill (llama) shapes for fp8_gemm_rowwise b200 dashboard
Summary: Minor change to select the correct shapes depending on the args for b200 fp8_gemm_rowwise_prefill.

Reviewed By: minjang

Differential Revision: D73976271

fbshipit-source-id: 1b52c0e570554cd4d8774be3725af354cf69c603
1 parent 5191581 commit a2e866a

1 file changed: +5 -1 lines changed


tritonbench/operators/fp8_gemm_rowwise/operator.py

Lines changed: 5 additions & 1 deletion
@@ -177,7 +177,11 @@ def __init__(
         self.use_tma = True
         self.no_use_persistent = False
         self.warp_specialization = False
-        self.shapes = BUILDIN_SHAPES
+        self.shapes = (
+            gemm_shapes(addmm_args.prefill)
+            if (addmm_args.llama)
+            else BUILDIN_SHAPES
+        )
 
     @register_benchmark(enabled=HAS_TRITON)
     def _triton(self, xq, wq, x_scale, w_scale) -> Callable:
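
The selection logic in this change can be exercised in isolation. The sketch below is only an illustration: `BUILDIN_SHAPES` and `gemm_shapes` are hypothetical stand-ins with made-up placeholder values (their real definitions are not part of this diff), `select_shapes` is a helper name introduced here, and `addmm_args` is assumed to be an object exposing boolean `llama` and `prefill` attributes.

```python
from types import SimpleNamespace

# Hypothetical stand-in for the default shape list; the (M, N, K) values are placeholders.
BUILDIN_SHAPES = [(1, 2304, 2048), (1, 8192, 16384)]


def gemm_shapes(prefill: bool):
    """Hypothetical stand-in for the real helper; returns placeholder (M, N, K) tuples."""
    if prefill:
        # Prefill-style workloads: large M (many tokens processed at once).
        return [(4096, 4096, 4096), (8192, 4096, 4096)]
    # Decode-style workloads: small M.
    return [(1, 4096, 4096), (16, 4096, 4096)]


def select_shapes(addmm_args):
    """Mirror the conditional from the diff: llama shapes if requested, else the defaults."""
    return (
        gemm_shapes(addmm_args.prefill)
        if addmm_args.llama
        else BUILDIN_SHAPES
    )


if __name__ == "__main__":
    print(select_shapes(SimpleNamespace(llama=True, prefill=True)))
    print(select_shapes(SimpleNamespace(llama=False, prefill=True)))
```

Note that `prefill` only takes effect when `llama` is set; without `llama` the defaults are used regardless of `prefill`, which matches the conditional as written in the diff.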
