You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Use bounds_check_indices v2 on ROCm (pytorch#3916)
Summary:
X-link: facebookresearch/FBGEMM#1005
This diff forces using bounds_check_indices v2 on ROCm because ROCm
has a constraint that the gridDim * blockDim has to be smaller than
2^32. The v1 kernel can be launched with gridDim * blockDim > 2^32
while the v2 kernel limits the gridDim size to 64 * # of SMs. Thus,
its gridDim * blockDim is guaranteed to be smaller than 2^32
Reviewed By: q10, jianyuh, joebos
Differential Revision: D72334377
0 commit comments