Conversation

@chengduoZH (Contributor) commented on May 2, 2018

Fixes #10323.

CUDA 8:

T __shfl(T var, int srcLane, int width=warpSize);
T __shfl_down(T var, unsigned int delta, int width=warpSize);   

CUDA 9:

T __shfl_sync(unsigned mask, T var, int srcLane, int width=warpSize);
T __shfl_down_sync(unsigned mask, T var, unsigned delta, int width=warpSize);
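
For reference, here is a minimal sketch of the kind of version-guarded wrapper that bridges the two APIs; the names CudaShuffleDownSync, FULL_WARP_MASK, and WarpReduceSum are illustrative assumptions, not necessarily the identifiers this PR introduces:

#include <cuda.h>  // for CUDA_VERSION

#if CUDA_VERSION < 9000
// CUDA 8: no mask argument; shuffles are implicitly warp-wide.
#define FULL_WARP_MASK 0u
template <typename T>
__forceinline__ __device__ T CudaShuffleDownSync(unsigned mask, T var,
                                                 unsigned delta,
                                                 int width = 32) {
  // mask is accepted for interface compatibility but ignored pre-CUDA 9.
  (void)mask;
  return __shfl_down(var, delta, width);
}
#else
// CUDA 9+: the *_sync variants require an explicit mask of participating lanes.
#define FULL_WARP_MASK 0xFFFFFFFFu
template <typename T>
__forceinline__ __device__ T CudaShuffleDownSync(unsigned mask, T var,
                                                 unsigned delta,
                                                 int width = 32) {
  return __shfl_down_sync(mask, var, delta, width);
}
#endif

// Example use: a warp-level sum reduction written once against the wrapper.
__device__ float WarpReduceSum(float val) {
  for (int offset = 16; offset > 0; offset >>= 1) {
    val += CudaShuffleDownSync(FULL_WARP_MASK, val, offset);
  }
  return val;
}

Keeping the mask in the wrapper's signature even on CUDA 8 means call sites compile unchanged on both toolkits; on CUDA 9+ the mask makes lane participation explicit, so the old intrinsics cannot simply be aliased without threading a mask through every caller.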

@chengduoZH force-pushed the fix_shfl_sync branch 2 times, most recently from 47e4a20 to 044f86d on May 2, 2018 03:29
@dzhwinter (Contributor) left a comment

LGTM

@dzhwinter (Contributor) commented

I forgot to change the wrapper in the old Paddle code, but how did it pass the CI test? So weird.

@chengduoZH chengduoZH merged commit 3222cf1 into PaddlePaddle:develop May 2, 2018

Successfully merging this pull request may close these issues.

Error: identifier "__shfl_sync" is undefined
