[Auto Parallel] [Cherry-pick] cherry-pick for auto parallel version custom ops (moe_combine, moe_gate_dispatch) #74645
Conversation
Your PR was submitted successfully. Thank you for contributing to this open-source project!
Codecov Report
❌ Your patch status has failed because the patch coverage (0.00%) is below the target coverage (90.00%). You can increase the patch coverage or adjust the target coverage.
Additional details and impacted files:
@@ Coverage Diff @@
## develop #74645 +/- ##
==========================================
Coverage ? 0.00%
==========================================
Files ? 6
Lines ? 271
Branches ? 0
==========================================
Hits ? 0
Misses ? 271
Partials ? 0
/re-run all-failed
LGTM
x: [S, H], S = b*s
gate_logits: [S, E]
outputs:
y: [E, C, H] is use_pad is true, else [S, K, H], currently only support
is use_pad is true
-> if use_pad is true
Will be fixed in a follow-up PR.
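For readers following the review thread, the shape contract quoted from the docstring can be sketched as a small shape-only helper. This is an illustration, not code from the PR: the function name `moe_gate_dispatch_shapes` is hypothetical, and reading `E` as the number of experts, `C` as per-expert capacity, `K` as the top-k count, `H` as the hidden size, and `b`, `s` as batch size and sequence length is an assumption based on common MoE naming rather than something the quoted docstring states.

```python
def moe_gate_dispatch_shapes(b, s, h, e, k, c, use_pad):
    """Hypothetical shape-only sketch of the quoted docstring.

    Assumed meanings (not confirmed by the source):
    b = batch size, s = sequence length, h = hidden size H,
    e = number of experts E, k = top-k K, c = capacity C.
    """
    S = b * s                      # x is flattened to [S, H] with S = b*s
    x_shape = (S, h)               # x: [S, H]
    gate_logits_shape = (S, e)     # gate_logits: [S, E]
    # y: [E, C, H] if use_pad is true, else [S, K, H]
    y_shape = (e, c, h) if use_pad else (S, k, h)
    return x_shape, gate_logits_shape, y_shape


if __name__ == "__main__":
    print(moe_gate_dispatch_shapes(2, 4, 8, 4, 2, 3, use_pad=True))
    print(moe_gate_dispatch_shapes(2, 4, 8, 4, 2, 3, use_pad=False))
```

The helper only tracks shapes. It does not perform any routing or say which `use_pad` setting the operator currently supports, since the quoted docstring line is cut off at that point.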
PR Category
Auto Parallel
PR Types
Others
Description
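To work around sharding along multiple dimensions, the original implementations of the custom operators moe_gate_dispatch and moe_combine for dynamic-graph manual parallelism and dynamic-graph semi-auto parallelism differ substantially in their outputs. This PR adds auto versions of the operators so that the semi-auto versions of the custom operators are temporarily implemented inside the framework. The manual and semi-auto versions will need to be unified into a single version later.
The specific diff is as follows: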