MTP related operator enhance & implement #74684
Your PR was submitted successfully. Thank you for contributing to this open-source project!
Codecov Report
❌ Your patch status has failed because the patch coverage (43.90%) is below the target coverage (90.00%). You can increase the patch coverage or adjust the target coverage.
Additional details and impacted files
@@            Coverage Diff             @@
##             develop   #74684   +/-   ##
==========================================
  Coverage           ?   43.90%
==========================================
  Files              ?        5
  Lines              ?       41
  Branches           ?        0
==========================================
  Hits               ?       18
  Misses             ?       23
  Partials           ?        0
/re-run all-failed
Per the custom operator specification, please add a version implemented as a composition of the framework's basic APIs.
- op : embedding_grad_add_to
  args : (Tensor token_indices, Tensor main_grad_, Tensor out_grad)
  output : Tensor(main_grad_out)
  infer_meta :
    func : UnchangedInferMeta
    param : [main_grad_]
  kernel :
    func : embedding_grad_add_to
    param : [token_indices, main_grad_, out_grad]
    data_type : main_grad_
  inplace : (main_grad_ -> main_grad_out)
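For readers unfamiliar with this kind of op, here is a minimal plain-Python sketch of what the signature above appears to describe: an in-place scatter-add of `out_grad` rows into the rows of `main_grad_` selected by `token_indices`. This reading is an assumption based only on the op name, its arguments, and the `inplace` mapping; the PR's actual kernel may differ. The function name `embedding_grad_add_to_reference` and the flattened 1-D index layout are hypothetical.

```python
def embedding_grad_add_to_reference(token_indices, main_grad, out_grad):
    """Assumed semantics: main_grad[token_indices[i]] += out_grad[i], in place.

    token_indices: list of int row ids (assumed flattened to 1-D)
    main_grad:     list of rows (vocab_size x hidden), modified in place
    out_grad:      list of rows (num_tokens x hidden)
    """
    if len(token_indices) != len(out_grad):
        raise ValueError("token_indices and out_grad must have the same length")
    for token, grad_row in zip(token_indices, out_grad):
        target_row = main_grad[token]
        if len(target_row) != len(grad_row):
            raise ValueError("hidden size mismatch between main_grad and out_grad")
        # Repeated token ids accumulate, which is what makes this a scatter-add.
        for j, g in enumerate(grad_row):
            target_row[j] += g
    # Return the same object, mirroring the (main_grad_ -> main_grad_out) mapping.
    return main_grad


main_grad = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
result = embedding_grad_add_to_reference(
    [1, 1, 2], main_grad, [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
)
```

Token id 1 appears twice in the usage lines, so both of its gradient rows are summed into row 1 of `main_grad`.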
Please move these into fused_ops.yaml; they don't look like standard operators.
OK, got it. I'll submit a fix PR today and run CI on it at the same time.
#74811 is the follow-up fix PR; it is in progress.
/re-run all-failed
/re-run approval
/re-run Static-Check
/re-run all-failed
* stash
* Added embedd_grad_add_to kernel
* fix openblas git
* fix banner
* Specialized cross_entropy_w_softmax in bfloat16 logit circumstances
* Fix bugs
* Add cross_entropy_with_softmax_bwd_w_downcast
* Finish optest
* fix miscs
* Optimized kernel performance
* fix miscs
* bypass optest in some invalid environments.
* Fix corner case
* forbid dcu bf16 dtype.
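The commit names above mention a softmax cross-entropy backward pass "with downcast" and a bfloat16 specialization. As a rough illustration only, the sketch below computes the standard gradient `(softmax(logits) - one_hot(label)) * loss_grad` in higher precision and then rounds each value to bfloat16. Whether the PR's kernel downcasts at this point, and to this dtype, is an assumption; `to_bfloat16` and `softmax_cross_entropy_backward` are hypothetical helper names, not Paddle APIs.

```python
import math
import struct


def to_bfloat16(x):
    """Round a finite float to the nearest bfloat16 value (ties to even)."""
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    # Add a bias so that dropping the low 16 bits rounds to nearest, ties to even.
    bias = 0x7FFF + ((bits >> 16) & 1)
    bits = ((bits + bias) >> 16) << 16
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def softmax_cross_entropy_backward(logits, labels, loss_grad):
    """Gradient w.r.t. logits, computed in Python floats, then downcast per element."""
    grads = []
    for row, label, g in zip(logits, labels, loss_grad):
        row_max = max(row)  # subtract the max for numerical stability
        exps = [math.exp(v - row_max) for v in row]
        total = sum(exps)
        grad_row = []
        for j, e in enumerate(exps):
            prob = e / total
            one_hot = 1.0 if j == label else 0.0
            grad_row.append(to_bfloat16((prob - one_hot) * g))
        grads.append(grad_row)
    return grads


grads = softmax_cross_entropy_backward([[0.0, 0.0]], [0], [1.0])
```

Doing the arithmetic in higher precision and rounding once at the end keeps the rounding error to a single step per element, which is the usual motivation for fusing a downcast into the backward kernel.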
PR Category
Operator Mechanism
PR Types
Performance
Description
Enhance and implement MTP-related operators.
This PR includes:
pcard-91067