Merged
2 changes: 2 additions & 0 deletions docs/ascend_tutorial/features/ascend_backend_features.md
@@ -272,3 +272,5 @@ class MindSpeedFeature:
| `actor_rollout_ref.actor.megatron.override_transformer_config.use_fused_swiglu` | Whether to use fused SwiGLU. Defaults to False. |
| `actor_rollout_ref.actor.megatron.override_transformer_config.num_layers_in_first_pipeline_stage` | Number of layers in the first pipeline stage. Defaults to None. |
| `actor_rollout_ref.actor.megatron.override_transformer_config.num_layers_in_last_pipeline_stage` | Number of layers in the last pipeline stage. Defaults to None. |

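Note: mbridge does not currently support running with VPP (virtual_pipeline_model_parallel_size) enabled, so VPP should only be used with mbridge turned off. Because verl currently enables mbridge by default, manually set mbridge to False whenever you use the VPP parameter.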
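
The constraint in the note can be checked before launching a job. The following is a minimal sketch only: `check_vpp_mbridge` is a hypothetical helper that is not part of verl, and the two key names (`...megatron.use_mbridge` and `...megatron.virtual_pipeline_model_parallel_size`) are assumptions made for illustration; confirm the actual option names against your verl version.

```python
# Hypothetical pre-launch check; not part of verl.
# Both key names below are assumptions used for illustration.
MBRIDGE_KEY = "actor_rollout_ref.actor.megatron.use_mbridge"
VPP_KEY = "actor_rollout_ref.actor.megatron.virtual_pipeline_model_parallel_size"


def check_vpp_mbridge(overrides: dict) -> None:
    """Raise ValueError if VPP is requested while mbridge is still enabled."""
    # verl enables mbridge by default, so a missing key is treated as True.
    mbridge_enabled = overrides.get(MBRIDGE_KEY, True)
    vpp_size = overrides.get(VPP_KEY)
    if vpp_size is not None and mbridge_enabled:
        raise ValueError(
            "VPP is set but mbridge is enabled; set mbridge to False to use VPP."
        )


# Passes: VPP is requested and mbridge is explicitly disabled.
check_vpp_mbridge({VPP_KEY: 2, MBRIDGE_KEY: False})
```

Treating a missing mbridge key as enabled mirrors the default described in the note, so forgetting to disable mbridge is reported instead of silently accepted.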