Support Megatron training backend for MoE training #203

@tyler-griggs

Description

To support large-scale multi-node MoE training, we will integrate the Megatron training backend.

Tasks:
