How much memory is needed to train Eagle3 for Qwen3-32b? #162

@IdoCohenTD

I am using the online training version, with the target and draft models on two different GPUs. How much memory did you need to run with your default parameters (batch size of 8, max_len of 2048)?
