-
Notifications
You must be signed in to change notification settings - Fork 969
Closed
Description
Please tell me about the relationship between GPU memory usage and the number of NCCL communicators.
- Does it mean that the more NCCL communicators are created, the more GPU memory will be used? Is it a linear relationship?
It is found here that when an NCCL communicator is established and allreduce communication is performed, the memory occupies about 1.6GB; when multiple NCCL communicators (>=3) are established, the memory occupies about 6GB. If a lot of nccl connections are established, will it be OOM?
- Is there any relationship between the memory usage of NCCL and the number of GPUs?
Is there any difference in the amount of memory used when hundreds of GPUs establish NCCL connections and dozens of GPUs establish NCCL connections?
Metadata
Metadata
Assignees
Labels
No labels