references/classification
1 file changed, +6 -1 lines changed
@@ -40,12 +40,17 @@ python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py\
### ResNext-101 32x8d
- On 8 nodes, each with 8 GPUs (for a total of 64 GPUS)
```
python -m torch.distributed.launch --nproc_per_node=8 --use_env train.py\
--model resnext101_32x8d --epochs 100
```
+ Note that the above command corresponds to a single node with 8 GPUs. If you use
+ a different number of GPUs and/or a different batch size, then the learning rate
+ should be scaled accordingly. For example, the pretrained model provided by
+ `torchvision` was trained on 8 nodes, each with 8 GPUs (for a total of 64 GPUs),
+ with `--batch_size 16` and `--lr 0.4`, instead of the current defaults,
+ which are respectively batch_size=32 and lr=0.1.
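As a rough illustration of the scaling the added note describes, the sketch below assumes a linear rule: the learning rate grows in proportion to the global batch size (number of GPUs times per-GPU batch size). The note does not state that rule explicitly, but it is consistent with the quoted numbers, since 8 GPUs x 32 at lr 0.1 and 64 GPUs x 16 at lr 0.4 differ by a factor of 4 in both global batch size and learning rate. The helper `scale_lr` and its parameter names are hypothetical and not part of `train.py`.

```python
def scale_lr(base_lr, base_gpus, base_batch_size, gpus, batch_size):
    """Scale a learning rate linearly with the global batch size.

    Assumption: linear scaling. The README note only says the learning
    rate "should be scaled accordingly"; this rule is one reading that
    matches the numbers it quotes.
    """
    base_global = base_gpus * base_batch_size  # reference global batch size
    new_global = gpus * batch_size             # target global batch size
    return base_lr * (new_global / base_global)


if __name__ == "__main__":
    # Defaults from the note: 1 node x 8 GPUs, batch_size=32, lr=0.1.
    # Pretrained setup from the note: 8 nodes x 8 GPUs, batch_size=16.
    lr = scale_lr(base_lr=0.1, base_gpus=8, base_batch_size=32,
                  gpus=64, batch_size=16)
    print(f"scaled lr: {lr:.3f}")
```

Under this assumption, the pretrained configuration (64 GPUs, per-GPU batch size 16) has a global batch size of 1024 against the default 256, so the default lr of 0.1 is multiplied by 4.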
### MobileNetV2
```