Skip to content

Commit da509d6

Browse files
cuichenxxueh-nv
authored andcommitted
update deepseek recipe default (NVIDIA-NeMo#12424)
Signed-off-by: Chen Cui <chcui@nvidia.com> Signed-off-by: Xue Huang <xueh@nvidia.com>
1 parent 827303c commit da509d6

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

nemo/collections/llm/recipes/deepseek_v3.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -90,7 +90,7 @@ def finetune_recipe(
9090
dir: Optional[str] = None,
9191
resume_path: str = "deepseek-ai/DeepSeek-V3-Base",
9292
name: str = "default",
93-
num_nodes: int = 4,
93+
num_nodes: int = 5,
9494
num_gpus_per_node: int = 8,
9595
peft_scheme: Optional[str] = 'lora',
9696
seq_length: Optional[int] = None,
@@ -120,7 +120,7 @@ def finetune_recipe(
120120
Examples:
121121
CLI usage:
122122
$ nemo llm finetune --factory deepseek_v3
123-
$ nemo llm finetune --factory "deepseek_v3(num_nodes=6, name='my_deepseek_v3_finetune')"
123+
$ nemo llm finetune --factory "deepseek_v3(num_nodes=5, name='my_deepseek_v3_finetune')"
124124
125125
Python API usage:
126126
>>> recipe = finetune_recipe(name="deepseek_v3_finetune", num_nodes=6)

0 commit comments

Comments
 (0)