[lora] enforce_eager=true slows down generation time dramatically with LoRA #665
Conversation
Bandaid fix to ensure enforce_eager=false always for LoRA runs
/gemini review
Code Review
This pull request introduces a workaround for a known performance issue in vLLM where using LoRA with enforce_eager=true causes significant slowdowns. The change automatically disables enforce_eager for LoRA runs when using the vLLM backend and issues a warning, which is a sensible approach. The implementation is clean and also correctly plumbs through the new fully_sharded_loras configuration option. My feedback includes a minor suggestion to improve the robustness of the backend check.
```python
engine_kwargs["fully_sharded_loras"] = cfg.generator.fully_sharded_loras

# TODO(devpatel): Bandaid solution, replace this once we have a better solution
# for LoRA performance degradation on the vLLM side
if cfg.generator.enforce_eager and cfg.generator.backend == "vllm":
```
For improved robustness, it's a good practice to make the backend check case-insensitive. The configuration value for backend could potentially be provided with different casings (e.g., 'vllm', 'VLLM'). Using .lower() will ensure this check works as expected in all cases.
```diff
-if cfg.generator.enforce_eager and cfg.generator.backend == "vllm":
+if cfg.generator.enforce_eager and cfg.generator.backend.lower() == "vllm":
```
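Putting the suggestion together with the warning-and-override behavior described in the PR, the full guard might look like the sketch below. The helper name, config shape, and logger setup are assumptions for illustration, not the PR's actual code:

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)

def apply_lora_eager_workaround(cfg, engine_kwargs):
    """Bandaid: LoRA with enforce_eager=true is dramatically slower on vLLM,
    so force eager mode off for vLLM LoRA runs and warn the user."""
    # Case-insensitive backend check, per the review suggestion.
    if cfg.generator.enforce_eager and cfg.generator.backend.lower() == "vllm":
        logger.warning(
            "enforce_eager=true degrades LoRA generation performance with the "
            "vLLM backend; overriding to enforce_eager=false for this run."
        )
        engine_kwargs["enforce_eager"] = False
    return engine_kwargs

# Example: a config with enforce_eager=true and backend spelled "VLLM"
# still triggers the override thanks to .lower().
cfg = SimpleNamespace(generator=SimpleNamespace(enforce_eager=True, backend="VLLM"))
print(apply_lora_eager_workaround(cfg, {}))
```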
[lora] enforce_eager=true slows down generation time dramatically with LoRA (NovaSky-AI#665)
Unfortunately, this appears to be a widely reported vLLM issue that has not yet been addressed. I've added some additional vLLM configuration flags (fully sharded LoRA) and double-checked that max_lora_rank equals the input LoRA rank, which was also reported as a potential cause.
For now, I've implemented a bandaid solution: we always set enforce_eager=false for LoRA runs and emit a warning, preventing slowdowns in all training runs. This is in line with vLLM's suggested fixes for the generator.
See vllm-project/vllm#13204 and vllm-project/vllm#9452
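The max_lora_rank sanity check mentioned in the description can be sketched as follows; the function and argument names are illustrative, not part of vLLM's API:

```python
def check_lora_rank(max_lora_rank: int, adapter_rank: int) -> None:
    """Fail fast if the engine's max_lora_rank does not match the adapter rank.

    A mismatch between max_lora_rank and the input LoRA rank has been reported
    as a potential cause of LoRA generation slowdowns in vLLM.
    """
    if max_lora_rank != adapter_rank:
        raise ValueError(
            f"max_lora_rank ({max_lora_rank}) should equal the input LoRA "
            f"rank ({adapter_rank}); mismatches have been linked to LoRA "
            f"performance degradation."
        )

# Example: ranks agree, so no exception is raised.
check_lora_rank(16, 16)
```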