do not submit - just provide comparison baseline by TimothySeah · Pull Request #63388 · ray-project/ray

TimothySeah · 2026-05-16T00:26:36Z

See #63309 for more details

Signed-off-by: Timothy Seah <tseah@anyscale.com>

gemini-code-assist

Code Review

This pull request modifies the ray.train.DataConfig in ray_dataloader_factory.py by hardcoding datasets_to_split to an empty list. Feedback indicates that this change disables default sharding, which may lead to incorrect performance metrics or resource issues in distributed training; it is recommended to make this setting configurable or provide a clear explanation for disabling sharding.

gemini-code-assist · 2026-05-16T00:27:21Z


    def get_ray_data_config(self) -> ray.train.DataConfig:
        return ray.train.DataConfig(
+            datasets_to_split=[],


Hardcoding datasets_to_split=[] disables the default sharding behavior in Ray Train. This causes every training worker to process the entire dataset rather than a shard, which is generally not the intended behavior for distributed training benchmarks and can lead to excessive resource consumption or incorrect performance metrics. If this is for a specific baseline comparison, it would be better to make this configurable in RayDataConfig or add a comment explaining why sharding is being disabled.

do not submit - just provide comparison baseline

31cf5b3

Signed-off-by: Timothy Seah <tseah@anyscale.com>

gemini-code-assist Bot reviewed May 16, 2026

View reviewed changes

TimothySeah mentioned this pull request May 16, 2026

[train] Implement DatasetManager #63309

Merged

TimothySeah closed this May 19, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

do not submit - just provide comparison baseline#63388

do not submit - just provide comparison baseline#63388
TimothySeah wants to merge 1 commit into
ray-project:masterfrom
TimothySeah:tseah/test-no-split-do-not-submit

TimothySeah commented May 16, 2026 •

edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

TimothySeah commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 16, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

TimothySeah commented May 16, 2026 •

edited

Loading