[Data] Partial fix for Dataset.context not being sealed after creation by raulchen · Pull Request #41569 · ray-project/ray

raulchen · 2023-12-01T23:20:24Z

Why are these changes needed?

Dataset.context should be sealed the first time the Dataset is created. But if a new operator is applied to the dataset, the new global DataContext will be saved again to the Dataset.

This bug prevents using different DataContexts for training and validation datasets in a training job.

Note this PR only fixes the issue when multiple datasets are created in the process but will be running in different processes. If they run in the same process, it's still a bug, see #41573.

Related issue number

#41573

Checks

I've signed off every commit(by using the -s flag, i.e., git commit -s) in this PR.
I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
- I've added any new APIs to the API Reference. For example, if I added a
  method in Tune, I've added it in doc/source/tune/api/ under the
  corresponding .rst file.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Signed-off-by: Hao Chen <chenh1024@gmail.com>

raulchen · 2023-12-01T23:22:00Z

python/ray/data/tests/test_streaming_integration.py

        ds2.take_all()


-def test_streaming_split_with_custom_data_context(


Moving this test to test_context_propagation, not changing the test code.

stephanie-wang · 2023-12-01T23:38:19Z

What about the places where we use DataContext.get_current() during planning, e.g., here? Don't we need to propagate the DataContext through to those?

raulchen · 2023-12-02T01:29:35Z

What about the places where we use DataContext.get_current() during planning, e.g., here? Don't we need to propagate the DataContext through to those?

Good point. So this PR can fix the case for training jobs, where be different datasets will be proposed to different processes (the SplitCoordinator actors) for execution.
But if they run in the driver process, it's still a bug. I created an issue #41573 for tracking.

stephanie-wang

Looks good, but can you update the PR description to make it clear what cases this does and does not cover?

raulchen added 2 commits December 1, 2023 15:07

Fix Dataset.context not being sealed

7ce830b

Signed-off-by: Hao Chen <chenh1024@gmail.com>

test_streaming_split

bc4731b

Signed-off-by: Hao Chen <chenh1024@gmail.com>

raulchen requested review from Zandew, amogkam, bveeramani, c21, ericl, scottjlee, scv119 and stephanie-wang as code owners December 1, 2023 23:20

raulchen commented Dec 1, 2023

View reviewed changes

raulchen assigned raulchen, stephanie-wang and c21 and unassigned raulchen Dec 1, 2023

stephanie-wang added the @author-action-required The PR author is responsible for the next step. Remove tag to send back to the reviewer. label Dec 1, 2023

raulchen mentioned this pull request Dec 2, 2023

[data][bug] Dataset.context not being sealed after creation #41573

Closed

stephanie-wang approved these changes Dec 4, 2023

View reviewed changes

raulchen changed the title ~~[Data] Fix Dataset.context not being sealed after creation~~ [Data] Partial fix for Dataset.context not being sealed after creation Dec 4, 2023

raulchen merged commit 1e691f0 into ray-project:master Dec 4, 2023

raulchen deleted the fix-plan-copy-context branch December 4, 2023 22:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] Partial fix for Dataset.context not being sealed after creation#41569

[Data] Partial fix for Dataset.context not being sealed after creation#41569
raulchen merged 2 commits intoray-project:masterfrom
raulchen:fix-plan-copy-context

raulchen commented Dec 1, 2023 •

edited

Loading

Uh oh!

raulchen Dec 1, 2023

Uh oh!

stephanie-wang commented Dec 1, 2023

Uh oh!

raulchen commented Dec 2, 2023

Uh oh!

stephanie-wang left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		ds2.take_all()


		def test_streaming_split_with_custom_data_context(

Conversation

raulchen commented Dec 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why are these changes needed?

Related issue number

Checks

Uh oh!

raulchen Dec 1, 2023

Choose a reason for hiding this comment

Uh oh!

stephanie-wang commented Dec 1, 2023

Uh oh!

raulchen commented Dec 2, 2023

Uh oh!

stephanie-wang left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

raulchen commented Dec 1, 2023 •

edited

Loading