[Data] Fix driver hang during streaming generator block metadata retrieval #1

Merged
snorkelopstesting2-coder merged 1 commit into main from pr-56451-fix/driver_hang on Oct 22, 2025

@snorkelopstesting2-coder
Contributor

Recreated from original PR: ray-project/ray#56451

Why are these changes needed?

This PR fixes a critical driver hang in Ray Data's streaming generator. The hang occurs when a task completes its computation and produces the block data, but the worker crashes before the corresponding metadata object is generated. The driver then blocks until the task's metadata has been successfully rebuilt. This severely degrades performance, especially in cluster environments with significant resource fluctuations.
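To make the failure mode concrete, here is a minimal standard-library sketch of the pattern described above. It is not the code changed in this PR and does not use any Ray API: `worker`, `read_block_and_metadata`, `run`, and the `metadata_timeout_s` parameter are hypothetical names chosen for illustration. The worker emits a block and then, unless it "crashes", the block's metadata. A reader that waits for the metadata without a bound would never return in the crash case, while a bounded wait lets the caller notice the missing metadata and react.

```python
import queue
import threading


def worker(out: queue.Queue, crash_before_metadata: bool) -> None:
    """Hypothetical worker: emits a block, then (unless it crashes) its metadata."""
    out.put(("block", [1, 2, 3]))
    if crash_before_metadata:
        return  # simulated crash: the metadata object is never produced
    out.put(("metadata", {"num_rows": 3}))


def read_block_and_metadata(out: queue.Queue, metadata_timeout_s: float):
    """Read the block, then wait a bounded time for its metadata.

    Calling out.get() with no timeout here would block forever in the
    crash case, which mirrors the hang described above.
    """
    _, block = out.get()
    try:
        _, metadata = out.get(timeout=metadata_timeout_s)
    except queue.Empty:
        metadata = None  # caller can retry or rebuild instead of hanging
    return block, metadata


def run(crash_before_metadata: bool):
    """Run the hypothetical worker to completion, then read its output."""
    out: queue.Queue = queue.Queue()
    t = threading.Thread(target=worker, args=(out, crash_before_metadata))
    t.start()
    t.join()
    return read_block_and_metadata(out, metadata_timeout_s=0.05)


if __name__ == "__main__":
    print(run(crash_before_metadata=False))
    print(run(crash_before_metadata=True))
```

In the crash case the reader still returns the block, paired with `None` in place of the metadata, so the caller can decide how to recover. How the actual fix handles this situation inside Ray Data is described in the original PR, not here.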

What was the problem?

**Specific s...
