[train][checkpoint] Add ray.train.get_all_reported_checkpoints method #1

Merged
snorkelopstesting1-a11y merged 1 commit into main from
pr-54555-tseah/get-checkpoints-counters
Oct 11, 2025
@snorkelopstesting1-a11y
Collaborator

Recreated from original PR: ray-project/ray#54555

Summary

This PR adds a ray.train.get_all_reported_checkpoints method that lets users retrieve, from within their training function, all the checkpoints they have reported so far.

This is different from Result in two ways:

  • It is called from the training function on the training worker instead of from the driver
  • It can be called while training is still in progress
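
A minimal usage sketch of the two points above, not taken from the PR itself. It assumes the new method can be called with no arguments and returns a sequence with one entry per checkpoint reported so far; the loop body, the num_epochs config key, and the placeholder file are hypothetical. ray.train.report and Checkpoint.from_directory are existing Ray Train calls.

```python
import os
import tempfile


def train_func(config=None):
    """Training function intended to run on a Ray Train worker, not the driver."""
    # Imported inside the function because this code executes on the worker.
    import ray.train
    from ray.train import Checkpoint

    # Hypothetical config key, used only for this illustration.
    num_epochs = (config or {}).get("num_epochs", 3)

    for epoch in range(num_epochs):
        with tempfile.TemporaryDirectory() as tmpdir:
            # Placeholder "model state"; a real loop would save weights here.
            with open(os.path.join(tmpdir, "epoch.txt"), "w") as f:
                f.write(str(epoch))
            ray.train.report(
                {"epoch": epoch},
                checkpoint=Checkpoint.from_directory(tmpdir),
            )

        # Assumption: no required arguments, and the return value is a
        # sequence with one entry per checkpoint reported so far. This call
        # happens while training is still in progress.
        reported = ray.train.get_all_reported_checkpoints()
        print(f"epoch {epoch}: {len(reported)} checkpoint(s) reported so far")
```

A function like this would typically be passed to a Ray Train trainer as its per-worker training loop. By contrast, a Result is only available on the driver once the trainer's fit call has returned.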

Implementation Notes

The main idea is to use a worker-si...

@snorkelopstesting1-a11y snorkelopstesting1-a11y merged commit c51d2da into main Oct 11, 2025