[train][checkpoint] Add ray.train.get_all_reported_checkpoints method #54555
Merged
matthewdeng merged 32 commits into ray-project:master from TimothySeah:tseah/get-checkpoints-counters on Sep 2, 2025
Changes from 1 commit
Commits
104cb2b CheckpointManager and Worker both count checkpoints (TimothySeah)
7410f37 Merge remote-tracking branch 'upstream/master' into tseah/get-checkpo… (TimothySeah)
edf70f6 Do not assert str equality (TimothySeah)
6158667 Finish implementation + add 1 unit test (not done yet) (TimothySeah)
2f29f68 add test_data_parallel_trainer test case (TimothySeah)
26c5535 rename TrainingResult to ReportedCheckpoint + add publicapi docstring (TimothySeah)
08b318c fix v1 import (TimothySeah)
e2617f0 move notify to async method (TimothySeah)
2132dd6 Address some comments e.g. do not save report count in state, mock ac… (TimothySeah)
0e4f5d0 address more comments (TimothySeah)
3ef882b Merge remote-tracking branch 'upstream/master' into tseah/get-checkpo… (TimothySeah)
66fd5a8 remove unnecessary asyncio and make controller actor fully required (TimothySeah)
ffe4397 WorkerGroup passes current actor to workers + mocking to test that (TimothySeah)
5f6af2c document corner case (TimothySeah)
6a2e824 [train][doc] Document get_all_reported_checkpoints and ReportedCheckp… (TimothySeah)
fa71eba Merge pull request #1 from TimothySeah/tseah/get-all-reported-checkpo… (TimothySeah)
1990be3 address pr comments and fix ci failure (TimothySeah)
7767384 add comment to num_report_calls as suggested (TimothySeah)
2cd9412 remove unnecessary line (TimothySeah)
56a0537 Merge remote-tracking branch 'upstream/master' into tseah/get-checkpo… (TimothySeah)
e3a0b02 try adding publicapi annotation (TimothySeah)
c8dfdc5 address pr comments (TimothySeah)
642ebcc try different import (TimothySeah)
f8ded64 remove doc changes; will add in future pr (TimothySeah)
0f0eccb Merge remote-tracking branch 'upstream/master' into tseah/get-checkpo… (TimothySeah)
1bb190c fix unit tests (TimothySeah)
f9b75f0 always import ray.train._checkpoint.Checkpoint (TimothySeah)
4b79eaf try adding docs again (TimothySeah)
e705cf1 Revert "always import ray.train._checkpoint.Checkpoint" (TimothySeah)
6f1c122 Use TYPE_CHECKING on Checkpoint, ReportedCheckpoint, and some related… (TimothySeah)
17f9503 remove outdated pydoclint errors (TimothySeah)
8d06d4c Revert "try adding docs again" (TimothySeah)
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| | | `@@ -0,0 +1,17 @@` |
| | | `from dataclasses import dataclass` |
| | | `from typing import Any, Dict` |
| | | |
| | | `from ray.train import Checkpoint` |
| | | |
| | | |
| | | `@dataclass` |
| | | `class ValidatedCheckpoint:` |
| | | `    """A user-reported checkpoint and its associated metrics.` |
| | | |
| | | `    Attributes:` |
| | | `        checkpoint: The checkpoint reported by the user.` |
| | | `        metrics: The metrics associated with that checkpoint.` |
| | | `    """` |
| | | |
| | | `    checkpoint: Checkpoint` |
| | | `    metrics: Dict[str, Any]` |
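
For readers skimming the diff, here is a minimal sketch of how a list of such checkpoint/metrics records might be consumed. It does not call Ray: `StandInCheckpoint` is a hypothetical placeholder for `ray.train.Checkpoint`, `best_by_metric` is an illustrative helper that is not part of this PR, and the paths and loss values are made up for the example.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass
class StandInCheckpoint:
    """Hypothetical placeholder for ray.train.Checkpoint; it only holds a path."""

    path: str


@dataclass
class ValidatedCheckpoint:
    """Same two fields as the dataclass in the diff, using the stand-in type."""

    checkpoint: StandInCheckpoint
    metrics: Dict[str, Any]


def best_by_metric(
    reported: List[ValidatedCheckpoint], key: str, mode: str = "min"
) -> ValidatedCheckpoint:
    """Illustrative helper (an assumption, not Ray API): pick the best record.

    Records whose metrics lack `key` are skipped.
    """
    if mode not in ("min", "max"):
        raise ValueError("mode must be 'min' or 'max'")
    candidates = [r for r in reported if key in r.metrics]
    if not candidates:
        raise ValueError(f"no reported checkpoint has metric {key!r}")
    pick = min if mode == "min" else max
    return pick(candidates, key=lambda r: r.metrics[key])


# Made-up data standing in for whatever a training run reported.
reported = [
    ValidatedCheckpoint(StandInCheckpoint("/tmp/ckpt_0"), {"loss": 0.9}),
    ValidatedCheckpoint(StandInCheckpoint("/tmp/ckpt_1"), {"loss": 0.4}),
    ValidatedCheckpoint(StandInCheckpoint("/tmp/ckpt_2"), {"loss": 0.6}),
]

best = best_by_metric(reported, "loss")
print(best.checkpoint.path, best.metrics)
```

Keeping the checkpoint and its metrics together in one record means a caller can rank or filter checkpoints without a separate lookup from checkpoint to metrics.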