[tune] Enable Train v2 in doc examples #56820
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Code Review
This pull request focuses on updating documentation examples to enable and use the new Ray Tune v2 API. The changes include migrating from older APIs like session.report to tune.report, updating checkpointing logic, and refactoring examples to use tune.Tuner instead of older patterns. Overall, the changes are consistent and align with the goal of adopting the v2 API. However, I've found one critical issue where a test is being enabled for a notebook file that is being deleted in this same PR, which will break the build.
doc/source/tune/examples/BUILD.bazel
Outdated
    exclude = [
        "pbt_ppo_example.ipynb",
        "tune-xgboost.ipynb",
        "lightgbm_example.ipynb",  # TODO: Uncomment after fixing Tune lightgbm callback.
Can you make sure this is tracked somewhere?
Can you include in PR description why we're removing Horovod here? Tune should still work directly with Horovod right? Is this just general cleanup since Horovod doesn't support py312?
this uses horovod trainer which we don't have anymore
Flip the flag for Tune doctest CI in preparation for turning on Train V2 by default. This doesn't have any behavior change, but this asserts that ray.train -> ray.tune updates have all happened.

Note that a few tests have been left behind due to Tune lightgbm and Keras callbacks not being updated yet. We need to do the equivalent of this PR: ray-project#54787

* `lightgbm_example`
* `lightgbm_example_cv`
* `tune_mnist_keras`

Deletes `horovod_simple.ipynb` example because we don't support `HorovodTrainer` anymore.

---------

Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Marco Stephan <marco@magic.dev>
Ports over the remaining unit tests that were marked as TODOs from this series of PRs: #57534, #57256, #56868, #56820, #56816.

Notably:

* `test_new_dataset_config -> test_data_integration`
* `test_backend -> test_torch_trainer, test_worker_group`
* `test_gpu -> test_torch_gpu`

This PR also finishes migrating the Tune LightGBM/Keras examples which were unblocked by #57042 and #57121.

---------

Signed-off-by: Justin Yu <justinvyu@anyscale.com>

Summary
Flip the flag for Tune doctest CI in preparation for turning on Train V2 by default. This doesn't have any behavior change, but this asserts that ray.train -> ray.tune updates have all happened.
Note that a few tests have been left behind due to Tune lightgbm and Keras callbacks not being updated yet. We need to do the equivalent of this PR: #54787
* `lightgbm_example`
* `lightgbm_example_cv`
* `tune_mnist_keras`

Deletes `horovod_simple.ipynb` example because we don't support `HorovodTrainer` anymore.
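
For readers unfamiliar with the `ray.train -> ray.tune` update this summary refers to, here is a minimal sketch of what a migrated Tune function trainable might look like. It is not taken from the PR's diff. The names `objective`, `trainable`, and `run_search` are hypothetical, and the sketch assumes a recent Ray release in which `ray.tune.report` is available.

```python
# Illustrative sketch only: `objective`, `trainable`, and `run_search`
# are hypothetical names, not code from this PR.


def objective(x: float) -> float:
    """Toy loss with its minimum at x = 3."""
    return (x - 3.0) ** 2


def trainable(config: dict) -> None:
    # Ray is imported inside the function so that this module can be
    # imported even where Ray is not installed.
    from ray import tune

    # An un-migrated example might have called ray.train.report(...) here;
    # the migrated form reports metrics through ray.tune instead.
    tune.report({"loss": objective(config["x"])})


def run_search(num_samples: int = 8):
    """Run a small random search and return the best result."""
    from ray import tune

    tuner = tune.Tuner(
        trainable,
        param_space={"x": tune.uniform(0.0, 6.0)},
        tune_config=tune.TuneConfig(
            metric="loss", mode="min", num_samples=num_samples
        ),
    )
    results = tuner.fit()
    return results.get_best_result()
```

Calling `run_search()` on a machine with Ray installed would launch the trials and return the best one. The reporting call inside `trainable` is the only line that changes in this kind of migration.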