Skip to content

Commit 15b4562

Browse files
ptrblckpmeieratalman
authored
Remove CUDA 11.7 builds; add 11.8 (#7616)
Co-authored-by: Philip Meier <[email protected]> Co-authored-by: atalman <[email protected]>
1 parent 25c8922 commit 15b4562

File tree

3 files changed

+8
-7
lines changed

3 files changed

+8
-7
lines changed

.github/workflows/prototype-tests-linux-gpu.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ jobs:
1818
- python-version: "3.8"
1919
runner: linux.g5.4xlarge.nvidia.gpu
2020
gpu-arch-type: cuda
21-
gpu-arch-version: "11.7"
21+
gpu-arch-version: "11.8"
2222
fail-fast: false
2323
uses: pytorch/test-infra/.github/workflows/linux_job.yml@main
2424
with:

.github/workflows/tests.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ jobs:
2424
- python-version: 3.8
2525
runner: linux.g5.4xlarge.nvidia.gpu
2626
gpu-arch-type: cuda
27-
gpu-arch-version: "11.7"
27+
gpu-arch-version: "11.8"
2828
fail-fast: false
2929
uses: pytorch/test-infra/.github/workflows/linux_job.yml@main
3030
with:
@@ -85,7 +85,7 @@ jobs:
8585
- python-version: "3.8"
8686
runner: windows.g5.4xlarge.nvidia.gpu
8787
gpu-arch-type: cuda
88-
gpu-arch-version: "11.7"
88+
gpu-arch-version: "11.8"
8989
fail-fast: false
9090
uses: pytorch/test-infra/.github/workflows/windows_job.yml@main
9191
with:

test/test_models.py

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -682,10 +682,11 @@ def test_classification_model(model_fn, dev):
682682
model_name = model_fn.__name__
683683
if SKIP_BIG_MODEL and is_skippable(model_name, dev):
684684
pytest.skip("Skipped to reduce memory usage. Set env var SKIP_BIG_MODEL=0 to enable test for this model")
685-
if model_name == "vit_h_14" and dev == "cuda":
686-
# TODO: investigate why this fail on CI. It doesn't fail on AWS cluster with CUDA 11.6
687-
# (can't test with later versions ATM)
688-
pytest.xfail("https://github.com/pytorch/vision/issues/7143")
685+
if model_name == "resnet101" and dev == "cuda":
686+
# TODO: Investigate the Failure with CUDA 11.8: https://github.com/pytorch/vision/issues/7618
687+
# TODO: Investigate/followup on previous failure: https://github.com/pytorch/vision/issues/7143
688+
# its not happening on CI with CUDA 11.8 anymore. Follow up is needed if its still not resolved.
689+
pytest.xfail("https://github.com/pytorch/vision/issues/7618")
689690
kwargs = {**defaults, **_model_params.get(model_name, {})}
690691
num_classes = kwargs.get("num_classes")
691692
input_shape = kwargs.pop("input_shape")

0 commit comments

Comments
 (0)