Skip to content

Conversation

wking
Copy link
Member

@wking wking commented Aug 13, 2025

These will probably not pass without more work, but checking to see how far away we are.

@openshift-ci openshift-ci bot added the do-not-merge/work-in-progress Indicates that a PR should not merge because it is a work in progress. label Aug 13, 2025
@openshift-ci openshift-ci bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 13, 2025
Copy link

openshift-trt bot commented Aug 14, 2025

Job Failure Risk Analysis for sha: fcebaa6

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-1of2 IncompleteTests
Tests for this run (15) are below the historical average (1580): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch 2 times, most recently from ae9e67d to b18aac0 Compare August 14, 2025 20:26
@openshift-ci openshift-ci bot removed the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 14, 2025
Copy link

openshift-trt bot commented Aug 15, 2025

Job Failure Risk Analysis for sha: b18aac0

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-1of2 IncompleteTests
Tests for this run (25) are below the historical average (1742): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Risk analysis has seen new tests most likely introduced by this PR.
Please ensure that new tests meet guidelines for naming and stability.

New Test Risks for sha: b18aac0

Job Name New Test Risk
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-ovn-ipv6-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit

New tests seen in this PR at sha: b18aac0

  • "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" [Total: 5, Pass: 0, Fail: 5, Flake: 0]

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from 34ec4d1 to 2a74ae2 Compare August 15, 2025 04:59
Copy link

openshift-trt bot commented Aug 15, 2025

Job Failure Risk Analysis for sha: 2a74ae2

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-1of2 IncompleteTests
Tests for this run (15) are below the historical average (1735): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from 2a74ae2 to 8924956 Compare August 15, 2025 18:44
Copy link

openshift-trt bot commented Aug 15, 2025

Job Failure Risk Analysis for sha: 8924956

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-vsphere-ovn IncompleteTests
Tests for this run (15) are below the historical average (3149): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-vsphere-ovn-upi IncompleteTests
Tests for this run (15) are below the historical average (3172): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch 2 times, most recently from 931699e to d7501ee Compare August 15, 2025 23:54
Copy link

openshift-trt bot commented Aug 16, 2025

Job Failure Risk Analysis for sha: d7501ee

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-aws-disruptive Medium
[sig-arch] events should not repeat pathologically for ns/openshift-kube-apiserver-operator
Potential external regression detected for High Risk Test analysis
---
[sig-cli][OCPFeatureGate:UpgradeStatus] oc amd upgrade status never fails
Potential external regression detected for High Risk Test analysis
---
[bz-Etcd] clusteroperator/etcd should not change condition/Available
Potential external regression detected for High Risk Test analysis
pull-ci-openshift-origin-main-e2e-aws-ovn-edge-zones Medium
[sig-instrumentation] Metrics should grab all metrics from kubelet /metrics/resource endpoint [Suite:openshift/conformance/parallel] [Suite:k8s]
This test has passed 93.13% of 2141 runs on release 4.20 [Overall] in the last week.

Open Bugs
e2e-aws-ovn-edge-zones is unstable
Kubelet metrics endpoint test regressed
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-upgrade IncompleteTests
pull-ci-openshift-origin-main-e2e-hypershift-conformance IncompleteTests
Tests for this run (19) are below the historical average (2886): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Risk analysis has seen new tests most likely introduced by this PR.
Please ensure that new tests meet guidelines for naming and stability.

New Test Risks for sha: d7501ee

Job Name New Test Risk
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit

New tests seen in this PR at sha: d7501ee

  • "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" [Total: 4, Pass: 0, Fail: 4, Flake: 0]

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch 2 times, most recently from 3d796c1 to 6f8ca39 Compare August 16, 2025 05:00
Copy link

openshift-trt bot commented Aug 16, 2025

Job Failure Risk Analysis for sha: 6f8ca39

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-azure IncompleteTests
Tests for this run (19) are below the historical average (3238): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-gcp-csi IncompleteTests
Tests for this run (19) are below the historical average (1627): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-hypershift-conformance IncompleteTests
Tests for this run (16) are below the historical average (2871): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-okd-scos-e2e-aws-ovn IncompleteTests
Tests for this run (14) are below the historical average (3232): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from 6f8ca39 to 4cbd5fe Compare August 16, 2025 17:10
Copy link

openshift-trt bot commented Aug 16, 2025

Job Failure Risk Analysis for sha: 4cbd5fe

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-hypershift-conformance IncompleteTests
Tests for this run (19) are below the historical average (2841): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-okd-scos-e2e-aws-ovn IncompleteTests
Tests for this run (101) are below the historical average (3198): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Risk analysis has seen new tests most likely introduced by this PR.
Please ensure that new tests meet guidelines for naming and stability.

New Test Risks for sha: 4cbd5fe

Job Name New Test Risk
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-ovn-ipv6-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit

New tests seen in this PR at sha: 4cbd5fe

  • "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" [Total: 5, Pass: 0, Fail: 5, Flake: 0]

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from 4cbd5fe to b9b2c30 Compare August 18, 2025 06:23
Copy link

openshift-trt bot commented Aug 18, 2025

Job Failure Risk Analysis for sha: b9b2c30

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-aws-disruptive High
[bz-Monitoring] clusteroperator/monitoring should not change condition/Available
This test has passed 99.94% of 4943 runs on release 4.20 [Overall] in the last week.
---
[bz-Monitoring] clusteroperator/monitoring should not change condition/Degraded
This test has passed 99.92% of 4943 runs on release 4.20 [Overall] in the last week.
---
[sig-arch][Late] operators should not create watch channels very often
This test has passed 99.94% of 4669 runs on release 4.20 [Overall] in the last week.
pull-ci-openshift-origin-main-e2e-hypershift-conformance IncompleteTests
Tests for this run (19) are below the historical average (2775): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-okd-scos-e2e-aws-ovn IncompleteTests
Tests for this run (101) are below the historical average (3143): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Risk analysis has seen new tests most likely introduced by this PR.
Please ensure that new tests meet guidelines for naming and stability.

New Test Risks for sha: b9b2c30

Job Name New Test Risk
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-ovn-ipv6-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit

New tests seen in this PR at sha: b9b2c30

  • "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" [Total: 5, Pass: 0, Fail: 5, Flake: 0]

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from b9b2c30 to ccb493b Compare August 19, 2025 22:59
Copy link

openshift-trt bot commented Aug 20, 2025

Job Failure Risk Analysis for sha: ccb493b

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-hypershift-conformance IncompleteTests
Tests for this run (16) are below the historical average (2771): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-okd-scos-e2e-aws-ovn IncompleteTests
Tests for this run (14) are below the historical average (2898): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from ccb493b to eefe2bb Compare August 20, 2025 02:39
Copy link

openshift-trt bot commented Aug 20, 2025

Job Failure Risk Analysis for sha: eefe2bb

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-agnostic-ovn-cmd IncompleteTests
Tests for this run (19) are below the historical average (1767): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-csi IncompleteTests
Tests for this run (18) are below the historical average (1869): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (18) are below the historical average (1140): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn IncompleteTests
Tests for this run (18) are below the historical average (3177): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-cgroupsv2 IncompleteTests
Tests for this run (18) are below the historical average (3279): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-edge-zones IncompleteTests
Tests for this run (19) are below the historical average (3322): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-fips IncompleteTests
Tests for this run (19) are below the historical average (3296): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-kube-apiserver-rollout IncompleteTests
Tests for this run (18) are below the historical average (1811): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift IncompleteTests
Tests for this run (16) are below the historical average (1810): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-microshift-serial IncompleteTests
Tests for this run (16) are below the historical average (860): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-1of2 IncompleteTests
Tests for this run (18) are below the historical average (1946): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 IncompleteTests
Tests for this run (18) are below the historical average (1942): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node IncompleteTests
Tests for this run (18) are below the historical average (3066): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial IncompleteTests
Tests for this run (18) are below the historical average (1847): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-upgrade IncompleteTests
Tests for this run (19) are below the historical average (4136): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-ovn-upgrade IncompleteTests
Tests for this run (20) are below the historical average (1916): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-proxy IncompleteTests
Tests for this run (19) are below the historical average (3278): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-azure IncompleteTests
Tests for this run (19) are below the historical average (3220): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-gcp-csi IncompleteTests
Tests for this run (19) are below the historical average (1627): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-gcp-ovn IncompleteTests
Tests for this run (19) are below the historical average (2933): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Showing 20 of 40 jobs analysis

@wking wking force-pushed the test-oc-adm-upgrade-recommend-with-precheck-and-accept branch from eefe2bb to 93f5c58 Compare August 21, 2025 03:45
Copy link

openshift-trt bot commented Aug 21, 2025

Risk analysis has seen new tests most likely introduced by this PR.
Please ensure that new tests meet guidelines for naming and stability.

New Test Risks for sha: 93f5c58

Job Name New Test Risk
pull-ci-openshift-origin-main-e2e-aws-ovn-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-aws-ovn-single-node-serial High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit
pull-ci-openshift-origin-main-e2e-metal-ipi-serial-ovn-ipv6-2of2 High - "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" is a new test that failed 1 time(s) against the current commit

New tests seen in this PR at sha: 93f5c58

  • "[Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with an accepted conditional recommendation to the --version target [Suite:openshift/conformance/serial]" [Total: 5, Pass: 0, Fail: 5, Flake: 0]

@wking wking changed the title WIP: test/extended/cli/adm_upgrade/recommend: Enable precheck and accept OTA-1559: test/extended/cli/adm_upgrade/recommend: Enable precheck and accept Aug 21, 2025
@wking
Copy link
Member Author

wking commented Sep 4, 2025

/payload-aggregate periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-serial 4

Copy link
Contributor

openshift-ci bot commented Sep 4, 2025

@wking: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command

  • periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/e0a1da90-89cd-11f0-8eef-a92312a06c06-0

@wking
Copy link
Member Author

wking commented Sep 4, 2025

Continuing to run up additional serial numbers, to see how reliable the new code is. Sticking with the aggregate-of-four batching, to avoid swamping CI-infra capacity, but also not waiting for the previous round of payload runs to wrap up, as I try to balance speedy-sample-size-growth against infra-load:

/payload-aggregate periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-serial 4

Copy link
Contributor

openshift-ci bot commented Sep 4, 2025

@wking: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command

  • periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/c4dd0710-89de-11f0-944e-baf715a5082d-0

@wking
Copy link
Member Author

wking commented Sep 4, 2025

e2e-aws-ovn-serial-2of2 failed, but on an install-time CI-registry 429, and not on anything related to this pull request.

/retest-required

…False

In 44cd78a (test/extended/cli/adm_upgrade/recommend: Account for
Upgradeable=False, 2025-08-05, openshift#30113), I'd updated the regular
expression to accept:

  Reason: MultipleReasons

for clusters that had both Upgradeable=False and conditional risks
going on.  In 7724a75 (test/extended/cli/adm_upgrade/recommend:
Trust the ingress CA, 2025-08-14, openshift#30113), I extended the regular
expression to cover:

  Reason: accepted TestRiskA via ConditionalUpdateRisk

But I hadn't thought through the MultipleReasons case, and this commit
catches us up to tech-preview serial output like [1]:

  ...
  Reason: accepted MultipleReasons via ConditionalUpdateRisk
  Message: Cluster operator config-operator should not be upgraded between minor versions: FeatureGatesUpgradeable: "TechPreviewNoUpgrade" does not allow updates

    This is a test risk. https://example.com/testRiskA
  ...

[1]: https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/30113/pull-ci-openshift-origin-main-e2e-gcp-ovn-techpreview-serial-1of2/1963672599955247104
@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Sep 4, 2025
Copy link

openshift-trt bot commented Sep 5, 2025

Job Failure Risk Analysis for sha: c68be5e

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-aws-csi IncompleteTests
Tests for this run (25) are below the historical average (1754): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (106) are below the historical average (551): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

Copy link
Member

@petr-muller petr-muller left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, leaving a note but not a blocker

Comment on lines +342 to +355
defaultIngressSecretName, err := oc.Run("get").Args("--namespace=openshift-ingress-operator", "-o", "jsonpath={.spec.defaultCertificate.name}", "ingresscontroller.operator.openshift.io", "default").Output()
if err != nil {
return "", err
}

if defaultIngressSecretName == "" {
defaultIngressSecretName = "router-certs-default"
}

ingressNamespace := "openshift-ingress"
defaultIngressCert, err := oc.Run("extract").Args("--namespace", ingressNamespace, fmt.Sprintf("secret/%s", defaultIngressSecretName), "--keys=tls.crt", "--to=-").Output()
if err != nil {
return "", err
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hehe given my push for "let's stop using oc to test CVO" in threads like this I think I'll need to voice here as well; do these two need to be oc calls? I think that at least the first get would be more appropriate to do thought client-go and get a typed struct instead of reading elements with jsonpath of --keys.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

they don't need to be oc, but given that we're in the test/extended/cli section, using oc seemed ok. I'm not strongly opinionated for this setup though; they could certainly be ported to the structured client-go if they were going to be used outside of oc CLI testing.

@wking
Copy link
Member Author

wking commented Sep 5, 2025

A few serial jobs failed, but all on either infra or other test-cases, not on anything related to my changes.

/retest-required

@wking
Copy link
Member Author

wking commented Sep 5, 2025

Both /payload-aggregate runs failed the top-level aggregation, but all four child jobs passed. The failures are because of my choice of 4, with lots of ...we require at least 6 attempts.... Between that and this pull's usual presubmits:

/verified by examining all presubmits and /payload jobs, no failures identified for these tests

@openshift-ci-robot openshift-ci-robot added the verified Signifies that the PR passed pre-merge verification criteria label Sep 5, 2025
@openshift-ci-robot
Copy link

@wking: This PR has been marked as verified by examining all presubmits and /payload jobs,no failures identified for these tests.

In response to this:

Both /payload-aggregate runs failed the top-level aggregation, but all four child jobs passed. The failures are because of my choice of 4, with lots of ...we require at least 6 attempts.... Between that and this pull's usual presubmits:

/verified by examining all presubmits and /payload jobs, no failures identified for these tests

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository.

@hongkailiu
Copy link
Member

/lgtm

/hold

for @petr-muller to check the reply above.

@openshift-ci openshift-ci bot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lgtm Indicates that a PR is ready to be merged. labels Sep 5, 2025
Copy link
Contributor

openshift-ci bot commented Sep 5, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: hongkailiu, petr-muller, wking

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Copy link

openshift-trt bot commented Sep 5, 2025

Job Failure Risk Analysis for sha: c68be5e

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-aws-csi IncompleteTests
Tests for this run (25) are below the historical average (1745): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (106) are below the historical average (534): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)


out, err := oc.Run("--certificate-authority", caBundleFilePath, "adm", "upgrade", "recommend", "--version", fmt.Sprintf("4.%d.0", currentVersion.Minor+1), "--accept", "ConditionalUpdateRisk").EnvVar("OC_ENABLE_CMD_UPGRADE_RECOMMEND", "true").EnvVar("OC_ENABLE_CMD_UPGRADE_RECOMMEND_PRECHECK", "true").EnvVar("OC_ENABLE_CMD_UPGRADE_RECOMMEND_ACCEPT", "true").Output()

o.Expect(err).NotTo(o.HaveOccurred())
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hrm, e2e-aws-ovn-single-node-serial failed:

: [Serial][sig-cli] oc adm upgrade recommend When the update service has conditional recommendations runs successfully with conditional recommendations to the --version target [Suite:openshift/conformance/serial]	25s
{  fail [github.com/openshift/origin/test/extended/cli/adm_upgrade/recommend.go:245]: Unexpected error:
    ...
    Failing=True:
    
      Reason: ClusterOperatorNotAvailable
      Message: Cluster operator authentication is not available
      ...
      error: issues that apply to this cluster but which were not included in --accept: Failing

The test-case that covers auth availability flaked:

: [bz-apiserver-auth] clusteroperator/authentication should not change condition/Available
...15 unwelcome but acceptable clusteroperator state transitions during e2e test run...
...exception: https://issues.redhat.com/browse/OCPBUGS-20056...

Working in an exception might be tricky regular-expression handling. Trying to gauge how frequent this issue is:

/payload-aggregate periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-single-node-serial 20

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Of the aggregation run's 20 attempts:

So the auth functionality is pretty flaky in these single-node clusters under serial-suite load. I've pushed c68be5e -> 78ac50f, to hopefully soften this test-case enough to not trip over this issue.

@wking
Copy link
Member Author

wking commented Sep 5, 2025

/retest-required

@wking
Copy link
Member Author

wking commented Sep 5, 2025

Maybe this can't be in a threaded comment?

/payload-aggregate periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-single-node-serial 20

Copy link
Contributor

openshift-ci bot commented Sep 5, 2025

@wking: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command

  • periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-single-node-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/e6b5d560-8a92-11f0-8a7f-9627853831e6-0

A depressing amount of the time in single-node testing, the
authentication ClusterOperator is Available=False, and causes the
'recommend' command to error if the 'Failing' risk is not accepted.
For example, in this aggregation run's 20 attempts [1]:

* 9 passed.
* three failed the '--version' test-case on 'authentication' 'Available=False' [2,3,4].
* two failed on build-cluster registry 500s [5,6].
* two failed on 'prometheus-operator' watch requests [7,8].
* one failed on a 'context deadline exceeded' out of
  'runUpdateService' [9], with 'Failed to pull image...authentication
  required' issues trying to get the 'tools' image.
* one failed on authentication Pod restarts [10].
* one failed on an un-excepted 'authentication' 'Available=False'
  ('OAuthServerDeployment_NoPod') [11].
* one failed on an unexpected successful return in an
  TestImageStreamTagsAdmission test-case [12].

So the auth functionality is pretty flaky in these single-node
clusters under serial-suite load.  Ideally [13] can get addressed or
the auth component can otherwise get firmed up, but until then, this
commit softens our logic to allow that kind of ClusterOperator
Available=False (which gets bubbled up as ClusterVersion
Failing=True), without failing our new test-case.

[1]: openshift#30113 (comment)
[2]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055948225941504
[3]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055951426195456
[4]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055945973600256
[5]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055945063436288#1:build-log.txt%3A32
[6]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055945512226816#1:build-log.txt%3A43
[7]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055950948044800
[8]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055948683120640
[9]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055952327970816
[10]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055953233940480
[11]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055947772956672
[12]: https://prow.ci.openshift.org/view/gs/test-platform-results/logs/openshift-origin-30113-nightly-4.21-e2e-aws-ovn-single-node-serial/1964055952797732864
[13]: https://issues.redhat.com/browse/OCPBUGS-20056
@openshift-ci-robot openshift-ci-robot removed the verified Signifies that the PR passed pre-merge verification criteria label Sep 6, 2025
@openshift-ci openshift-ci bot removed the lgtm Indicates that a PR is ready to be merged. label Sep 6, 2025
Copy link
Contributor

openshift-ci bot commented Sep 6, 2025

New changes are detected. LGTM label has been removed.

Copy link
Contributor

openshift-ci bot commented Sep 6, 2025

@wking: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

Test name Commit Details Required Rerun command
ci/prow/e2e-aws-ovn-kube-apiserver-rollout 78ac50f link false /test e2e-aws-ovn-kube-apiserver-rollout
ci/prow/e2e-metal-ipi-serial-2of2 78ac50f link false /test e2e-metal-ipi-serial-2of2
ci/prow/e2e-aws-disruptive 78ac50f link false /test e2e-aws-disruptive

Full PR test history. Your PR dashboard.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here.

Copy link

openshift-trt bot commented Sep 6, 2025

Job Failure Risk Analysis for sha: 78ac50f

Job Name Failure Risk
pull-ci-openshift-origin-main-e2e-aws-disruptive IncompleteTests
Tests for this run (106) are below the historical average (529): IncompleteTests (not enough tests ran to make a reasonable risk analysis; this could be due to infra, installation, or upgrade problems)

@wking
Copy link
Member Author

wking commented Sep 6, 2025

/payload-aggregate periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-single-node-serial 20

Copy link
Contributor

openshift-ci bot commented Sep 6, 2025

@wking: trigger 1 job(s) for the /payload-(with-prs|job|aggregate|job-with-prs|aggregate-with-prs) command

  • periodic-ci-openshift-release-master-nightly-4.21-e2e-aws-ovn-single-node-serial

See details on https://pr-payload-tests.ci.openshift.org/runs/ci/6bc15d30-8b59-11f0-958c-09916b4ce3f0-0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
approved Indicates a PR has been approved by an approver from all required OWNERS files. do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. jira/valid-bug Indicates that a referenced Jira bug is valid for the branch this PR is targeting. jira/valid-reference Indicates that this PR references a valid Jira ticket of any type.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants