@ShahabT ShahabT commented Jun 3, 2025

What changed?

When an auto-upgrade workflow moves to another deployment version, the sticky queue is cleared from mutable state. In an edge case, however, a workflow task that was already started on the old version completes later, after the sticky queue has been cleared. In that case, the completed workflow task must not set the sticky queue back to the old worker.
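
For readers unfamiliar with the race, here is a minimal sketch of the guard in Go. It is an illustration only and is not taken from the actual change: `mutableState`, `startTransition`, `completeWorkflowTask`, and the field names are all hypothetical stand-ins for the real server types.

```go
package main

// NOTE: every name in this sketch is hypothetical and chosen for
// illustration; none of it is the real Temporal server API.

// mutableState is a simplified stand-in for a workflow's mutable state.
type mutableState struct {
	stickyQueue       string // empty string means no sticky queue is set
	transitionOngoing bool   // true while the workflow is moving to another deployment version
}

// startTransition models the workflow starting to move to another
// deployment version: the transition is recorded and the sticky queue
// is cleared so the next task is not routed to the old worker.
func (ms *mutableState) startTransition() {
	ms.transitionOngoing = true
	ms.stickyQueue = ""
}

// completeWorkflowTask models a worker completing a workflow task and
// asking to be set as the sticky worker via requestedSticky.
func (ms *mutableState) completeWorkflowTask(requestedSticky string) {
	if ms.transitionOngoing {
		// The task may have been started on the old version before the
		// transition began. Setting the sticky queue here would point the
		// workflow back at the old worker, so the request is ignored.
		return
	}
	ms.stickyQueue = requestedSticky
}
```

With this guard, a task that was started before `startTransition` and completes afterwards leaves `stickyQueue` empty, while a completion with no transition in progress sets it as usual.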

Why?

This bug causes the workflow to get stuck in a transition that never completes.

How did you test it?

  • built
  • run locally and tested manually
  • covered by existing tests
  • added new unit test(s)
  • added new functional test(s)

Potential risks

None. Should be added to the patch.

@ShahabT ShahabT requested a review from a team as a code owner June 3, 2025 00:27
@ShahabT ShahabT requested a review from carlydf June 3, 2025 00:27
@ShahabT ShahabT enabled auto-merge (squash) June 3, 2025 01:23
@ShahabT ShahabT merged commit 8226053 into main Jun 3, 2025
53 checks passed
@ShahabT ShahabT deleted the shahab/sticky-transition branch June 3, 2025 01:45
temporal-cicd bot added a commit that referenced this pull request Jun 3, 2025
Shivs11 pushed a commit that referenced this pull request Jun 3, 2025
Commit changes from pull request (#7852) into cloud/v1.28.0-134
1a9cb61 - Do not set sticky queue if a transition is ongoing

Co-authored-by: temporal-cicd[bot] <temporal-cicd[bot]@users.noreply.github.com>