Race condition in Global Concurrency Limits - recommended protection strategy? #20520

oleksandr-ieremchuk · 2026-02-02T19:20:29Z

Environment

Prefect version: 3.6.15 (self-hosted)
Database: PostgreSQL 18.0
Deployment: Kubernetes with Prefect Worker

Situation

We have multiple long-running workflow deployments for each tenant that must NOT run concurrently due to:

Resource constraints (memory, CPU)
External API rate limits
Data consistency requirements

We use Global Concurrency Limits (GCL) with limit=1 to enforce this:

from prefect.concurrency.asyncio import concurrency

async with concurrency(
    ["tenant-xyz-global", "category-heavy-global"],
    occupy=1,
    timeout_seconds=3600,
    strict=False,
):
    ...  # Workflow execution (~30-40 minutes)

Problem

We observed two flows simultaneously acquiring slots from the same GCL despite limit=1.

Timeline from PostgreSQL flow_run table:

Flow A: Started 18:00:37, completed 18:35:56 (35 min) ✅
Flow B: Started 18:01:37 (waiting for slot), completed 18:39:23 (38 min)
Flow C: Started 18:01:46 (waiting for slot), completed 18:41:49 (40 min)

When Flow A released its slot at 18:35:56, both Flow B and Flow C (which had been waiting) captured the slot simultaneously, violating limit=1.

Database evidence:

SELECT name, active_slots, "limit", updated 
FROM concurrency_limit_v2 
WHERE name = 'tenant-xyz-global';

-- Result immediately after: active_slots=2, limit=1

Analysis

Looking at Prefect source code:

1. Orchestration rules (deployment/task concurrency) - PROTECTED:

Use database transactions for atomic bulk_increment_active_slots
Tests like test_concurrent_reacquisition_only_one_succeeds validate protection

2. HTTP API (/v2/concurrency_limits/increment-with-lease) - NOT PROTECTED:

Multiple concurrent HTTP requests can read the same active_slots=0 before any of them commits
No SELECT FOR UPDATE or similar locking
Race window: Request A reads → Request B reads → both increment → both commit
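The race window described above can be reproduced in isolation. The following is a minimal sketch using an in-memory SQLite table, not Prefect's server code: the table and column names are borrowed from the query earlier in this post, and `make_db`, `read_slots`, `racy_acquire_interleaved`, and `atomic_acquire` are hypothetical helpers written for illustration. It contrasts a read-then-write sequence with a single conditional UPDATE that checks the limit and increments in one statement.

```python
import sqlite3

NAME = "tenant-xyz-global"


def make_db():
    """Create an in-memory table shaped like the one queried above (illustrative only)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        'CREATE TABLE concurrency_limit_v2 ('
        'name TEXT PRIMARY KEY, active_slots INTEGER NOT NULL, "limit" INTEGER NOT NULL)'
    )
    conn.execute("INSERT INTO concurrency_limit_v2 VALUES (?, 0, 1)", (NAME,))
    conn.commit()
    return conn


def read_slots(conn, name):
    """Return (active_slots, limit) for one limit row."""
    return conn.execute(
        'SELECT active_slots, "limit" FROM concurrency_limit_v2 WHERE name = ?',
        (name,),
    ).fetchone()


def racy_acquire_interleaved(conn, name):
    """Simulate two requests that both read before either one writes."""
    snapshot_a = read_slots(conn, name)  # Request A reads active_slots=0
    snapshot_b = read_slots(conn, name)  # Request B reads active_slots=0
    granted = 0
    for active, limit in (snapshot_a, snapshot_b):
        # Each request decides based on its own stale snapshot.
        if active + 1 <= limit:
            conn.execute(
                "UPDATE concurrency_limit_v2 SET active_slots = active_slots + 1 "
                "WHERE name = ?",
                (name,),
            )
            granted += 1
    conn.commit()
    return granted


def atomic_acquire(conn, name, slots=1):
    """Check the limit and increment in a single statement; True if a slot was granted."""
    cur = conn.execute(
        "UPDATE concurrency_limit_v2 SET active_slots = active_slots + ? "
        'WHERE name = ? AND active_slots + ? <= "limit"',
        (slots, name, slots),
    )
    conn.commit()
    return cur.rowcount == 1
```

With the interleaved version, both simulated requests are granted a slot and the row ends at active_slots=2 with limit=1, matching the database evidence above. With the conditional UPDATE, the second caller matches zero rows and is refused. On PostgreSQL, a SELECT ... FOR UPDATE on the limit row before the check would be another way to close the same window.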

Questions

1. Is this expected behavior? Should Global Concurrency Limits be considered "best-effort" rather than strict guarantees?
2. What's the recommended approach for strict enforcement across multiple deployments for the same resource/tenant?
3. Should we use deployment-level concurrency instead? Our use case: multiple deployment types per tenant, need coordination between them.
4. Is atomic protection planned for HTTP API-based GCL, or is there architectural reason not to add it?

jagmarques · 2026-04-06T20:23:55Z

This is a real race condition in the HTTP-based GCL path - your analysis is correct. The /increment-with-lease endpoint does not use SELECT FOR UPDATE or any row-level locking, so concurrent requests can read the same active_slots value before either commits.

Until this is fixed upstream, two practical workarounds:

1. Use deployment-level concurrency limits instead. These go through the orchestration rules path, which does use atomic database operations. If you set concurrency_limit=1 on the deployment itself, the scheduler handles serialization correctly.
2. Add an external distributed lock. If you need cross-deployment coordination, a Redis-based lock (e.g., redis.lock() with auto-expiry matching your max flow duration) as the outermost guard gives you the atomic guarantee that GCL currently lacks.

Option 1 is simpler if your concurrency boundary aligns with deployments. Option 2 fits better if you need cross-deployment tenant isolation.
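For the external lock, a minimal sketch could look like the following. It assumes a redis-py style client whose `lock(name, timeout=..., blocking_timeout=...)` method returns a lock object with `acquire()`, `owned()`, and `release()`. The helper `tenant_lock`, the exception `TenantBusyError`, and the key format `lock:tenant:<tenant>` are hypothetical names chosen for illustration. The client is passed in rather than imported, so the helper itself has no hard dependency.

```python
from contextlib import contextmanager


class TenantBusyError(RuntimeError):
    """Raised when the tenant lock could not be acquired within wait_seconds."""


@contextmanager
def tenant_lock(client, tenant, hold_seconds=3600, wait_seconds=3600):
    """Hold one lock per tenant for the duration of the with-block.

    client       -- assumed to be a redis-py style client, e.g. redis.Redis(...)
    hold_seconds -- auto-expiry; should exceed the longest expected flow run
    wait_seconds -- how long to wait for the lock before giving up
    """
    lock = client.lock(
        f"lock:tenant:{tenant}",  # hypothetical key naming scheme
        timeout=hold_seconds,
        blocking_timeout=wait_seconds,
    )
    if not lock.acquire():
        raise TenantBusyError(f"tenant {tenant!r} is still locked")
    try:
        yield lock
    finally:
        # The lock may have auto-expired during a long run; only release
        # it if this holder still owns it.
        if lock.owned():
            lock.release()


# Usage sketch (requires a reachable Redis server and the redis package):
#
#   import redis
#   client = redis.Redis(host="localhost", port=6379)
#   with tenant_lock(client, "xyz", hold_seconds=3600):
#       run_heavy_workflow()  # hypothetical workflow function
```

The auto-expiry is a trade-off: if it is shorter than the actual run time, the lock can lapse while the flow is still working and a second run may start, so it is worth setting it comfortably above the 30-40 minute runs described in the question. This version is synchronous; an async flow would likely want the async variant of the client instead.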
