Alerting system stops triggering after short uptime #106945

@S3bt3r

Environment

self-hosted (https://develop.sentry.dev/self-hosted/)

Steps to Reproduce

  1. Restart the alerting service/system.
  2. Trigger a test alert condition immediately (verify it fires correctly).
  3. Wait for a short period.
  4. Trigger the alert condition again.
  5. Observe that the alert status does not update and no notification is sent, so the new breach goes undetected.

Expected Result

Alert rules should keep being evaluated, and notifications sent, for as long as the service is running. Instead, the alerting system works correctly right after a restart but silently stops evaluating rules or triggering notifications after a short period. While it is in this failure state, no alerts fire even when metrics significantly exceed the defined thresholds.

Observed Behavior:

  • After Restart: Alerts trigger correctly for a brief window.
  • After Short Uptime: Alert execution stalls. Status remains "Resolved" and "Last Triggered" shows no recent activity, despite active incidents (e.g., >350 errors/hr vs >100 threshold).
  • Workaround: Restarting the service temporarily restores functionality.
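
For anyone trying to confirm the same symptom, the stalled state described above can be written down as a simple check. This is only a sketch: `AlertSnapshot` and `looks_stalled` are hypothetical names made up for illustration and are not part of Sentry, and the values are assumed to be read by hand from the alert rule page.

```python
from dataclasses import dataclass


@dataclass
class AlertSnapshot:
    """Values read manually from the alert rule page (hypothetical helper, not a Sentry API)."""
    events_last_hour: int  # observed error count over the last hour
    threshold: int         # the rule should fire when the count exceeds this value
    status: str            # status shown in the UI, e.g. "Resolved"


def looks_stalled(snapshot: AlertSnapshot) -> bool:
    """Return True when the threshold is breached but the rule still reports "Resolved"."""
    breached = snapshot.events_last_hour > snapshot.threshold
    return breached and snapshot.status.lower() == "resolved"


# Figures from this report: roughly 350 errors/hr against a threshold of 100, status "Resolved".
print(looks_stalled(AlertSnapshot(events_last_hour=350, threshold=100, status="Resolved")))
```

A `True` result matches the "After Short Uptime" state; a healthy rule with the same counts would show a non-resolved status and return `False`.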

Actual Result

[Screenshot]

Product Area

Alerts

Link

No response

DSN

No response

Version

25.12.1
