-
Notifications
You must be signed in to change notification settings - Fork 21
Open
Description
First of all, I apologize if this is the wrong avenue to post this. Please let me know and I will redirect.
Issue
We've had our backend crash due to OOM this morning and after a restart, I'm seeing the AgentManager process on the phoenix live dashboard, steadily growing in usage and message queue size.
On first check, the queue was at 6000 messages and memory usage kept shifting between 75 and 100 mb.
An hour or so later, it is now at 24000 messages, with usage shifting between 250 and 400.
Looking at the code, it seems like it's having difficulties connecting to scout, causing it to wait, causing the queue to pile up.
Questions
- Should the process flush messages after a threshold, to avoid this scenario?
- Are we doing something wrong? We haven't changed anything in our configuration recently.
- Is there unreported downtime that could be causing this?
kylekermgard
Metadata
Metadata
Assignees
Labels
No labels