Skip to content

Unable to keep up with pending state heal #26687

Closed
@kumavis

Description

@kumavis

System information

Geth version: instance=Geth/v1.10.26-stable-e5eb32ac/linux-amd64/go1.18.8
CL client & version: prysm:stable
OS & Version: Linux name 5.19.0-31-generic #32-Ubuntu SMP PREEMPT_DYNAMIC Fri Jan 20 15:20:08 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

Expected behaviour

"State heal in progress" eventually completes

Actual behaviour

"State heal in progress" continues indefinitely

Steps to reproduce the behaviour

  • bringing machine back on after a two week down time <-- important
  • running on Digital Ocean Storage-Optimized NVMe

Here is a graph of the pending "State heal in progress" field in the log line over time

eth1              | INFO [02-14|11:40:13.492] State heal in progress                   accounts=1,609,[email protected] slots=4,938,[email protected] [email protected] nodes=46,528,[email protected] pending=38937

image

lots of memory available for system disk Buffer
image

some disk statistics. iowait is ~50%
image
image
image
image

geth syncing dashboard snapshot https://snapshots.raintank.io/dashboard/snapshot/j57U07jPZBxmA5wxR2bM7PkBcfNCKIpx
system dashboard snapshot https://snapshots.raintank.io/dashboard/snapshot/duM8SNGtvhRkU3e9UO9jDsF4BT3J2n77

here is a small section of logs https://gist.github.com/kumavis/889eb03156fa7cc54935917b2539f10f

let me know what additional data can help

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions