Ingesters do not wait long enough for transfer #1307

@bboreham

We used to tell our ingesters to wait for two minutes, but that was removed by #1185.

Today I see this:

level=info ts=2019-03-28T16:41:45.673567131Z caller=lifecycler.go:456 msg="changing ingester state from" old_state=ACTIVE new_state=LEAVING
level=error ts=2019-03-28T16:41:45.688616369Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:45.695897001Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:45.746735562Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:45.807703134Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:46.334636223Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:46.46819466Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:47.212946695Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:47.370324186Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:48.672456187Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:51.35543268Z caller=transfer.go:199 msg="transfer failed" err="cannot find ingester to transfer chunks to: no pending ingesters"
level=error ts=2019-03-28T16:41:51.355469649Z caller=lifecycler.go:466 msg="Failed to transfer chunks to another ingester" err="terminated after 10 retries"

so it isn't even waiting the 100ms minimum I expected from a cursory reading of the code.

Possibly related: #792
