Peers with different persistence cause hydra-node to crash on startup #1937

## Context & versions

Since the switch to etcd for networking, if you delete your persistence but one of your configured peers does not delete theirs, `hydra-node` fails on startup with errors like:

```
Apr 10 11:02:23 noon-hydra-nixos-head.c.iog-hydra.internal spinupHydra[561926]: {
    "timestamp":"2025-04-10T11:02:23.182677246Z"
  ,"threadId":68
  ,"namespace":"HydraNode-\"noon\""
  ,"message":{
      "network":{
        "contents":{
          "etcd":{
            "caller":"etcdmain/etcd.go:204"
          ,"error":"member 9140f16a1adb1a87 has already been bootstrapped"
          ,"level":>
...
```

That is, etcd reports "member ... has already been bootstrapped".

This is inconvenient: you have to wait until every peer has also deleted its persistence before you can run `hydra-node` successfully again.

## Steps to reproduce

1. Run with two peers
2. Stop one node
3. Delete that node's persistence
4. Re-run the stopped node; the `hydra-node` executable fails to start.

## Expected behavior

It would be great if the `hydra-node` executable could stay running and retry connecting to its peers periodically, perhaps with some backoff. That is, the problem should resolve itself once everyone runs a `hydra-node` that includes this fix and has **correct `--peer` and `--advertise` command line options**.
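To make "retry with some backoff" concrete, here is a minimal Python sketch of the idea. It is not `hydra-node` code: `backoff_delays`, `retry_until_connected`, the `try_connect` callback, and the default delays are all hypothetical names and values chosen for illustration.

```python
import time
from typing import Callable, Iterator, Optional


def backoff_delays(base: float = 1.0, factor: float = 2.0, cap: float = 60.0) -> Iterator[float]:
    """Yield an endless sequence of delays that grows by `factor` up to `cap`."""
    delay = base
    while True:
        yield min(delay, cap)
        delay = min(delay * factor, cap)


def retry_until_connected(
    try_connect: Callable[[], bool],
    sleep: Callable[[float], None] = time.sleep,
    max_attempts: Optional[int] = None,
) -> int:
    """Call `try_connect` until it returns True and return the attempt count.

    With `max_attempts=None` the loop never gives up, which matches the wish
    that the process stays running instead of exiting.
    """
    delays = backoff_delays()
    attempts = 0
    while max_attempts is None or attempts < max_attempts:
        attempts += 1
        if try_connect():
            return attempts
        # Wait before the next attempt; the wait grows after every failure.
        sleep(next(delays))
    raise RuntimeError(f"gave up after {attempts} attempts")
```

Capping the delay keeps a long-stopped node from waiting excessively once its peers come back.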

## Solution idea

- Detect cluster misconfiguration errors from the internal `etcd` process. Probably these two:
  - "member ... has already been bootstrapped"
  - "mismatching member id"
- Wipe the `etcd/` state dir upon seeing such errors
- Retry starting `etcd` (including initiating the cluster) after some time
- The `hydra-node` should log information about this process and not stop when it sees these errors
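The detect-and-wipe steps above could look roughly like the following Python sketch. It is an illustration, not the actual implementation: the function names are hypothetical, the patterns are copied from the messages quoted in this issue (the exact wording emitted by `etcd` may differ between versions), and the caller is assumed to pass the path of the `etcd/` state dir.

```python
import logging
import re
import shutil
from pathlib import Path

log = logging.getLogger("etcd-recovery")

# Patterns based on the messages quoted in this issue (assumption: etcd's
# exact wording may vary between versions).
MISCONFIGURATION_PATTERNS = [
    re.compile(r"member( \S+)? has already been bootstrapped"),
    re.compile(r"mismatching member id", re.IGNORECASE),
]


def is_cluster_misconfiguration(error_message: str) -> bool:
    """Return True if the etcd error looks like a cluster misconfiguration."""
    return any(p.search(error_message) for p in MISCONFIGURATION_PATTERNS)


def wipe_etcd_state(etcd_dir: Path) -> bool:
    """Delete the etcd state dir if present; return True if it was removed."""
    if not etcd_dir.is_dir():
        return False
    shutil.rmtree(etcd_dir)
    return True


def handle_etcd_error(error_message: str, etcd_dir: Path) -> bool:
    """Return True if the caller should retry starting etcd after a delay."""
    if not is_cluster_misconfiguration(error_message):
        return False
    log.warning("cluster misconfiguration detected: %s", error_message)
    if wipe_etcd_state(etcd_dir):
        log.warning("wiped etcd state in %s, will retry", etcd_dir)
    return True
```

Only the `etcd/` directory is removed here; anything else in the persistence directory is left untouched.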

The [clustering guide](https://etcd.io/docs/v3.5/op-guide/clustering/) may be a useful resource explaining how the `--initial..` command line options work.
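As a rough illustration of the static bootstrap options described in that guide, the Python sketch below assembles `--name`, `--initial-advertise-peer-urls`, `--initial-cluster`, and `--initial-cluster-state` for one member. The helper name, member names, and URLs are hypothetical, and how `hydra-node` actually derives these values from `--peer` and `--advertise` is not shown here.

```python
from typing import Dict, List


def etcd_bootstrap_args(
    name: str,
    advertise_url: str,
    peers: Dict[str, str],
    cluster_state: str = "new",
) -> List[str]:
    """Build etcd bootstrap flags for one member of a static cluster.

    `peers` maps the other members' names to their peer URLs. Sorting makes
    every member produce an identical `--initial-cluster` value.
    """
    members = dict(peers)
    members[name] = advertise_url
    initial_cluster = ",".join(f"{n}={u}" for n, u in sorted(members.items()))
    return [
        "--name", name,
        "--initial-advertise-peer-urls", advertise_url,
        "--initial-cluster", initial_cluster,
        "--initial-cluster-state", cluster_state,
    ]


# Hypothetical two-member cluster: both sides must list the same members.
alice = etcd_bootstrap_args("alice", "http://10.0.0.1:2380", {"bob": "http://10.0.0.2:2380"})
bob = etcd_bootstrap_args("bob", "http://10.0.0.2:2380", {"alice": "http://10.0.0.1:2380"})
```

If one member's `--peer` or `--advertise` values are wrong, the member lists no longer agree, which is why the expected behavior above depends on those options being correct.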
