Skip to content

Commit dfec0d6

Browse files
committed
document recovering from ha degraded mode docker#815
1 parent 20ee882 commit dfec0d6

File tree

1 file changed

+20
-0
lines changed

1 file changed

+20
-0
lines changed

datacenter/dtr/2.2/guides/admin/monitor-and-troubleshoot/troubleshoot-with-logs.md

Lines changed: 20 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,3 +37,23 @@ You can use javascript syntax to execute rethinkdb queries like so:
3737
```none
3838
> r.db('dtr2').table('repositories')
3939
```
40+
41+
## Recovering from a lost replica
42+
43+
When one of DTR's replicas is lost, the UI will start showing a warning that
44+
looks something like the following:
45+
46+
```none
47+
Warning: The following replicas are unhealthy: 59e4e9b0a254; Reasons: Replica reported health too long ago: 2017-02-18T01:11:20Z; Replicas 000000000000, 563f02aba617 are still healthy.
48+
```
49+
50+
To remedy this situation, you need to use the `remove` command to tell
51+
the cluster that the lost replica should be treated as permanently removed.
52+
After that you can use the `join` command to grow your cluster back to the
53+
desired number of replicas. In this example you would run the following
54+
commands (and follow the prompts for the UCP connection parameters):
55+
56+
```none
57+
$ docker run --rm -it docker/dtr remove --ucp-insecure-tls --replica-id 59e4e9b0a254 --existing-replica-id 000000000000
58+
$ docker run --rm -it docker/dtr join --ucp-insecure-tls --existing-replica-id 000000000000
59+
```

0 commit comments

Comments
 (0)