[Docs] explain cluster management + recovery operations #1649

joaofnfernandes · 2017-02-14T01:45:15Z

@nicolaka commented on Tue Jan 31 2017

What's the problem/feature request?

Just got off the phone with a customer who was very confused about node management with UCP. IMO both the swarm and UCP docs lack the end-to-end story around cluster management. Neither set of docs goes through (in detail) the differences between node rm, node demote, swarm leave, and the UCP UI's node remove operation, along with the recommended process for removing nodes from a cluster. For UCP especially: how do node operations affect HA, and what should you do when node removal/demotion goes wrong?

I think fixing these issues is crucial for the next release:

#1451
#815

What versions/components are affected?

2.1

@alexmavr commented on Tue Jan 31 2017

+1. I agree that our documentation is badly lacking on this subject; I had a similar discussion earlier about several customer issues that arise from this confusion. Our recommended path for removing a node from the cluster / uninstalling UCP from one node should be:

1. Perform docker node demote for the target node from another manager node. This step can be skipped if the target node is not a manager.
2. (Optional) Wait for 5-10 seconds so that node reconciliation kicks in and starts cleaning up the UCP leftovers. This guarantees that adding new nodes later on is not going to result in problems.
3. Perform a docker swarm leave on the target node.
4. Perform a docker node rm for the target node from another manager node.
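
For anyone scripting this, here is a rough sketch of the sequence above as a small Python helper. It only builds the list of commands and notes where each one should run; it does not execute anything. The names `Step` and `removal_plan`, the node name `node-3`, and the use of `sleep` for the optional wait are all assumptions made for illustration, not part of UCP or the Docker CLI. The only real commands used are `docker node demote`, `docker swarm leave`, and `docker node rm`.

```python
from typing import List, NamedTuple


class Step(NamedTuple):
    run_on: str         # "manager" = another manager node, "target" = the node being removed
    command: List[str]  # argv to run on that node


def removal_plan(node: str, is_manager: bool, wait_seconds: int = 10) -> List[Step]:
    """Build the removal sequence from the comment above (hypothetical helper).

    wait_seconds=0 skips the optional wait; 5-10 seconds is the range
    suggested in the thread.
    """
    plan: List[Step] = []
    if is_manager:
        # Step 1: demote from another manager; skipped for worker nodes.
        plan.append(Step("manager", ["docker", "node", "demote", node]))
    if wait_seconds > 0:
        # Step 2 (optional): give node reconciliation time to clean up.
        # "sleep" is just a stand-in for waiting; any delay would do.
        plan.append(Step("manager", ["sleep", str(wait_seconds)]))
    # Step 3: the target node leaves the swarm itself.
    plan.append(Step("target", ["docker", "swarm", "leave"]))
    # Step 4: remove the node entry from another manager.
    plan.append(Step("manager", ["docker", "node", "rm", node]))
    return plan


if __name__ == "__main__":
    for step in removal_plan("node-3", is_manager=True):
        print(f"[{step.run_on}] {' '.join(step.command)}")
```

Keeping the plan as data rather than running it directly makes the "which node do I run this on" question explicit, which seems to be where most of the confusion comes from.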

@joaofnfernandes commented on Tue Jan 31

Relates to #2699


alexmavr · 2017-02-17T19:09:17Z

Fixed in #1792

@joaofnfernandes please close if you agree

docker-robott · 2023-03-06T03:04:29Z

Closed issues are locked after 30 days of inactivity.
This helps our team focus on active issues.

If you have found a problem that seems similar to this, please open a new issue.

/lifecycle locked

joaofnfernandes self-assigned this Feb 14, 2017

joaofnfernandes added area/enterprise and hackaton labels Feb 14, 2017

joaofnfernandes removed their assignment Feb 15, 2017

joaofnfernandes added the P0 label Feb 16, 2017

joaofnfernandes closed this as completed Feb 17, 2017

mdlinville added ddc-hackathon and removed hackathon labels Apr 5, 2017

docker locked and limited conversation to collaborators Mar 6, 2023

docker-robott added the lifecycle/locked label Mar 6, 2023
