Clarify the role of Cluster ID and other inputs #6

clobrano · 2025-01-27T16:58:48Z

It came to my attention recently that the Cluster ID number is the outcome of member information, so forcing a new cluster does not automatically means getting a new Cluster ID.
I clarified this information in the EP.

Moreover, in the same context, I included the "current cluster state" as information that the Etcd agent might use to decide whether to reuse or drop Etcd data directory at reboot.

Remove the expectation that forcing a new cluster-of-one always results in a new "cluster ID", as the cluster ID is generated from the member information, so if you create the exact same cluster from the same members, it will yield the same cluster id.

Include "current cluster state" and removed "version counter". Signed-off-by: Carlo Lobrano <[email protected]>

beekhof · 2025-02-03T00:03:29Z

enhancements/two-node-fencing/tnf.md

@@ -125,7 +125,7 @@ Upon rebooting, the RHEL-HA components ensure that a node remains inert (not run
 If the failed peer is likely to remain offline for an extended period, admin confirmation is required on the remaining node to allow it to start OpenShift.
 This functionality exists within RHEL-HA, but a wrapper will be provided to take care of the details.

-When starting etcd, the OCF script will use etcd's cluster ID and version counter to determine whether the existing data directory can be reused, or must be erased before joining an active peer.
+When starting etcd, the OCF script will use data on disk (e.g. etcd's cluster ID) and the current state of the cluster (e.g. which resource agent is already running) to determine whether the existing data directory can be reused, or must be erased before joining an active peer.


I would have thought the reverse... that the cluster ID was almost useless, and the version counter the only interesting/useful detail

The cluster ID is still the only source on disk to understand if the peer node has forced a new cluster while the starting node was stopped, so, even if it is expected to be the same most of the time, I would still consider valuable to check on its value at startup.

clobrano added 2 commits January 27, 2025 17:49

Clarify on which inputs Etcd agent decide if reuse data directory

e4e0ee1

Include "current cluster state" and removed "version counter". Signed-off-by: Carlo Lobrano <[email protected]>

beekhof reviewed Feb 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clarify the role of Cluster ID and other inputs #6

Clarify the role of Cluster ID and other inputs #6

Uh oh!

clobrano commented Jan 27, 2025

Uh oh!

beekhof Feb 3, 2025

Uh oh!

clobrano Feb 3, 2025

Uh oh!

Uh oh!

Clarify the role of Cluster ID and other inputs #6

Are you sure you want to change the base?

Clarify the role of Cluster ID and other inputs #6

Uh oh!

Conversation

clobrano commented Jan 27, 2025

Uh oh!

beekhof Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

clobrano Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!