Skip to content

Clarify the role of Cluster ID and other inputs #6

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 2 commits into
base: 2no
Choose a base branch
from

Conversation

clobrano
Copy link
Collaborator

It came to my attention recently that the Cluster ID number is the outcome of member information, so forcing a new cluster does not automatically means getting a new Cluster ID.
I clarified this information in the EP.

Moreover, in the same context, I included the "current cluster state" as information that the Etcd agent might use to decide whether to reuse or drop Etcd data directory at reboot.

Remove the expectation that forcing a new cluster-of-one always results
in a new "cluster ID", as the cluster ID is generated from the member
information, so if you create the exact same cluster from the same
members, it will yield the same cluster id.
Include "current cluster state" and removed "version counter".

Signed-off-by: Carlo Lobrano <[email protected]>
@@ -125,7 +125,7 @@ Upon rebooting, the RHEL-HA components ensure that a node remains inert (not run
If the failed peer is likely to remain offline for an extended period, admin confirmation is required on the remaining node to allow it to start OpenShift.
This functionality exists within RHEL-HA, but a wrapper will be provided to take care of the details.

When starting etcd, the OCF script will use etcd's cluster ID and version counter to determine whether the existing data directory can be reused, or must be erased before joining an active peer.
When starting etcd, the OCF script will use data on disk (e.g. etcd's cluster ID) and the current state of the cluster (e.g. which resource agent is already running) to determine whether the existing data directory can be reused, or must be erased before joining an active peer.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I would have thought the reverse... that the cluster ID was almost useless, and the version counter the only interesting/useful detail

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The cluster ID is still the only source on disk to understand if the peer node has forced a new cluster while the starting node was stopped, so, even if it is expected to be the same most of the time, I would still consider valuable to check on its value at startup.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants