01-03-2023 01:09 PM - edited 01-03-2023 01:10 PM
Attempted to upgrade a month ago. All is running as it should.
The pre-upgrade checks pass, but once the upgrade starts it fails on one of the nodes at 'Upgrading Storage Node' and 'Running Post Relinquish Node Tasks'.
Attempted the upgrade about 5 times so far.
Opened a ticket with Cisco about 3 weeks ago and have gotten no feedback about the cause or solution. If anyone has run into this, would greatly appreciate the help!!
01-03-2023 01:10 PM - edited 01-03-2023 01:10 PM
see screenshot
02-27-2023 05:23 PM
Did you get an update on this?
02-28-2023 09:15 AM
Sounds like it may be some type of timing issue in stopping (relinquishing) services, which failed before the node was actually shut down. If that specific node still shows as offline in HX Connect, but you can still SSH into the storage controller's mgmt IP, then try confirming that a couple of the services are running:
priv service storfs status
priv service exhibitor status
If either of these is stopped, you can try starting that service and see whether the cluster returns to healthy, allowing you to retry the upgrade:
priv service storfs start (only if stopped)
priv service exhibitor start (only if stopped)
Eventually the cluster should go back to being healthy, allowing the upgrade to be retried.
Note that stopping/starting services should normally be done with TAC guidance.
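For anyone who wants to script the "check, then start only if stopped" steps above, here is a rough sketch. It is not a Cisco-provided tool. It assumes that the status output contains the word "running" when the service is up and "stop" or "not running" when it is down, which you should confirm by running the status commands by hand first. The `start_if_stopped` helper and the `PRIV_CMD` variable are made-up names for illustration. If the output matches neither pattern, the sketch does nothing rather than guess.

```shell
#!/bin/sh
# Sketch only: start a service only when its status output says it is stopped.
# ASSUMPTION: "priv service <name> status" prints text containing "running"
# when the service is up, and "stop" or "not running" when it is down.
# Check the real output on your storage controller before relying on this.
# PRIV_CMD is a hypothetical override, handy for dry runs.
PRIV_CMD="${PRIV_CMD:-priv}"

start_if_stopped() {
    svc="$1"
    # Capture whatever the status command prints (stdout and stderr).
    status="$($PRIV_CMD service "$svc" status 2>&1)"
    echo "$svc status: $status"
    case "$status" in
        *"not running"*|*stop*)
            echo "$svc looks stopped; starting it"
            $PRIV_CMD service "$svc" start
            ;;
        *running*)
            echo "$svc looks like it is running; leaving it alone"
            ;;
        *)
            # Unrecognized output: do not start anything automatically.
            echo "$svc status not recognized; check manually"
            ;;
    esac
}

# Usage on the affected storage controller:
#   start_if_stopped storfs
#   start_if_stopped exhibitor
```

The unknown-output branch deliberately takes no action, in line with the advice that service changes should normally happen with TAC guidance.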
Kirk...