cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1920
Views
1
Helpful
3
Replies

Hyperflex 'Cluster Upgrade failure' - 4.0.2f to 5.0.2a

blee0518
Level 1
Level 1

Attempted to upgrade a month ago. All is running as it should. 
Pre-upgrade passes and once the upgrade is started a failure occurs with one of the nodes: 'Upgrading Storage Node' and 'Running Post Relinquish Node Tasks'

Attempted the upgrade about 5 times so far.
Opened a ticket with Cisco about 3 weeks ago and have gotten no feedback about the cause or solution. If anyone has run into this, would greatly appreciate the help!! 

 

3 Replies 3

blee0518
Level 1
Level 1

see screenshot

tr360
Level 1
Level 1

Did you get an update on this?

Kirk J
Cisco Employee
Cisco Employee

Sounds like maybe the some type of timing issuing in stopping (relinquishing) services, that failed before the node was actually shutdown.  If that specific node still shows being offline in hx connect, but you can still ssh into the storage controllers mgmt ip then try confirming a couple of the services are running:

priv service storfs status

priv service exhibitor status

If either of these are stopped, you can try restarting the services and see if the cluster will return to healthy, allowing you to retry the upgrade:

priv service storfs start  (only if stopped)

priv service exhibitor start (only if stopped)

Eventually the cluster should go back to being healthy, allowing the upgrade to be retried.

stopping/starting services should normally include TAC guidance.

Kirk...

Review Cisco Networking for a $25 gift card