We have a UCS B200 M3 blade running ESXi 6.0 that suddenly crashed with a PSOD. The server has 2x E5-2690 v2 CPUs and 384GB of memory. At the time of the crash, the SEL logs indicated a memory issue. However, I recently upgraded the CPUs from v1 to v2 and maxed out the DIMMs, so I don't want to jump to conclusions yet.
Looking for some input on what else, if anything, could have caused the crash. I have multiple other servers running the exact same hardware setup and the same firmware levels, so I want to rule out any possible configuration issues as well.
I'm guessing the DIMM errors are probably red herrings.
It's more likely there is a problem with the system board or the processors.
You may want to double-check the processor socket pins that are actually on the motherboard to make sure none got bent during the proc upgrades/replacements, as I'm assuming this systemboard had a previous clean track record...
If you don't see any bent pins, try pulling one proc (and its related DIMMs) and running the diag ISO.
Then repeat with the other CPU in a single-CPU config and run the diag ISO again to see whether the problem follows a specific proc.
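Before pulling hardware, it may help to check whether the DIMM events in the SEL cluster behind one CPU, since that would point at that socket or proc rather than at individual DIMMs. Below is a rough Python sketch of that tally. It assumes you have exported the SEL as text, that DIMM locators appear as something like `DIMM_A0` through `DIMM_H2`, and that channels A-D belong to CPU1 and E-H to CPU2; check those assumptions against your blade's spec sheet. The function names and the sample lines are made up for illustration and are not real UCS output.

```python
import re
from collections import Counter

# Assumption: DIMM locators look like "DIMM_A0" .. "DIMM_H2".
DIMM_RE = re.compile(r"DIMM[_ ]?([A-H])([0-2])", re.IGNORECASE)

# Assumption: channels A-D are attached to CPU1, E-H to CPU2.
CPU1_CHANNELS = {"A", "B", "C", "D"}


def cpu_for_channel(channel):
    """Map a memory channel letter to the CPU assumed to own it."""
    return "CPU1" if channel.upper() in CPU1_CHANNELS else "CPU2"


def tally_dimm_events(lines):
    """Count SEL lines that mention a DIMM, per DIMM and per owning CPU."""
    per_dimm = Counter()
    per_cpu = Counter()
    for line in lines:
        match = DIMM_RE.search(line)
        if match is None:
            continue  # not a DIMM-related line
        channel = match.group(1).upper()
        slot = match.group(2)
        per_dimm[channel + slot] += 1
        per_cpu[cpu_for_channel(channel)] += 1
    return per_dimm, per_cpu


if __name__ == "__main__":
    # Hypothetical sample lines, only to show the shape of the input.
    sample = [
        "event 1: memory error on DIMM_E1",
        "event 2: memory error on DIMM_F0",
        "event 3: fan speed normal",
        "event 4: memory error on DIMM_E1",
    ]
    dimms, cpus = tally_dimm_events(sample)
    print("per DIMM:", dict(dimms))
    print("per CPU:", dict(cpus))
```

If the errors are spread over several DIMMs that all sit behind the same CPU, that lines up with the bent-pin or bad-proc theory; if they all land on a single DIMM, that DIMM is worth swapping first.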