08-23-2022 05:54 PM - edited 08-23-2022 05:57 PM
Hi,
We have recently installed an Nvidia P6 R graphics card into 2 blades - both M5 v 2 running firmware 4.0(4l).
Each blade is in a separate domain - working card in a 6454, not-looking-like-its-working card in a 6332.
1 blade has taken the graphics card, and it has the drivers installed and is showing either graphics or compute. this is good.
the other blade in another domain was set up the same, but it doesn't look the same at all, rather than showing as graphics or compute it just says N/A, and doesn't have a driver version or firmware package associated. It is visible, but i don't feel like its actually in use.
ive checked the firmware package and it contains the correct image, and for all intents and purposes the set up is identical, i rebooted the host to see if that would install the fw and drivers for the card, but it hasnt.
Has anyone had the same issue? we are about to install some more front mezz versions in the same hosts and i would like to get them both in the same state. We havent had any complaints from the customers, but im not sure they would know what to look for or anything.
i can't find any documentation on the configuration of the gpu once installed, there might not be anything to actually do on the ucs after, any ideas would be appreciated!
Thank you
08-23-2022 08:11 PM
Are they using the same firmware version? whats the vbios version on the GPU? you can check it from the UCS Manager and inventory under server.
08-24-2022 02:07 PM
Hi Ravi,
i can't see anywhere that mentions vbios for the gpu, can you be a little more specific as to where it is?
08-24-2022 02:56 PM
There is an example of where to check here:
Select the Server and on the right pane select Inventory > GPUs.
08-24-2022 03:26 PM - edited 08-24-2022 03:27 PM
08-24-2022 03:40 PM
You are saying the GPU is operating correctly from the OS side, but just not reporting correctly in UCSM?
If thats the case, you may just try and reset the CIMC on the server in questions from Recover Server action in UCSM for the blade in question.
08-24-2022 03:49 PM
When i go to the esxcli and run the commands, it doesnt show up - but Vsphere acknowledges that it is there and allows it to be configured.
I have done a re-cknowledge of the blades to no avail before, is it best to put the host into maint before tweaking the cimc?
08-24-2022 03:58 PM
CIMC reset should not impact the host OS, but would be best to put the blade in maintenance mode to be safe.
Its interesting that the esxcli commands do not work. Do they work on the other server? same version and driver in use?
08-24-2022 04:24 PM - edited 08-24-2022 04:25 PM
a click of the cimc made no difference to the bad host!
but what i mean about the commands is that the commands do work, but the output is different.
we have just installed a front mez card on the good host now. attached is the cli output from that and also a pic of what the ucs shows
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide