cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1206
Views
1
Helpful
8
Replies

UCS 240 M3 power error

varadidani05
Level 1
Level 1

Hi!

I have a UCS 240 M3 server, and I get these errors in the imc, server won't turn on just the fans are spinning. Does anyone have an idea what could be causing the errors? 

image.png

 

I tested the power supplies, and they are good. 

 

Thanks for the reply!

1 Accepted Solution

Accepted Solutions

josoneal
Cisco Employee
Cisco Employee

I reviewed the new logs. It appears to be a motherboard issue as we see the server immediately reports power failed on error when you attempt to power on the server.

5:2023 Jun 27 20:14:38:BMC:kernel:-:<5>[__do_power_on]:385:__do_power_on
5:2023 Jun 27 20:14:38:BMC:kernel:-:<5>[__do_power_on]:401:Power Driver: Power On Logic Pulse for 250ms @ 135879
6:2023 Jun 27 20:14:38:BMC:kernel:-:<6>[press_button]:117:Power Driver: Press Power Button @ 135879
5:2023 Jun 27 20:14:38:BMC:IPMI:1583: Bridge.c:1468:audit from:kcs Fn:0x0 Cmd:0x2 Data: 0x1 
5:2023 Jun 27 20:14:38:BMC:IPMI:1583: Bridge.c:1474:audit Resp:0x0
5:2023 Jun 27 20:14:38:BMC:AUDIT:1820: Server power state modify (op:power-on)
6:2023 Jun 27 20:14:38:BMC:kernel:-:<6>[release_button]:132:Power Driver: Release Power Button @ 136130
5:2023 Jun 27 20:14:47:BMC:IPMI:1583: Bridge.c:1468:audit from:kcs Fn:0x0 Cmd:0x6 Data: 0x30 0x1 
5:2023 Jun 27 20:14:47:BMC:IPMI:1583: Bridge.c:1474:audit Resp:0x0
2:2023 Jun 27 20:14:58:BMC:kernel:-:<2>[__do_power_on]:428:Power Driver: Power On Fail Asserted @ 156211 RegInfo -> SWSTAT0[0x01] WKCFG0[0x11]
5:2023 Jun 27 20:14:58:BMC:IPMI:1583: Pilot2SrvPower.c:189:Pilot2SrvPowerOn
4:2023 Jun 27 20:14:58:BMC:IPMI:1583: Pilot2SrvPower.c:191:Pilot2SrvPowerOn: Failed to Power On [Timer expired]
5:2023 Jun 27 20:14:59:BMC:IPMI:1583: Pilot2SrvPower.c:466:Blade Power Changed To: [ ON ]
<ommit>
5:2023 Jun 27 20:15:00:BMC:selparser:1671: selparser.c:774: # 07 03 00 00 01 02 00 00 23 27 9B 64 20 00 04 24 08 00 00 00 04 01 FF FF # 307 | 06/27/2023 20:14:59 CEST | CIMC | Platform alert POWER_ON_FAIL #0x08 | Predictive Failure asserted | Asserted

 It is recommended to replace the motherboard, but unfortunately since this is a C240 M3, they are already end of life (EoL) according to C240 M3 EOL documentation 

 

View solution in original post

8 Replies 8

@varadidani05 seems like motherboard issue or power supply voltage issues. if you have any other backup power supplies, try swapping  with them. if power supplies are working well , then its board issue which may need replacement.

Please rate this and mark as solution/answer, if this resolved your issue
Good luck
KB

I replaced the power supplies, but i still get the same errors. Could it be that an electrolytic capacitor has dried up and is causing the problem?

josoneal
Cisco Employee
Cisco Employee

If you upload CIMC logs, granted the file is small enough, we can check to see what's causing the Power On Fail. Power on Fail can be caused by any component within the server including the motherboard itself. More than likely not going to be a power supply issue.

Here you go, and thanks for the reply. 

How many power supplies are being used in this server? I'm only seeing 1 PSU in inventory.

...
Fan status
 
...
Power Supplies
  deviceID: PSU1
  Status: Present
  Status2: equipped
  InputWatt: 17
  Power: on
  OutputMaxWatt: N/A
  fwVersion: 04030205
  Model: DPS-650AB-2 A
  MfgRev: 00
  ProductID: UCSC-PSU-650W
 
 
 
...
Adapter List
 
Also seeing PSU2 being discovered and then immediately losing power.:

# 56 02 00 00 01 02 00 00 7C D7 99 64 20 00 04 08 25 00 00 00 EF 30 FF 00 # 256 | 06/26/2023 20:22:52 CEST | CIMC | Power supply PSU2_STATUS #0x25 | Presence detected | | Deasserted
# 57 02 00 00 01 02 00 00 7C D7 99 64 20 00 04 08 25 00 00 00 EF 33 FF 00 # 257 | 06/26/2023 20:22:52 CEST | CIMC | Power supply PSU2_STATUS #0x25 | Power supply input lost (AC/DC) | | Deasserted 
 
I'd recommend doing some layer 1 troubleshooting.
- Reseat PSUs
- Reseat Power cables
- Check PDU
- Swap PSUs to see if the issue follows the PSU or stays with the slot.
- Also can swap power cables with PSUs to see if the issue follows the cable or stays with the PSU
 

I didn't plug in both power supplies while i downloaded CIMC logs, that's why it says that error. In the afternoon I will plug in both and send it again.

There you go and sorry for the wait

josoneal
Cisco Employee
Cisco Employee

I reviewed the new logs. It appears to be a motherboard issue as we see the server immediately reports power failed on error when you attempt to power on the server.

5:2023 Jun 27 20:14:38:BMC:kernel:-:<5>[__do_power_on]:385:__do_power_on
5:2023 Jun 27 20:14:38:BMC:kernel:-:<5>[__do_power_on]:401:Power Driver: Power On Logic Pulse for 250ms @ 135879
6:2023 Jun 27 20:14:38:BMC:kernel:-:<6>[press_button]:117:Power Driver: Press Power Button @ 135879
5:2023 Jun 27 20:14:38:BMC:IPMI:1583: Bridge.c:1468:audit from:kcs Fn:0x0 Cmd:0x2 Data: 0x1 
5:2023 Jun 27 20:14:38:BMC:IPMI:1583: Bridge.c:1474:audit Resp:0x0
5:2023 Jun 27 20:14:38:BMC:AUDIT:1820: Server power state modify (op:power-on)
6:2023 Jun 27 20:14:38:BMC:kernel:-:<6>[release_button]:132:Power Driver: Release Power Button @ 136130
5:2023 Jun 27 20:14:47:BMC:IPMI:1583: Bridge.c:1468:audit from:kcs Fn:0x0 Cmd:0x6 Data: 0x30 0x1 
5:2023 Jun 27 20:14:47:BMC:IPMI:1583: Bridge.c:1474:audit Resp:0x0
2:2023 Jun 27 20:14:58:BMC:kernel:-:<2>[__do_power_on]:428:Power Driver: Power On Fail Asserted @ 156211 RegInfo -> SWSTAT0[0x01] WKCFG0[0x11]
5:2023 Jun 27 20:14:58:BMC:IPMI:1583: Pilot2SrvPower.c:189:Pilot2SrvPowerOn
4:2023 Jun 27 20:14:58:BMC:IPMI:1583: Pilot2SrvPower.c:191:Pilot2SrvPowerOn: Failed to Power On [Timer expired]
5:2023 Jun 27 20:14:59:BMC:IPMI:1583: Pilot2SrvPower.c:466:Blade Power Changed To: [ ON ]
<ommit>
5:2023 Jun 27 20:15:00:BMC:selparser:1671: selparser.c:774: # 07 03 00 00 01 02 00 00 23 27 9B 64 20 00 04 24 08 00 00 00 04 01 FF FF # 307 | 06/27/2023 20:14:59 CEST | CIMC | Platform alert POWER_ON_FAIL #0x08 | Predictive Failure asserted | Asserted

 It is recommended to replace the motherboard, but unfortunately since this is a C240 M3, they are already end of life (EoL) according to C240 M3 EOL documentation 

 

Review Cisco Networking for a $25 gift card

Review Cisco Networking for a $25 gift card