11-21-2019 06:36 AM
Hi,
I received a minor warning on a server displaying CATERR_N:Sensor Failure Asserted.
Looking at the SEL log I can see this happening like 1-4 times a year since the server was first initialized but has now occurred multiple times in the last few days. Should I be worried?
As far as I understand, this relates to the CPU and/or the memory of the server?
ad1 | 11/20/2019 03:22:29 | CIMC | Voltage P0V75_DDR3_P1 #0x14 | Upper critical - going high | Asserted | Reading 0.72 >= Threshold 0.72 Volts ad2 | 11/20/2019 03:22:29 | CIMC | Entity presence MAIN_POWER #0x52 | Device Absent | Asserted ad3 | 11/20/2019 03:22:34 | CIMC | Processor CATERR_N #0x20 | Predictive Failure asserted | Asserted ad4 | 11/20/2019 03:22:34 | CIMC | Entity presence MAIN_POWER #0x52 | Device Present | Asserted ad5 | 11/20/2019 03:22:35 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Absent | Asserted ad6 | 11/20/2019 03:22:35 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is amber | Asserted ad7 | 11/20/2019 03:22:36 | CIMC | Processor CATERR_N #0x20 | Predictive Failure deasserted | Asserted ad8 | 11/20/2019 03:24:27 | BIOS | System Event #0x83 | OEM System Boot Event | Asserted ad9 | 11/20/2019 03:24:28 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Present | Asserted ada | 11/20/2019 03:24:37 | CIMC | Voltage P0V75_DDR3_P1 #0x14 | Upper critical - going high | Deasserted | Reading 0.69 <= Threshold 0.72 Volts adb | 11/20/2019 03:24:38 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is green | Asserted adc | 11/20/2019 03:24:55 | System Mgmt Software | OS Boot #0x00 | C: boot completed | Asserted add | 11/20/2019 03:24:55 | OEM timestamped record: Type: DC: Mfg ID: 37 01 00:Raw SEL: DD 0A DC 07 B2 D4 5D 37 01 00 00 1B C0 D4 5D 00 ade | 11/20/2019 03:36:01 | CIMC | Entity presence MAIN_POWER #0x52 | Device Absent | Asserted adf | 11/20/2019 03:36:07 | CIMC | Processor CATERR_N #0x20 | Predictive Failure asserted | Asserted ae0 | 11/20/2019 03:36:07 | CIMC | Entity presence MAIN_POWER #0x52 | Device Present | Asserted ae1 | 11/20/2019 03:36:07 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Absent | Asserted ae2 | 11/20/2019 03:36:08 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is amber | Asserted ae3 | 11/20/2019 03:36:08 | CIMC | Processor CATERR_N #0x20 | Predictive Failure deasserted | Asserted ae4 | 11/20/2019 03:36:08 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is green | Asserted ae5 | 11/20/2019 03:37:53 | BIOS | System Event #0x83 | OEM System Boot Event | Asserted ae6 | 11/20/2019 03:37:54 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Present | Asserted ae7 | 11/20/2019 03:38:20 | System Mgmt Software | OS Boot #0x00 | C: boot completed | Asserted ae8 | 11/20/2019 03:38:21 | OEM timestamped record: Type: DC: Mfg ID: 37 01 00:Raw SEL: E8 0A DC 2D B5 D4 5D 37 01 00 00 28 C3 D4 5D 00 ae9 | 11/21/2019 03:19:31 | CIMC | Entity presence MAIN_POWER #0x52 | Device Absent | Asserted aea | 11/21/2019 03:19:36 | CIMC | Processor CATERR_N #0x20 | Predictive Failure asserted | Asserted aeb | 11/21/2019 03:19:36 | CIMC | Entity presence MAIN_POWER #0x52 | Device Present | Asserted aec | 11/21/2019 03:19:36 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Absent | Asserted aed | 11/21/2019 03:19:37 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is amber | Asserted aee | 11/21/2019 03:19:37 | CIMC | Processor CATERR_N #0x20 | Predictive Failure deasserted | Asserted aef | 11/21/2019 03:19:38 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is green | Asserted af0 | 11/21/2019 03:21:22 | BIOS | System Event #0x83 | OEM System Boot Event | Asserted af1 | 11/21/2019 03:21:23 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Present | Asserted af2 | 11/21/2019 03:21:49 | System Mgmt Software | OS Boot #0x00 | C: boot completed | Asserted af3 | 11/21/2019 03:21:49 | OEM timestamped record: Type: DC: Mfg ID: 37 01 00:Raw SEL: F3 0A DC CD 02 D6 5D 37 01 00 00 CC 10 D6 5D 00 af4 | 11/21/2019 13:00:25 | CIMC | Entity presence MAIN_POWER #0x52 | Device Absent | Asserted af5 | 11/21/2019 13:00:30 | CIMC | Processor CATERR_N #0x20 | Predictive Failure asserted | Asserted af6 | 11/21/2019 13:00:30 | CIMC | Entity presence MAIN_POWER #0x52 | Device Present | Asserted af7 | 11/21/2019 13:00:30 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Absent | Asserted af8 | 11/21/2019 13:00:31 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is amber | Asserted af9 | 11/21/2019 13:00:31 | CIMC | Processor CATERR_N #0x20 | Predictive Failure deasserted | Asserted afa | 11/21/2019 13:00:31 | CIMC | Platform alert LED_BLADE_STATUS #0x62 | LED color is green | Asserted afb | 11/21/2019 13:02:21 | BIOS | System Event #0x83 | OEM System Boot Event | Asserted afc | 11/21/2019 13:02:21 | CIMC | Entity presence BIOS_POST_CMPLT #0x53 | Device Present | Asserted afd | 11/21/2019 13:02:48 | System Mgmt Software | OS Boot #0x00 | C: boot completed | Asserted afe | 11/21/2019 13:02:48 | OEM timestamped record: Type: DC: Mfg ID: 37 01 00:Raw SEL: FE 0A DC F8 8A D6 5D 37 01 00 00 F9 98 D6 5D 00
Thanks,
Simon
Solved! Go to Solution.
11-21-2019 07:25 AM - edited 11-21-2019 07:27 AM
That alert can be generated in association with DIMM issues, so it makes sense that you appear to have a batch of DIMM alerts going on around that same time.
I would open a TAC case to get the DIMM issue corrected/evaluated, and see if the CATERR_N issue disappears.
The DIMM alert may/may not be an issue, as it seems to be hitting a marginal voltage threshhold.
Kirk...
11-21-2019 07:25 AM - edited 11-21-2019 07:27 AM
That alert can be generated in association with DIMM issues, so it makes sense that you appear to have a batch of DIMM alerts going on around that same time.
I would open a TAC case to get the DIMM issue corrected/evaluated, and see if the CATERR_N issue disappears.
The DIMM alert may/may not be an issue, as it seems to be hitting a marginal voltage threshhold.
Kirk...
08-06-2021 07:38 PM
Hello Simon,
I am having the same issue with one of the Server.
Appreciate if you share that how were you able to solve this issue?
Regards,
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide