Hello all,
Context/Description:
I have a C6840-X-LE-40G reporting a "high temperature" for the EARL. In Zabbix, I received an alarm because the temperature was higher than 70°C.
#sh environment temperature
...
switch 1 module 1 EARL 1 device-1 temperature: 72C
switch 1 module 1 EARL 1 device-2 temperature: 72C
switch 1 module 1 EARL 2 device-1 temperature: 75C
switch 1 module 1 EARL 2 device-2 temperature: 73C
switch 1 module 1 EARL 3 device-1 temperature: 71C
switch 1 module 1 EARL 3 device-2 temperature: 69C
However, the switch's own alarm thresholds show the following values for the minor and major alarms:
#sh environment alarm thresholds | sec EARL
switch 1 module 1 EARL 1 device-1 temperature: 71C
threshold #1 for switch 1 module 1 EARL 1 device-1 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 1 device-1 temperature:
(sensor value >= 125C) is system major alarm
switch 1 module 1 EARL 1 device-2 temperature: 71C
threshold #1 for switch 1 module 1 EARL 1 device-2 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 1 device-2 temperature:
(sensor value >= 125C) is system major alarm
switch 1 module 1 EARL 2 device-1 temperature: 75C
threshold #1 for switch 1 module 1 EARL 2 device-1 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 2 device-1 temperature:
(sensor value >= 125C) is system major alarm
switch 1 module 1 EARL 2 device-2 temperature: 73C
threshold #1 for switch 1 module 1 EARL 2 device-2 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 2 device-2 temperature:
(sensor value >= 125C) is system major alarm
switch 1 module 1 EARL 3 device-1 temperature: 72C
threshold #1 for switch 1 module 1 EARL 3 device-1 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 3 device-1 temperature:
(sensor value >= 125C) is system major alarm
switch 1 module 1 EARL 3 device-2 temperature: 68C
threshold #1 for switch 1 module 1 EARL 3 device-2 temperature:
(sensor value >= 110C) is system minor alarm
threshold #2 for switch 1 module 1 EARL 3 device-2 temperature:
(sensor value >= 125C) is system major alarm
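To put the gap in numbers, here is a rough Python sketch that takes the readings pasted above and computes how far each sensor sits below the 110C minor and 125C major thresholds reported by the switch. The helper names (`parse_readings`, `headroom`) are hypothetical and the regular expression only assumes the line format shown in this post; it is not a Cisco or Zabbix tool.

```python
import re

# Readings copied from the "sh environment temperature" output in this post.
READINGS = """\
switch 1 module 1 EARL 1 device-1 temperature: 72C
switch 1 module 1 EARL 1 device-2 temperature: 72C
switch 1 module 1 EARL 2 device-1 temperature: 75C
switch 1 module 1 EARL 2 device-2 temperature: 73C
switch 1 module 1 EARL 3 device-1 temperature: 71C
switch 1 module 1 EARL 3 device-2 temperature: 69C
"""

MINOR_C = 110          # minor alarm threshold reported by the switch
MAJOR_C = 125          # major alarm threshold reported by the switch
ZABBIX_TRIGGER_C = 70  # trigger level currently used in Zabbix

# Assumed line format: "<sensor name> temperature: <value>C"
LINE_RE = re.compile(
    r"^(?P<sensor>switch \d+ module \d+ EARL \d+ device-\d+) "
    r"temperature: (?P<temp>\d+)C$"
)


def parse_readings(text):
    """Return {sensor name: temperature in C} for every matching line."""
    readings = {}
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            readings[match.group("sensor")] = int(match.group("temp"))
    return readings


def headroom(readings, limit_c):
    """Return {sensor name: degrees C remaining before limit_c is reached}."""
    return {sensor: limit_c - temp for sensor, temp in readings.items()}


if __name__ == "__main__":
    readings = parse_readings(READINGS)
    minor = headroom(readings, MINOR_C)
    major = headroom(readings, MAJOR_C)
    for sensor, temp in readings.items():
        flag = "above" if temp > ZABBIX_TRIGGER_C else "at or below"
        print(
            f"{sensor}: {temp}C ({flag} Zabbix trigger), "
            f"{minor[sensor]}C below minor, {major[sensor]}C below major"
        )
```

With these readings, the hottest sensor (EARL 2 device-1 at 75C) is still 35C below the switch's minor alarm level, which is why the 70°C Zabbix trigger looks very conservative compared with the device's own thresholds.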
Question:
Which thresholds should I take into consideration?
Moreover, in the documentation "Catalyst 6840-X Switch Series - Switch Specifications", I didn't find anything regarding the temperature for the EARL. The information given seems to be for the ambient environment and not for the chips or ASIC components.
Is there any official information about the temperature limits we should follow to prevent early hardware failure?
Thanks & Regards,
Alexandre