08-29-2014 05:44 PM
I need help for monitoring the redundant power supply of my switches (3750x and 3560x)
I have configured my snmp on my switches and is able to monitor interfaces, temperature and etc.
The problem is when I am monitoring the redundant power supply and fans.
I am using two Cisco C3KX-PWR-350WAC Modular Power Supply per switch.
The OID I use to monitor the PSU are the following:
Power Supply 1 .1.3.6.1.4.1.9.9.13.1.5.1.3 Instance: 1003
Power Supply 2 .1.3.6.1.4.1.9.9.13.1.5.1.3 Instance: 1004
When I do an SNMP walk, the value is showing (1) which means normal.
When I simulate a power outage on power supply 1 or 2 by unplugging the power source, the value doesn't change and is still (1).
Values: normal(1), warning(2), critical(3), shutdown(4), notPresent(5)
As for the Fans:
When I remove the fan module, I am able to get a notification that the fans are down, but when I plug back the fan module it is now using a different instance.
Below is my config for SNMP:
snmp-server community XXXXXXXXXXXXXX RO
snmp-server location XXXXXX
snmp-server contact XXXXXX
snmp-server chassis-id Cisco3560X
snmp-server enable traps snmp authentication linkdown linkup coldstart warmstart
snmp-server enable traps transceiver all
snmp-server enable traps tty
snmp-server enable traps license
snmp-server enable traps cpu threshold
snmp-server enable traps syslog
snmp-server enable traps vlancreate
snmp-server enable traps vlandelete
snmp-server enable traps port-security
snmp-server enable traps envmon fan shutdown supply temperature status
snmp-server enable traps errdisable
snmp-server enable traps vlan-membership
snmp-server host X.X.X.X version 2c XXXXXXXXXXXXXX
Note: This problem is the same for all of my 3560X switches and 3750X switches. I am not sure if I am monitoring the wrong OID.
Thanks in advance
Solved! Go to Solution.
08-30-2014 01:07 AM
Hi ,
You are running an old IOS code , definitely you are hitting a BUG .
there is no workaround or I would say unfirtunetely you will not be able to get the correct power supply status via SNMP.
you need to use command line for this.
I mean "show env all "
**** if you want to know the Excat BUG that your code is hitting ,please contact TAC.
Thanks-
Afroz
**Ratings Encourages Contributors ****
08-30-2014 01:19 AM
most of them are fixed in latest IOS code , you check internally and plan for latest IOS upgrade.
hopefully it will work fine then..
Thanks-
Afroz
***ratings Encourages Contributors ***
08-29-2014 10:04 PM
Hi ,
the OID you are using are correct , there are quite a known issue with monitoring RPS via SNMP.
CSCud01798 "FRU Power Supply is not responding" was seen unexpectedly
CSCsq77660 ciscoEnvMonSupplyState returns notPresent
I would suggest to use CLI to check the power supply status.
Thanks-
Afroz
***Ratings Encourages Contributors ****
08-29-2014 11:33 PM
Hi Afrahmad,
When I use " show environment all "
The state of the RPS seems to be reflected.
I tried removing the power cable for PSU1 and ran " show environment all "
It is properly showing that PSU1 is down/ faulty.
Also I would like to add that the correct traps are also being sent out by the Switch and are received by the NMS.
So I am wondering if there are other OID which I can use to monitor my Redundant PSU and Modular Fans.
08-29-2014 11:50 PM
Hi ,
try this
Object ciscoEnvMonSupplyStatusDescr OID 1.3.6.1.4.1.9.9.13.1.5.1.2
As I said ,this is a buggy behaviour , you might not be able to see the correct information via SNMP
workaround: use CLI commands
Thanks-
Afroz
***Ratings Encourages Contributors ***
08-30-2014 12:25 AM
Hi Afrahmad,
I am confused. Is there a way to use CLI for NMS?
Can you enlighten me regarding this matter, I am not so experienced with monitoring system.
Your help is highly appreciated.
08-30-2014 12:27 AM
what is the IOS code you are running ?
08-30-2014 12:58 AM
Currently I am running the ff:
12.2(58)SE2 for the 3750x
12.2(53)SE2 for the 3560X
Sorry for the late response. I needed to ran back to the rack room
08-30-2014 01:07 AM
Hi ,
You are running an old IOS code , definitely you are hitting a BUG .
there is no workaround or I would say unfirtunetely you will not be able to get the correct power supply status via SNMP.
you need to use command line for this.
I mean "show env all "
**** if you want to know the Excat BUG that your code is hitting ,please contact TAC.
Thanks-
Afroz
**Ratings Encourages Contributors ****
08-30-2014 01:17 AM
Will this be fixed if I upgrade the firmware?
Thanks for all the help
08-30-2014 01:19 AM
most of them are fixed in latest IOS code , you check internally and plan for latest IOS upgrade.
hopefully it will work fine then..
Thanks-
Afroz
***ratings Encourages Contributors ***
09-03-2014 09:47 PM
Hi Afrahmad,
Okay, I shall try upgrading one of my switches and hopefully it fixes the bugs.
Thank you very much for all your help.
Have a great day!
Franz
UPDATE:
I have upgraded the firmware from IOS 12.2(53)SE2 to 15.02-SE6 and now I am able to get the proper status of the PSU and FAN.
But still once I remove the Module, the instance still changes.
Maybe that is how cisco designed the FRU modules.
Discover and save your favorite ideas. Come back to expert answers, step-by-step guides, recent topics, and more.
New here? Get started with these tips. How to use Community New member guide