10-18-2011 12:34 AM - edited 03-07-2019 02:52 AM
Recently, one WS-X6748-GE-TX module in our 6509E switch suddenly started resetting on its own, about three times at roughly 10-minute intervals.
We had to move the server connected to that module to another one.
After that we ran some tests:
1) When we reset the module, we get:
00:00:27: %SYS-DFC1-5-RESTART: System restarted --
Cisco IOS Software, c6lc2 Software (c6lc2-SP-M), Version 12.2(33)SXH4, RELEASE SOFTWARE (fc1)
Technical Support: http://www.cisco.com/techsupport
Copyright (c) 1986-2008 by Cisco Systems, Inc.
Compiled Mon 10-Nov-08 08:30 by prod_rel_team
00:00:27: DFC1: Currently running ROMMON from S (Gold) region
Oct 18 11:13:42.231 GMT: %DIAG-SP-6-RUN_MINIMUM: Module 1: Running Minimal Diagnostics...
Oct 18 11:14:07.965 GMT: %DIAG-SP-6-DIAG_OK: Module 1: Passed Online Diagnostics
Oct 18 11:14:08.437 GMT: %OIR-SP-6-DOWNGRADE_EARL: Module 1 DFC installed is not identical to system PFC and will perform at current system operating mode.
Oct 18 11:14:33.853 GMT: %SNMP-5-MODULETRAP: Module 1 [Up] Trap
Oct 18 11:14:33.857 GMT: %OIR-SP-6-INSCARD: Card inserted in slot 1, interfaces are now online
I did some searching, and "OIR-SP-6-DOWNGRADE_EARL" should be OK:
https://supportforums.cisco.com/docs/DOC-5156
2) We ran some show diagnostic commands:
S6509-02#sh diagnostic events module 1 ?
| Output modifiers
<cr>
S6509-02#sh diagnostic events module 1
Diagnostic events (storage for 500 events, 32 events recorded)
Event Type (ET): I - Info, W - Warning, E - Error
Time Stamp ET [Card] Event Message
------------------ -- ------ --------------------------------------------------
10/14 10:38:07.853 E [1] online_diag_flush_pak_queue: flushing a packet from [1/13] when testing [1/-1/13]
10/14 10:38:15.913 E [1] online_diag_flush_pak_queue: flushing a packet from [1/37] when testing [1/-1/37]
10/14 10:38:29.057 I [1] Diagnostics Passed
10/14 10:48:13.707 E [1] TestSynchedFabChannel Failed
10/14 10:48:13.707 E [1] test_synched_fabchan_health[1]: diag_get_fabric_status failed
10/14 10:48:13.711 E [1] diag_pf_const2_test_has_failed[1/-1]: reg_invoke_diag_get_test_attribute_by_name failed
10/14 10:49:55.949 E [1] check_ether_packet [1/45]: newpak is NULL!
10/14 10:50:06.881 I [1] Diagnostics Passed
10/18 11:14:07.961 I [1] Diagnostics Passed
10/18 11:14:38.433 E [1] TestErrorCounterMonitor: ID:48 IN:1 PO:255 RE:84 RM:255 DV:65535 EG:2 CF:1 TF:1
S6509-02#sh diagnostic events module 2
Diagnostic events (storage for 500 events, 32 events recorded)
Event Type (ET): I - Info, W - Warning, E - Error
Time Stamp ET [Card] Event Message
------------------ -- ------ --------------------------------------------------
10/14 10:39:31.577 E [2] online_diag_flush_pak_queue: flushing a packet from [2/13] when testing [2/-1/13]
10/14 10:39:35.585 E [2] online_diag_flush_pak_queue: flushing a packet from [2/25] when testing [2/-1/25]
10/14 10:39:51.733 I [2] Diagnostics Passed
10/14 10:40:22.217 E [2] TestErrorCounterMonitor: ID:48 IN:1 PO:255 RE:21 RM:255 DV:2 EG:2 CF:1 TF:1
We have a WS-X6748-GE-TX in both slot 1 and slot 2, and we see different results from the same command: some tests fail on module 1 only.
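To compare the two modules side by side, the event output above could be tallied with a small script. This is only a rough sketch, assuming the text of `sh diagnostic events module N` has been saved or pasted as a string. The names `EVENT_RE`, `parse_events`, `summarize` and `SAMPLE` are made up for illustration and are not part of any Cisco tool. Wrapped continuation lines from the console are simply skipped, so the counts are per event, not per line.

```python
import re
from collections import Counter

# Hypothetical pattern for one event line of "show diagnostic events", e.g.
#   10/14 10:48:13.707 E [1] TestSynchedFabChannel Failed
EVENT_RE = re.compile(
    r"^(?P<stamp>\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d{3})\s+"
    r"(?P<etype>[IWE])\s+\[(?P<card>\d+)\]\s+(?P<msg>.*)$"
)


def parse_events(text):
    """Return one dict per event line; lines that do not start with a
    timestamp (headers, wrapped continuations) are ignored."""
    events = []
    for line in text.splitlines():
        match = EVENT_RE.match(line.strip())
        if match:
            events.append(match.groupdict())
    return events


def summarize(text):
    """Count events per (module number, event type) pair."""
    counts = Counter()
    for event in parse_events(text):
        counts[(int(event["card"]), event["etype"])] += 1
    return counts


# A few lines copied from the output in this post, including one wrapped line.
SAMPLE = """\
10/14 10:48:13.707 E [1] TestSynchedFabChannel Failed
10/14 10:50:06.881 I [1] Diagnostics Passed
10/14 10:39:51.733 I [2] Diagnostics Passed
10/14 10:40:22.217 E [2] TestErrorCounterMonitor: ID:48 IN:1 PO:255 RE:21
RM:255 DV:2 EG:2 CF:1 TF:1
"""

if __name__ == "__main__":
    for (card, etype), n in sorted(summarize(SAMPLE).items()):
        print(f"module {card}: type {etype} x {n}")
```

Running the same function over the full output of both modules gives a quick per-module count of I/W/E events, which makes it easier to see that the fabric-related errors appear on module 1 only.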
I would appreciate your help
duyen.
10-18-2011 03:16 AM
Hi,
How many supervisors do you have in the chassis?
Check if you can switch between the active and standby supervisor. Then check the Diag Test Results for both modules.
Hope this helps.
10-18-2011 03:33 AM
Hi,
We have two Sups. Could you explain the problem?
Thank you,
Duyen.
10-18-2011 03:46 AM
Hi Duyen,
We have observed some fabric tests fail on certain modules. After a switchover between the active and standby SUP, the diagnostic tests no longer failed.
We could not determine the exact reason for the behavior, but the problem was attributed to some misbehavior on the active SUP.