07-15-2008 08:11 AM - edited 03-06-2019 12:12 AM
Hi,
Was wondering if someone could help me - I have a customer with 2 x 6500 / sup 720's (single sup in each). They have both recently started to display the following errors in the log and then reboot:
4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 30 seconds [5/1]
4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 60 seconds [5/1]
4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 90 seconds [5/1]
4d22h: SP: icc_send_request_internal: ipc_send_rpc_blocked failed, result 6 : ios-base : (PID=12311, TID=16) : -Traceback=(s72033-ipservicesk9_wan-4-dso-b.so+0x236B5C) ([34:0]+0x236DC8) ([34:0]+0x2371D0) ([34:0]+0x488530) ([34:0]+0x488D08)
4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 120 seconds [5/1]
4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 150 seconds [5/1]
4d22h: SP: IPC: Message 2834D8C timed out waiting for Ack
4d22h: SP: IPC: MSG: ptr: 0x2834D8C, flags: 0x20101, retries: 21, seq: 0x2159408, refcount: 2, retry: 00:00:00, rpc_result = 0x0, data_buffer = 0x6C703888, header = 0x76E98B0, data = 0x76E98D0 || HDR: src: 0x10000, dst: 0x2150014, index: 0, seq: 37896, sz: 80, type: 1, flags: 0x404 hi: 0x50414F, lo: 0x76E98D0 || DATA: 00 00 00 05 00 00 00 00 00 00 03 E9 00 00 00 00 00 00 00 15
4d22h: SP: IPC: Send failed: IPC msg timeout MSG: ptr: 0x2834D8C, flags: 0x20101, retries: 21, seq: 0x2159408, refcount: 2, retry: 00:00:00, rpc_result = 0x0, data_buffer = 0x6C703888, header = 0x76E98B0, data = 0x76E98D0 || HDR: src: 0x10000, dst: 0x2150014, index: 0, seq: 37896, sz: 80, type: 1, flags: 0x404 hi: 0x50414F, lo: 0x76E98D0 || DATA: 00 00 00 05 00 00 00 00 00 00 03 E9 00 00 00 00 00 00 00 15 : ios-base : (PID=12311, TID=16) : -Traceback=(s72033-ipservicesk9_wan-3-dso-b.so+0x38C0AC) ([34:0]+0x391D30) ([34:0]+0x392644) ([34:0]+0x3A4FEC) ([24:-9]6+0x116F54) ([34:0]+0x116F2C)
4d22h: %C6K_PROCMIB-SP-3-IPC_TRANSMIT_FAIL: Failed to send process statistics update : error code = timeout : ios-base : (PID=12311, TID=16) : -Traceback=(s72033-ipservicesk9_wan-6-dso-b.so+0x152080) ([34:0]+0x1520F0) ([34:0]+0x1523CC) ([34:0]+0x1526E8) ([34:0]+0x116F54) ([34:0]+0x116F2C)
4d22h: %CPU_MONITOR-SP-3-TIMED_OUT: CPU_MONITOR messages have failed, resetting system [5/1]
Crashdump : 11:05:14.221 Tue Jul 15 2008 : ios-base : (PID=12311, TID=16) : -Traceback=(s72033-ipservicesk9_wan-3-dso-b.so+0x24AD38) ([24:-9]5+0x38E1FC) ([34:0]+0x3905A8) ([24:-9]6+0x116F54) ([34:0]+0x116F2C)
crashdump called (with pause = 0 sec)
%ALIGN-1-FATAL: Illegal access to a low address 12:05:14 BST Tue Jul 15 2008
addr=0x0, pc=0x72A7F28C, ra=0x72A7F1BC, sp=0x357EBC8
If anybody has seen these logs before, it would be great to hear from you 🙂
Thanks
Andy
07-15-2008 09:07 AM
hi!
There seems to be a problem with the internal communication with the SP (switch processor). "IPC" here refers to inter-process communication. The loss of this internal communication results in a forced crash, which is the CPU_MONITOR-SP-3-TIMED_OUT reset at the end of your log.
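If the full log is much longer than the excerpt above, it can help to pull out just the CPU_MONITOR escalation before attaching it to a case. Below is a minimal Python sketch of that, assuming the syslog lines have the same shape as the ones pasted in this thread. The names `parse_syslog` and `silence_seconds` are made up for illustration and are not part of any Cisco tool.

```python
import re

# Assumed line shape, taken from the log pasted in this thread:
#   4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages ... [5/1]
# The uptime prefix and the "-SP" source tag are both treated as optional.
SYSLOG_RE = re.compile(
    r"^(?:(?P<uptime>\S+): )?"
    r"%(?P<facility>[A-Z0-9_]+)"
    r"(?:-(?P<source>[A-Z]+))?"
    r"-(?P<severity>[0-7])"
    r"-(?P<mnemonic>[A-Z0-9_]+): "
    r"(?P<text>.*)$"
)


def parse_syslog(line):
    """Split one syslog line into its fields, or return None if it does not match."""
    match = SYSLOG_RE.match(line.strip())
    if match is None:
        return None
    record = match.groupdict()
    record["severity"] = int(record["severity"])
    return record


def silence_seconds(lines):
    """Collect the 'not heard for N seconds' values from CPU_MONITOR NOT_HEARD lines."""
    seconds = []
    for line in lines:
        record = parse_syslog(line)
        if record is None:
            continue  # IPC debug lines, tracebacks, and so on
        if record["facility"] == "CPU_MONITOR" and record["mnemonic"] == "NOT_HEARD":
            found = re.search(r"for (\d+) seconds", record["text"])
            if found:
                seconds.append(int(found.group(1)))
    return seconds


# A few lines copied from the original post.
sample = [
    "4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 30 seconds [5/1]",
    "4d22h: SP: IPC: Message 2834D8C timed out waiting for Ack",
    "4d22h: %CPU_MONITOR-SP-6-NOT_HEARD: CPU_MONITOR messages have not been heard for 150 seconds [5/1]",
    "4d22h: %CPU_MONITOR-SP-3-TIMED_OUT: CPU_MONITOR messages have failed, resetting system [5/1]",
]

if __name__ == "__main__":
    print(silence_seconds(sample))
```

This only summarises what is already in the log. It does not explain why the messages stopped, so the crash dump is still needed.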
You will have to open a TAC case for this and provide the crash dump, which would have been created on bootflash: or sup-bootflash:. That would give more details. It is not possible to know the exact reason from the logs you have pasted.
Was this issue first seen after any major config changes or an image upgrade?
Regards,
Niranjan