Various ESXi Hosts lose connectivity - VEM Heartbeat loss on Nexus 1000v 4.2.1 SV4a

jay.moodley
Level 1

Hi,

Could I please have some guidance?

Current setup

2 x 4507R core switches with 1 HP server cluster with pass-through modules (please see the attached diagram).

4 x ESXi hosts on the cluster.

ESXi hosts disconnect and their VMs become isolated. This only happens to one of the four ESXi hosts at a time (three keep working fine, one gets disconnected).

I checked the ports and cabling connecting to the uplink switches; no disconnects were found.

The hosts seem to disconnect for 6-30 seconds and then reconnect.

Please see the errors from the VSM log:

2013 Mar  8 14:02:25 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 5 (heartbeats lost)

2013 Mar  8 14:57:17 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 5 (heartbeats lost)

2013 Mar 11 08:03:20 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 08:04:14 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 08:04:39 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 08:33:08 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 08:38:48 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 08:39:53 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:05:19 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:07:13 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:07:40 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:10:09 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:12:18 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-PORT_DOWN: port-channel5: Ethernet7/1 is down

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-FOP_CHANGED: port-channel5: first operational port changed from Ethernet7/1 to Ethernet7/2

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-PORT_DOWN: port-channel5: Ethernet7/2 is down

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-FOP_CHANGED: port-channel5: first operational port changed from Ethernet7/2 to Ethernet7/3

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_MODULE_REMOVED: Interface Ethernet7/1 is down (module removed)

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_PORT_CHANNEL_MEMBERS_DOWN: Interface port-channel5 is down (No operational members)

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_MODULE_REMOVED: Interface Ethernet7/2 is down (module removed)

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-PORT_DOWN: port-channel5: Ethernet7/3 is down

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-PORT_DOWN: port-channel5: port-channel5 is down

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETH_PORT_CHANNEL-5-FOP_CHANGED: port-channel5: first operational port changed from Ethernet7/3 to none

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_MODULE_REMOVED: Interface Ethernet7/3 is down (module removed)

2013 Mar 11 09:07:14 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_PORT_CHANNEL_MEMBERS_DOWN: Interface port-channel5 is down (No operational members)

2013 Mar 11 09:07:15 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_INTERFACE_REMOVED: Interface Ethernet7/1 is down (Interface removed)

2013 Mar 11 09:07:15 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_INTERFACE_REMOVED: Interface Ethernet7/2 is down (Interface removed)

2013 Mar 11 09:07:15 VSM1A_KINGSMEAD_BFL %ETHPORT-5-IF_DOWN_INTERFACE_REMOVED: Interface Ethernet7/3 is down (Interface removed)

2013 Mar 11 09:07:15 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-MOD_OFFLINE: Module 7 is offline

2013 Mar 11 09:07:20 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_DETECTED: Host kmdesxprh04 detected as module 7

2013 Mar 11 09:07:25 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-MOD_ONLINE: Module 7 is online

2013 Mar 11 09:07:28 VSM1A_KINGSMEAD_BFL %VIM-5-IF_ATTACHED: Interface Vethernet19 is attached to vmk0 on port 1 of module 7 with dvport id 129
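To see which VEM is dropping most often, the heartbeat-loss lines in a log like the one above can be tallied with a short script. This is only a minimal sketch: it assumes the log has been saved as plain text in the same format as the excerpt, and the names `heartbeat_losses`, `losses_per_vem` and `SAMPLE` are made up for illustration, not part of any Cisco tool.

```python
import re
from collections import Counter
from datetime import datetime

# Matches VSM syslog lines such as:
# 2013 Mar  8 14:02:25 <hostname> %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 5 (heartbeats lost)
HB_LOSS = re.compile(
    r"^(\d{4}) (\w{3})\s+(\d{1,2}) (\d{2}:\d{2}:\d{2}) \S+ "
    r"%VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM (\d+)"
)


def heartbeat_losses(log_text):
    """Return a time-sorted list of (timestamp, vem_number) for each heartbeat-loss line."""
    events = []
    for line in log_text.splitlines():
        match = HB_LOSS.match(line.strip())
        if match:
            year, month, day, clock, vem = match.groups()
            stamp = datetime.strptime(
                f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S"
            )
            events.append((stamp, int(vem)))
    return sorted(events)


def losses_per_vem(events):
    """Count how many heartbeat-loss removals were logged for each VEM."""
    return Counter(vem for _, vem in events)


# A few lines copied from the log above (illustrative input only).
SAMPLE = """\
2013 Mar  8 14:02:25 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 5 (heartbeats lost)
2013 Mar 11 08:03:20 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)
2013 Mar 11 08:04:14 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-VEM_MGR_REMOVE_NO_HB: Removing VEM 7 (heartbeats lost)
2013 Mar 11 09:07:15 VSM1A_KINGSMEAD_BFL %VEM_MGR-2-MOD_OFFLINE: Module 7 is offline
"""

if __name__ == "__main__":
    for vem, count in losses_per_vem(heartbeat_losses(SAMPLE)).items():
        print(f"VEM {vem}: {count} heartbeat-loss removals")
```

Run against the full excerpt above, this would count 2 removals for VEM 5 and 11 for VEM 7. Lines with other message types, such as MOD_OFFLINE, are ignored.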

===============================================================================================================

UPLINK config

Current configuration : 333 bytes

!

switchport trunk encapsulation dot1q

switchport trunk allowed vlan 2-4094

switchport mode trunk

speed 1000

duplex full

qos trust dscp

no snmp trap link-status

spanning-tree portfast

spanning-tree bpduguard enable

end

===============================================================================================================

The disconnects randomly affect all of the ESXi hosts, but only one host at a time.

Message was edited by: Jayendra Moodley

Robert Burns
Cisco Employee

Please attach your VSM config and the 4500 port config for the uplinks. (If they're all identical, just include one.)

Thanks,

Robert

Hi Robert,

I have added the VSM config to the discussion.

Please also paste the relevant 4500 interface config for the ports the VEMs uplink to. If they're all identical, one will do.

Thanks,

Robert

admin11111
Level 4

Dear all,

Did you find a resolution to this issue?

Thanks