cancel
Showing results for 
Search instead for 
Did you mean: 
cancel
1072
Views
0
Helpful
2
Replies

Heart Beat Communication Breaks while moving clustered VM to another ESX host

s.aliyarukunju
Level 1
Level 1

Dear All,

Let me explain the existing situation and then will come to problem description.

We do have Virtual distribution switch with 3 ESX box connected. And also we have two virtual connect (VC) , from which the etherchannels are formed between Access switches.VM's are configured in each ESX box.

The logical connection of 2 ESX to access switch is as follows.

                                          Portchannel

ESX-A 1st NIC --------> VC1============>Access Switch 1                                     

                                          Portchannel

ESX-A 2nd NIC -------> VC2============>Access Switch 2

                                         Portchannel

ESX-B 1st NIC --------> VC1===========>Access Switch 1

                                        Portchannel

ESX-B 2nd NIC -------> VC2===========>Access Switch 2

                                         Portchannel

ESX-C 1st NIC --------> VC1===========>Access Switch 1

                                        Portchannel

ESX-C 2nd NIC -------> VC2===========>Access Switch 2

There are two VM's configured in one of the ESX box with clustering.The heartbeat vlan is created and is allowed on portchannels ( dot1q trunk). The 2 VM's can able to ping their heartbeat IP without any issue when they are hosting on same ESX box ( eg ESX-A) .But when the system admin move one of their VM to another ESX ( ESX-B) , the heart beat IP reachabily is breaking and hence loose the clustering between servers.

This situation is different on some times.Two VMs, one in ESX-A and second in ESX-B can reach the heartbeat IP between them without any issue. But when one VM move from ESX-B to ESX-C , they loose the heartbeat connection.

In general , the only scenario where the heartbeat never breaks is hosting the clustered VMs on same ESX box.And also an established HB connection is breaking when VM is moving between the ESX.

Actually we are in process to deploy the High avalabiliy solution and migration of VMs among the ESX is highly required.

Clearing an arp might solve this issue.But i dont want to disturb the production network , as the portchannel on access swithes are configured to allow other production vlans.

It would be really appreciated , if someone could provide any solution/workaround for this issue.

Kind Regards

Shiji

2 Replies 2

zmeng
Cisco Employee
Cisco Employee

Hi Shiji,

A couple of questions:

1. what's kind of traffic used by heartbeat?  Unitcast? Multicast? or Broadcast?

2. What's type and mode of access switch? and version.

3. what's arp table like when heartbeat broken?

4. what's the guest OS , are they running with MS NLB? if so, what's NLB mode.

With those information, we can investigate further.

BR,

John Meng

Dear John ,

Many thanks for your attempt to help.

Actually , here they did not configure the NLB yet and we are in the intial phase of establishing heart beat communication between VMs.

Finally is the issue is solved. Since the access swithes doesnt have the port channels configured between , i need to allow the HB vlans to the etherchannel trunk that is connecting to the core/distribution layer.

Thanks again.

Best Regards

Shiji

Getting Started

Find answers to your questions by entering keywords or phrases in the Search bar above. New here? Use these resources to familiarize yourself with the community: