We are looking at deploying a pair of HA FMC 4100's into our environment but I am finding HA rather unstable and looking for advice.
My environment is a follows (development as I have never used FirePower before)
2 x FMC 4100 running 6.2.3.10
1 x 7125 running 6.2.3.10
All connected to a layer 3 switch using their management interfaces.
These are the steps I am performing
1. Install and patch Primary to 6.2.3.10
2. Install and patch Secondary to 6.2.3.10
3. Establish HA pair
Everything works up to this point HA is correctly established and health check is green.
After this point one of two issues have occurred
- If I pause synchronisation (which you need to do perform future patching) and then resume synchronisation (done from the primary and active FMC) the synchronisation fails during the copying large files stage (this stage takes 30 minutes to run and then fails). Once the synchronisation has failed I have found no way to successfully recover from it and if I break the HA (from GUI or CLI) it leaves the Secondary stuck in the Starting System Process stage and I have to re-image it to recover.
- When I register the 7125 it sometimes does not automatically get registered with the secondary and I get the error message that the secondary has less devices than the primary and there seems no way to recover from this. Deleting and re-adding the device doesn't resolve the issue which I have done from the primary and on the device itself (as detailed in the FirePower configuration guide)
Has anyone experienced these issue before or is there something that I am missing? When it works it works as expected but as soon as I get synchronisation errors it results in having to re-image