I am working at location where we have implemented Cisco ACI. Our idea was to replace all of the switching domains with a single fabric, that would be 2 data centers plus an edge zone (WAN, VPN, Internet, Collaboration) each zone interconnected/separated with a layer 3 core.
Whilst the concept of a single fabric is appealing, I am rather worried about having a single failure domain, specifically human error to make an error with "one click" that could be affect the whole fabric.
We already had a couple of incidents affecting FCoE for a large of number of servers, it was not pleasant to say this least.
Has anyone had similar challenges? can share any feedback or best practice how to segment the fabric to create failure domains, from administrative and technical point of view.