ASR9000/XR: Multichassis LAG or MC-LAG (MCLAG) guide

xthuijs · 07-15-2013

Introduction
Overview
- Port Status
Configuration
Switchover
Troubleshooting
Events and Scenarios
mLACP Synchronization
NAKing mLACP Messages
Syslog messages
Recovering from failures
Simple quick config blocks
Related Information

Introduction

Multichassis LAG is a tricky concept. In general, the members of a bundle (also called LAG, Link Aggregation Group, EtherChannel or port channel) run between two distinct devices. The advantage of a bundle is that there is a single routing peering and no need to worry about spanning tree and things like that. However, redundancy is compromised when either one of the peers fails. Using ECMP (Equal Cost Multipath) in L3 scenarios allows me to dual-home to two different devices, so I still have a backup when one of the peers fails, but that negates the bundle's benefit of a single routing peering.

MC-LAG provides a means to dual-home a device (the DHD, or Dual-Homed Device) to two different peer devices (the POAs, or Points of Attachment). This gives me the benefits of node redundancy while maintaining a single peering, which makes my L2 (Spanning Tree/STP) or L3 (no dual peerings) life a lot easier.

Does it come with restrictions? Of course! It's technology, nothing comes for free! So in this document we will highlight how to set it up, which restrictions you need to be aware of, and how to troubleshoot and verify MC-LAG scenarios.

Overview

MC-LAG and ICCP enable a switch/router to use standard Ethernet Link Aggregation for device dual-homing, with active/standby redundancy.

The Dual-Homed Device (DHD) operates as if it is connected to a single virtual device and runs IEEE Std 802.1AX-2008 (LACP).

The Point of Attachment (PoA) nodes run the Inter-Chassis Communication Protocol (ICCP) to synchronize state and form a Redundancy Group (RG).


The idea is to let the peer device feel that it is connected to a single device, which means information must be synchronized between the two PoAs.

MC-LAG uses ICCP to synchronize LACP configuration and operational state between the PoAs, giving the DHD the perception of being connected to a single switch. All PoAs use the same System MAC Address and System Priority when communicating with the DHD; these values are either configured or automatically synchronized via ICCP.

Every PoA in the RG is configured with a unique Node ID (value 0 to 7). Node ID + 8 forms the most significant nibble of the Port Number.

For a given bundle, all links on the same PoA must have the same Port Priority.
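
To make the Node ID and Port Priority rules above a bit more concrete, here is a minimal Python sketch. It is an illustration only, not how the router implements it: the function names are made up for this example, and the assumption that the lower 12 bits of the 16-bit LACP port number carry a per-node link index is mine, since the text above only defines the most significant nibble.

```python
def mlacp_port_number(node_id, link_index):
    """Build a 16-bit LACP port number whose top nibble is node_id + 8.

    link_index is a hypothetical per-node link number placed in the
    lower 12 bits (an assumption made for this illustration).
    """
    if not 0 <= node_id <= 7:
        raise ValueError("mLACP Node ID must be in the range 0..7")
    if not 0 <= link_index <= 0x0FFF:
        raise ValueError("link_index must fit in the lower 12 bits")
    return ((node_id + 8) << 12) | link_index


def node_id_from_port_number(port_number):
    """Recover the Node ID from the most significant nibble."""
    top_nibble = (port_number >> 12) & 0xF
    if top_nibble < 8:
        raise ValueError("top nibble below 8, not built with Node ID + 8")
    return top_nibble - 8


def same_port_priority(priorities):
    """True when all links of a bundle on one PoA share one Port Priority."""
    return len(set(priorities)) <= 1


print(hex(mlacp_port_number(1, 1)))      # 0x9001
print(hex(mlacp_port_number(2, 1)))      # 0xa001
print(node_id_from_port_number(0xA001))  # 2
```

Because Node ID + 8 always lands in the range 8 to 15, the top nibble of the port number is 0x8 through 0xF, so the two PoAs never present overlapping port numbers to the DHD as long as their Node IDs differ.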
