[j-nsp] strange problem on chassis cluster

Matthias Brumm Sat, 04 Sep 2010 04:33:02 -0700

Hi!

We have a very strange problem on two chassis clusters running 10.0R3.10 (we will try updating to R4.7 today).


One chassis cluster (2x J6350) is our main system.
The other (2x J4350) is located at our customer's site.

The two clusters speak BGP with each other. For the customer system, this is the only BGP session. Our main system has a full BGP mesh to our other locations and edge systems. To understand the problem, this can be reduced to three BGP sessions:


A) BGP session to AMS-IX over VLAN 1
B) BGP session to ECIX over VLAN 1
C) BGP session to ECIX over VLAN 2

Two switches are involved. VLAN 1 is configured on both switches to make it available in Amsterdam and Düsseldorf. VLAN 2 is configured only on the switch facing Düsseldorf, as a backup in case the first switch is dead.

The day before yesterday, I started two pings to the ECIX router: one from my local workstation, the other from the main cluster.

If I configure something on the redundant interfaces, then as soon as I commit, the first ping stays normal while the second jumps to +30ms (normally around 6ms). 2-3 minutes later, both pings stop and the BGP session drops. This is the only BGP session that drops, due to hold time expiration. After a few minutes, the pings and the BGP session come back. Every other BGP session, even the one to Düsseldorf over VLAN 2, stays up.
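
To make the timing of such a drop easier to reason about, here is a minimal sketch of the BGP hold timer logic: the timer restarts whenever a KEEPALIVE or UPDATE arrives from the peer, and the session is torn down once it runs out. The function names are hypothetical, and the 90-second hold time and 30-second keepalive interval are assumptions (the values configured on these clusters are not stated here).

```python
from typing import Optional, Sequence


def hold_timer_expiry(message_times: Sequence[float],
                      hold_time: float = 90.0) -> Optional[float]:
    """Return the time (seconds) at which the hold timer would expire.

    message_times: arrival times of KEEPALIVE/UPDATE messages from the peer.
    hold_time: negotiated hold time; 90 s is an assumed value.
    Returns None if no message has been received yet.
    """
    if not message_times:
        return None
    # The timer restarts on every received message, so only the
    # most recent arrival matters.
    return max(message_times) + hold_time


def session_dropped(message_times: Sequence[float], now: float,
                    hold_time: float = 90.0) -> bool:
    """True if the hold timer has expired by time `now`."""
    expiry = hold_timer_expiry(message_times, hold_time)
    return expiry is not None and now >= expiry


# Assumed scenario: keepalives every 30 s, the last one arriving at t=150 s,
# after which the path stops forwarding traffic.
keepalives = [0, 30, 60, 90, 120, 150]
expiry = hold_timer_expiry(keepalives)
```

With these assumed values the session would be declared down 90 seconds after the last message got through, so comparing the time the pings stop with the time the session drops gives a hint as to whether BGP traffic was already being lost before the pings failed.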

I then moved the main load to Düsseldorf onto VLAN 2. This time, that BGP session was dropped, while the other stayed up. The session to Düsseldorf carries the main load, with around 260000 prefixes.


Matthias
_______________________________________________
juniper-nsp mailing list juniper-nsp@puck.nether.net
https://puck.nether.net/mailman/listinfo/juniper-nsp
