Hey everyone,
Yesterday we reviewed the whole situation together with Juniper support.
According to them, the overall setup and configuration on the Juniper side
looks correct. However, they suspect that the issue is related to how FRR
advertises the EVPN routes.
Specifically, the Type-2 routes received from FRR do not include the VNI in the
route itself (see output below – EthTag/VNI is shown as "0"). The switch
appears to recognize the VNI as a route label and learns the MAC addresses, but
due to the missing VNI field it does not properly associate the MAC entries
with the EVPN MAC-IP table. This would also explain why communication works
locally on the same switch, but not across EVPN/VXLAN between the servers.
The interesting part is that the VNI is shown correctly for the MAC address
"48:a9:8a:ef:75:73". However, this device is not connected via VXLAN directly —
it is a MikroTik board connected through a regular Layer-2 VLAN which is mapped
into the VNI on the EX4650 side. Communication between the VMs and the MikroTik
board is only partially working (75% packet loss). Only the two VM MAC
addresses ("02:04:02:64:00:01" and "02:04:02:64:00:02") learned from FRR are
missing the VNI information in the Type-2 routes:
show bgp l2vpn evpn route type 2
BGP table version is 4, local router ID is 10.66.200.3
Network Next Hop Metric LocPrf Weight Path
Extended Community
Route Distinguisher: 10.65.200.240:100
*> [2]:[4004055]:[48]:[48:a9:8a:ef:75:73] RD 10.65.200.240:100
10.65.200.240 0 65400 i
RT:100:100 ET:8
Route Distinguisher: 10.66.200.2:3
*> [2]:[0]:[48]:[02:04:02:64:00:01] RD 10.66.200.2:3
10.66.200.2 0 65400 65202 i
RT:100:100 ET:8
Route Distinguisher: 10.66.200.3:3
*> [2]:[0]:[48]:[02:04:02:64:00:02] RD 10.66.200.3:3
10.66.200.3 32768 i
ET:8 RT:100:100
Juniper’s statement was:
“Based on our findings, it appears that the servers are not sending their
Type-2 routes with the VNI included in the route, which may be preventing the
creation of entries in the EVPN MAC-IP table. In addition, the next-hop entry
seems to be missing from the PFE.”
We also discussed this with another engineer who has extensive experience with
Juniper and EVPN deployments. He also does not see a fundamental issue on the
Juniper side, but unfortunately he has no practical experience with FRR in this
specific scenario.
Maybe some of you have experience with FRR? Do you use it in a similar way to
us? Maybe we just missing a small detail in the config:
!
frr version 10.6.0
frr defaults traditional
hostname cs-kvm-st-cl1-03
log syslog informational
log file /var/log/frr/debug.log debugging
service integrated-vtysh-config
!
ip prefix-list LOOPBACKS seq 10 permit 10.0.0.0/8 le 32
!
interface bond1
ip ospf area 0.0.0.51
ip ospf network point-to-point
no ipv6 nd suppress-ra
exit
!
interface lo
ip address 10.66.200.3/32
exit
!
router bgp 65203
bgp router-id 10.66.200.3
no bgp ebgp-requires-policy
no bgp default ipv4-unicast
no bgp network import-check
neighbor uplinks peer-group
neighbor uplinks remote-as external
neighbor uplinks ebgp-multihop
neighbor uplinks update-source lo
neighbor 10.65.200.240 peer-group uplinks
!
address-family l2vpn evpn
neighbor uplinks activate
neighbor uplinks next-hop-self
advertise-all-vni
advertise ipv4 unicast
vni 4004070
route-target import 100:100
route-target export 100:100
proxy-arp
exit-vni
vni 4004055
route-target import 100:100
route-target export 100:100
advertise-svi-ip
proxy-arp
exit-vni
vni 10030864
route-target import 100:100
route-target export 100:100
proxy-arp
exit-vni
advertise-svi-ip
exit-address-family
exit
!
router ospf
ospf router-id 10.66.200.3
redistribute connected route-map OSPF_EXPORT
exit
!
route-map OSPF_EXPORT permit 10
match ip address prefix-list LOOPBACKS
exit
!
end
Thanks for your help!
Von: Julian Sluimann <[email protected]>
Datum: Dienstag, 19. Mai 2026 um 10:48
An: [email protected] <[email protected]>
Cc: Wido den Hollander <[email protected]>
Betreff: RE: Re: VXLAN with EVPN Problem
Hey Wido!
Thanks for reply!
> I must say I have never tried with a JunOS VC, but why would you also?
> This is a full L3 setup, correct? Why add the VC of JunOS?
We are currently using two virtual chassis because it was an existing setup
with running redundant L2 connections.
We still need these connections. Our plan was to migrate to a L3 environment
now. In a new setup we would use four single devices too.
Right now this is not possible because of production systems.
> Which MACs, where? Who is your gateway?
We are seeing all mac-addresses on the right sides in evpn database. These are
published to evpn fine.
> Why not use this for the BGP session?
We are using a bond because of the virual chassis on the switch side. We wanted
to use 2x25G connection. We are using OSPF to learn Loopback IPs of devices.
BGP connection via Loopback is the reason for one single bgp session between
switch and kvm host.
> Did you set a vtep-source under switch-options to lo0.0?
Yes, of course we set vtep Interface to loopback.
set switch-options vtep-source-interface lo0.0
Can you tell me something about your import/export rules or add a snippet too?
Right now we are not using any policies.
Our next steps are: We will try to run bgp sessions via interface addresses
without OSPF. We will try with a single spare device without VC.
We thought about using iBGP too but decided to use eBGP because of easier
debugging.
Thank you for your help!
On 2026/05/19 04:14:35 Wido den Hollander via users wrote:
>
>
> Op 18-05-2026 om 11:25 schreef Julian Sluimann:
> > Hi everyone,
> >
> > We are currently trying to expand our L2 VLAN-based CloudStack environment
> > to include EVPN VXLAN. We've run into a problem that we can't seem to solve…
> >
> > But right from the start: We are using two Juniper Virtual Chassis, each
> > with two Juniper EX4650 switches. The KVM hosts (Ubuntu 24.04 with
> > FRRouting 10.6.1) use an LACP bond (bond1) for guest traffic. We have
> > included the script needed to create the bridges. The bridge is created
> > correctly; we see traffic from the VM’s "vnet" interface passing through
> > the bridge and then exiting the bond encapsulated in VXLAN. However, the
> > traffic simply does not seem to arrive on the other side. It doesn't matter
> > whether the KVM hosts are connected to the same switch chassis. If the KVM
> > hosts communicate directly with each other via BGP, everything works
> > without any problems.
> >
>
> I do see some problems with FRR <> JunOS communicating with BGP+EVPN,
> but it can work.
>
> I must say I have never tried with a JunOS VC, but why would you also?
> This is a full L3 setup, correct? Why add the VC of JunOS?
>
> > We noticed that the ARP table within the VM is not populated correctly.
> > That's strange, because both the switches and the KVM hosts have filled
> > their ARP tables with all the MAC addresses. But even if we add the missing
> > MAC addresses of the VMs on both sides, they still cannot communicate with
> > each other. That doesn't seem to be the problem, but perhaps it's a
> > consequence of the actual problem?
>
> Which MACs, where? Who is your gateway?
>
> (see more below)
>
> >
> > Our current FRR configuration looks like this:
> >
> > !
> > frr version 10.6.0
> > frr defaults traditional
> > hostname kvm-h1
> > log syslog informational
> > service integrated-vtysh-config
> > !
> > ip prefix-list LOOPBACKS seq 10 permit 10.0.0.0/8 le 32
> > !
> > interface bond1
> > ip ospf area 0.0.0.51
> > ip ospf network point-to-point
> > no ipv6 nd suppress-ra
> > exit
> > !
> > interface lo
> > ip address 10.66.200.3/32
> > exit
> > !
> > router bgp 65203
> > bgp router-id 10.66.200.3
> > no bgp ebgp-requires-policy
> > no bgp default ipv4-unicast
> > no bgp network import-check
> > neighbor uplinks peer-group
> > neighbor uplinks remote-as external
> > neighbor uplinks ebgp-multihop
> > neighbor uplinks update-source lo
>
> Correct? How does the EX learn this loopback address of the KVM host?
> Why not use the address of the interface for the uplink?
>
> And why a bond and not two seperate BGP sessions?
>
> > neighbor 10.65.200.250 peer-group uplinks
> > neighbor 10.65.200.250 description SWITCH1
> > !
> > address-family l2vpn evpn
> > neighbor uplinks activate
> > advertise-all-vni
> > vni 10003168
> > route-target import 100:100
> > route-target export 100:100
> > proxy-arp
> > exit-vni
> > advertise-svi-ip
> > exit-address-family
> > exit
> > !
> > router ospf
> > ospf router-id 10.66.200.3
> > redistribute connected route-map OSPF_EXPORT
> > exit
> > !
> > route-map OSPF_EXPORT permit 10
> > match ip address prefix-list LOOPBACKS
> > exit
> > !
> > end
> >
> > Our test VNI is 10003168. It contains two VMs, each on a different KVM
> > host, which in turn are connected to two different virtual switches.
> >
> > The "bond1" interface is used, which is defined in the Netplan as follows:
> >
> > bonds:
> > bond1:
> > mtu: "9000"
> > interfaces:
> > - ens2f1np1
> > - eno3np1
> > addresses:
> > - "10.65.200.5/31"
>
> Why not use this for the BGP session?
>
> > parameters:
> > mode: "802.3ad"
> > lacp-rate: "slow"
> > transmit-hash-policy: "layer3+4"
> >
> > This is how our juniper configuration looks like:
> >
> > set protocols evpn no-core-isolation
> > set protocols evpn encapsulation vxlan
> > set protocols evpn default-gateway no-gateway-community
> > set protocols evpn duplicate-mac-detection detection-threshold 5
> > set protocols evpn duplicate-mac-detection detection-window 180
> > set protocols evpn duplicate-mac-detection auto-recovery-time 15
> > set protocols evpn multicast-mode ingress-replication
> > set protocols evpn extended-vni-list 4004070
> > set protocols evpn extended-vni-list 10003168
> >
> > set routing-options router-id 10.65.200.250
> > set switch-options vrf-target target:100:100
> > set switch-options route-distinguisher 10.65.200.250:100
>
> Did you set a vtep-source under switch-options to lo0.0?
>
> >
> > Session to EX4650 VC2
> >
> > set protocols bgp group BGP-SW-to-SW multihop ttl 2
> > set protocols bgp group BGP-SW-to-SW multihop no-nexthop-change
> > set protocols bgp group BGP-SW-to-SW family inet unicast
> > set protocols bgp group BGP-SW-to-SW family evpn signaling
> > set protocols bgp group BGP-SW-to-SW neighbor 10.65.100.250 description
> > AS65100
> > set protocols bgp group BGP-SW-to-SW neighbor 10.65.100.250 local-address
> > 10.65.200.250
> > set protocols bgp group BGP-SW-to-SW neighbor 10.65.100.250 peer-as 65100
> >
> > set protocols bgp group BGP-SW-to-KVM multihop ttl 2
> > set protocols bgp group BGP-SW-to-KVM multihop no-nexthop-change
> > set protocols bgp group BGP-SW-to-KVM family inet unicast
> > set protocols bgp group BGP-SW-to-KVM family evpn signaling
> > set protocols bgp group BGP-SW-to-KVM neighbor 10.66.200.3 description
> > "AS65203"
> > set protocols bgp group BGP-SW-to-KVM neighbor 10.66.200.3 local-address
> > 10.65.200.250
> > set protocols bgp group BGP-SW-to-KVM neighbor 10.66.200.3 peer-as 65203
>
> A snippet of the config on a QFX5120
>
>
> group compute {
> type external;
> multihop {
> no-nexthop-change;
> }
> accept-remote-nexthop;
> import compute-in;
> family inet {
> unicast {
> extended-nexthop;
> }
> }
> family inet6 {
> unicast;
> }
> family evpn {
> signaling;
> }
> export compute-out;
> neighbor 2a01:XXX:2:180::7 {
> description compute0;
> local-address 2a01:XXX:2:180::6;
> peer-as 64650;
> }
> }
>
> Here we peer over IPv6, so yes, slightly different.
>
>
> > > set protocols bgp local-as 65200
> >
> > Does anyone have experience with EVPN-VXLAN and Juniper EX4650 switches?
> > It’s probably a really silly problem… but we just can’t figure it out.
>
> As said, the QFX5120 gave me some troubles as well, but it works!
>
> Wido
>
> >
> > Thanks!
> >
>
>