> On Jun 5, 2020, at 2:11 PM, Ryan Rawdon <r...@u13.net> wrote:
> 
> 
>> On Jun 4, 2020, at 11:00 PM, James Breeden <ja...@arenalgroup.co> wrote:
>> 
>> I have been doing a lot of research recently on operating networks with 
>> partial tables and a default to the rest of the world. Seems like an easy 
>> enough approach for regional networks where you have maybe only 1 upstream 
>> transit and some peering. 
>> 
>> I come to NANOG to get feedback from others who may be doing this. We have 3 
>> upstream transit providers and PNI and public peers in 2 locations. It'd 
>> obviously be easy to transition to doing partial routes for just the peers, 
>> etc, but I'm not sure where to draw the line on the transit providers. I've 
>> thought of straight preferencing one over another. I've thought of using BGP 
>> filtering and community magic to basically allow Transit AS + 1 additional 
>> AS (Transit direct customer) as specific routes, with summarization to 
>> default for the rest. I'm sure there are other thoughts that I haven't had 
>> about this as well....
>> 
>> And before I get asked why not just run full tables, I'm looking at regional 
>> approaches to being able to use smaller, less powerful routers (or even 
>> layer3 switches) to run some areas of the network where we can benefit from 
>> summarization and full tables are really overkill. 
>> 

A few clarifications to my previous e-mail below:


> 
> We started filtering certain mixes of long and specific routes on transit, at 
> least while some upgrades to our edge capability are in progress.  We are a 
> mix of transit providers, and public/private peering at our edge.
> 
> Shortly after filtering, we started occasionally finding destinations that 
> were unreachable over the Internet (generally /24) due to:
> - We filtered them on transit, probably due to long paths
> - They were filtered from all of our transits, so their /24 was not in our 
> table
> - We did not receive their /24 on peering
> - However, we did receive a covering prefix on peering
> - Lastly, that actual destination network with the /24 no longer was 
> connected to the network we received a covering route from, like a datacenter 
> network that used to host them and SWIPed them their /24 to make it portable.
> 
- Each of the criteria above is necessary but not sufficient alone; the whole 
list is required for the reachability failure mode I was describing


> A 3rd party SaaS netflix platform’s BGP/netflow/SNMP collectors were impacted 
> by this, which was one of the first instances we encountered of this problem.
> 
- I meant Netflow, not Netflix…

> We now have some convoluted scripting and routing policy in place, trying to 
> proactively discover prefixes that may be impacted by this and then 
> explicitly accepting that prefix or ASN on transit.  It is not a desirable 
> solution, but this  seems like it could become more common over time with v4 
> prefix sales/swaps/deaggregation (with covering prefixes left in place); as 
> well as increased TE where parties announce aggregates and specifics from 
> disjoint locations.  
> 
> Our long term solution will be taking full tables again.
> 
> Ryan
> 

Reply via email to