On Sun, 14 Feb 2010, Robin Whittle wrote:

Your proposal involves using IPv4 header options.  These are
theoretically compatible with IPv4, but they can't be relied upon in
across the DFZ.  If all routers were updated, then  your system could
proceed,

Not all - some, likely a /very/ few (see below). That study shows that most paths are fine (Exactly how representative that study is, who knows - more samples might show the problem is greater or lesser in extent). However, aren't you advocating upgrading /all/ routers? :)

The techniques Fred and I developed are different, but both can cope with there being PMTU problems between ITRs and ETRs which do not result in PTBs being sent to the ITR.

This is an interesting point. Which problem is worse:

1. some DFZ routers having problems with IP options
2. some edge networks doing extremely dumb things and causing PMTU
   problems

?

Personally, I'll go for 1. Cause I know those routers are well-managed by generally very clueful people. The anecdotal evidence available to me suggests PMTU problems are caused by certain strange ideas about the importance of blocking everything but (tcp and dst port 80) having become ingrained in well-meaning but less than clueful edge network administrators (companies selling firewall/network security products bear a lot of responsibility for instilling that idea).

I see you have a protocol to deal with 2. Am I correct in thinking its Inter-TR, and exists to deal with PMTU problems between the xTRs? If so, then doesn't it leave the problem of where the PMTU hole is near the remote, non CES, edge network. e.g.:

                                remote network
host--ETR---ITR-------------------[FW----host]


Host is "CESed". "Remote network" unfortunately is behind a PMTU hole - i.e. FW drops incoming PTB ICMPs, so the remote-network host can never do PMTU properly. Hence many consumer routers clamp TCP MSS to local MTU - 40.

The problem is now there's another tunnel in the way, and it's eaten 20 from the PMTU, which 'host' (or its local access router) doesn't know about at all, and so can't automatically adjust its MTU for, and that 20 is larger than the 8 bytes of PPPoE overhead that routers usually are adjusting for.

So this creates another PMTU problem when remote-host tries sending a packet to host and never receives the PTB from the ITR.

In theory, this problem seems like it should be much easier to solve than fixing core routers to be nicer to things like IP options. However, I have my doubts :).

I notice you also have a draft that gets around this by rewriting the IP header. Though, that requires upgrading the vast majority of routers on the internet.

So it seems the problems we can choose between come down to:

1. Fixing the majority of those routers that are causing the problems
   for IP options on 15 to 20% of paths.

   If each tested path was completely independent and each path was
   10 hops on average, and the problems on each path were down to one
   router, then we're talking about about something like 1% of
   routers that need to be fixed. If average was 16 hops, then we're
   talking 0.5%. It'd be interesting to see field studies to pin this
   number down.

2. Upgrading all end-host software

3. Fixing the dumb edge networks that are causing the PMTU problems.

4. Upgrading all DFZ routers to handle a modified IP header.

It's hard to think 2 is achievable. It boils down to try re-educating people who don't really have much interest anyway in all this. We've had years and years of this problem and awareness and yet it's still there, and there are still widely used security products shipping that block ICMP by default and do all kinds of stupid things, IME. Also, the workaround for 2 is for end-hosts to clamp down their MSS even further. So adding tunneling to the internet has a risk of an ever escalating "race to 0 MTU" (PPPoE, VPNs, CES, etc - all of which can stack!).

So then it boils down to 1, 3 and 4. I think 1 is hard, but achievable. If 1 is considered so hard as to be unachieavable, then surely 4 must be impossible?

For me, the above order (1,2,4,3) corresponds to least-hardest to most-hardest. I.e. if you have to choose between which of those problems your solution must overcome, then I'd choose 1 obviously.

Obviously the choice may be more complex than that. E.g. it could be "1 AND 2" (e.g. having end-hosts add an IP option in some kind of CEE, or CEEish solution) versus "3 or 4" (e.g. CES). Then it's not so clear which is worse.

NB: I have finally found your terminology draft, so hopefully I can abuse the terminology less.

regards,
--
Paul Jakma      [email protected]  Key ID: 64A2FF6A
Fortune:
"I will make no bargains with terrorist hardware."
-- Peter da Silva
_______________________________________________
rrg mailing list
[email protected]
http://www.irtf.org/mailman/listinfo/rrg

Reply via email to