John Heffner wrote:
> Check the pmtu check at the transport layer (for UDP, ICMP and raw), and
> send a local error if socket is PMTUDISC_DO and packet is too big.  This is
> actually a pure bugfix for ipv6.  For ipv4, it allows us to do pmtu checks
> in the same way as for ipv6.
> 
> diff --git a/net/ipv4/ip_output.c b/net/ipv4/ip_output.c
> index d096332..593acf7 100644
> --- a/net/ipv4/ip_output.c
> +++ b/net/ipv4/ip_output.c
> @@ -822,7 +822,9 @@ int ip_append_data(struct sock *sk,
>       fragheaderlen = sizeof(struct iphdr) + (opt ? opt->optlen : 0);
>       maxfraglen = ((mtu - fragheaderlen) & ~7) + fragheaderlen;
>  
> -     if (inet->cork.length + length > 0xFFFF - fragheaderlen) {
> +     if (inet->cork.length + length > 0xFFFF - fragheaderlen ||
> +         (inet->pmtudisc >= IP_PMTUDISC_DO &&
> +          inet->cork.length + length > mtu)) {
>               ip_local_error(sk, EMSGSIZE, rt->rt_dst, inet->dport, 
> mtu-exthdrlen);
>               return -EMSGSIZE;
>       }


This makes ping report an incorrect MTU when IPsec is used since we're
only accounting for the additional header_len, not the trailer_len
(which is not easily changeable). Additionally it will report different
MTUs for the first and following fragments when the socket is corked
because only the first fragment includes the header_len. It also can't
deal with things like NAT and routing by fwmark that change the route.
The old behaviour was that we get an ICMP frag. required with the MTU
of the final route, while this will always report the MTU of the
initially chosen route.

For all these reasons I think it should be reverted to the old
behaviour.

-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to