On 09/28/2016 01:56 PM, Shmulik Ladkani wrote:
On Wed, 28 Sep 2016 12:30:56 +0200, dan...@iogearbox.net wrote:
@@ -4608,6 +4608,8 @@ int skb_vlan_push(struct sk_buff *skb, __be16 vlan_proto, 
u16 vlan_tci)

                skb->protocol = skb->vlan_proto;
                skb->mac_len += VLAN_HLEN;
+               if (offset)
+                       offset += VLAN_HLEN;

                skb_postpush_rcsum(skb, skb->data + (2 * ETH_ALEN), VLAN_HLEN);
                __skb_pull(skb, offset);

This looks much better indeed than your v1 of this patch.

Yep, after some meditation and history digging I happened to notice I
was barking at the wrong tree.

So the issue might only be visible to act_vlan as the other remaining user of
skb_vlan_push().

Yes, this is correct. I'll amend the log message to express that.
The bug occurs for callers of skb_vlan_push() whose data is not
pointing at mac_header.

My only question would be:
what about __skb_vlan_pop(), wouldn't that then need the same adjustment
a la offset -= VLAN_HLEN?

Well, theoretically, yes; but caller may expect 2 different things:

(assuming tags are in-payload)

(1) suppose upon entry we have

     DA,SA,0x8100,TCI,0x0800,
     ^                ^
     mac_hdr          data

initial offset is 18, and after current unwinding code we'll get

You mean data points after the 0x0800, right?


     DA,SA,0x0800,4_bytes,
     ^                    ^
     mac_hdr              data

which is probably incorrect, adjustment 'offset -= VLAN_HLEN' is needed.

(2) suppose upon entry we have

     DA,SA,0x8100,TCI,0x0800
     ^            ^
     mac_hdr      data

initial offset is 14, and after current unwinding code we'll get

     DA,SA,0x0800,
     ^            ^
     mac_hdr      data

which is probably what user has intended.
(had we adjusted offset to be 10, 'data' would point into SA)

 From test I've made using act_vlan upon ingress on QinQ tags, existing call
provides data as in (2).

Thoughts?

Yeah, so we likely end up at 2) because of things like eth_type_trans()
that would only pull ETH_HLEN.

Couldn't we end up with 1) for the act_vlan case when we'd have the
offset-adjusted skb_vlan_push() fix from here, where we'd then redirect
to ingress where skb_vlan_pop() would be called? If I'm not missing
something, skb_vlan_push() would then point to the data location of 1)
and with your other proposed direct netif_receive_skb() patch, no
further skb->data adjustments would be done, right?

Another potential issue (but unrelated to this fix here) I just noticed
is, whether act_vlan might have the same problem as we fixed in 8065694e6519
("bpf: fix checksum for vlan push/pop helper"). So potentially, we could
end up fixing CHECKSUM_COMPLETE wrongly on ingress, since these 14 bytes
are already pulled out of the sum at that point.

Should we adjust "offset" back, only if resulting offset is >=14 ?

If also the checksum one might end up as an issue, maybe it's just best
to go through the pain and do the push/pull for data plus csum, so both
skb_vlan_*() functions see the frame starting from mac header temporarily?
Jiri, any thoughts?

Reply via email to