On Thu, Aug 30, 2018 at 6:12 AM Vincent Whitchurch <vincent.whitchu...@axis.com> wrote: > > Currently, the only way to ignore outgoing packets on a packet socket is > via the BPF filter. With MSG_ZEROCOPY, packets that are looped into > AF_PACKET are copied in dev_queue_xmit_nit(), and this copy happens even > if the filter run from packet_rcv() would reject them. So the presence > of a packet socket on the interface takes away the benefits of > MSG_ZEROCOPY, even if the packet socket is not interested in outgoing > packets. (Even when MSG_ZEROCOPY is not used, the skb is unnecessarily > cloned, but the cost for that is much lower.) > > Add a socket option to allow AF_PACKET sockets to ignore outgoing > packets to solve this. Note that the *BSDs already have something > similar: BIOCSSEESENT/BIOCSDIRECTION and BIOCSDIRFILT. > > The first intended user is lldpd.
Clear description of the use case, thanks. I don't see a simple alternative to introducing a new socket option, either (a new ETH_P_xx protocol wildcard different from ETH_P_ALL, perhaps). > Signed-off-by: Vincent Whitchurch <vincent.whitchu...@axis.com> > --- > include/linux/netdevice.h | 1 + > include/uapi/linux/if_packet.h | 1 + > net/core/dev.c | 3 +++ > net/packet/af_packet.c | 15 +++++++++++++++ > 4 files changed, 20 insertions(+) > > diff --git a/include/linux/netdevice.h b/include/linux/netdevice.h > index ca5ab98053c8..8ef14d9edc58 100644 > --- a/include/linux/netdevice.h > +++ b/include/linux/netdevice.h > @@ -2317,6 +2317,7 @@ static inline struct sk_buff > *call_gro_receive_sk(gro_receive_sk_t cb, > > struct packet_type { > __be16 type; /* This is really htons(ether_type). > */ > + bool ignore_outgoing; > struct net_device *dev; /* NULL is wildcarded here > */ > int (*func) (struct sk_buff *, > struct net_device *, > diff --git a/include/uapi/linux/if_packet.h b/include/uapi/linux/if_packet.h > index 67b61d91d89b..467b654bd4c7 100644 > --- a/include/uapi/linux/if_packet.h > +++ b/include/uapi/linux/if_packet.h > @@ -57,6 +57,7 @@ struct sockaddr_ll { > #define PACKET_QDISC_BYPASS 20 > #define PACKET_ROLLOVER_STATS 21 > #define PACKET_FANOUT_DATA 22 > +#define PACKET_IGNORE_OUTGOING 23 > > #define PACKET_FANOUT_HASH 0 > #define PACKET_FANOUT_LB 1 > diff --git a/net/core/dev.c b/net/core/dev.c > index 325fc5088370..0addb4f0abfe 100644 > --- a/net/core/dev.c > +++ b/net/core/dev.c > @@ -1947,6 +1947,9 @@ static inline bool skb_loop_sk(struct packet_type > *ptype, struct sk_buff *skb) > if (!ptype->af_packet_priv || !skb->sk) > return false; > > + if (ptype->ignore_outgoing) > + return true; > + This probably does not belong in skb_loop_sk, but in dev_queue_xmit_nit directly. > if (ptype->id_match) > return ptype->id_match(ptype, skb->sk); > else if ((struct sock *)ptype->af_packet_priv == skb->sk)