On Fri, 02 Mar 2007 13:26:38 -0800 (PST) David Miller <[EMAIL PROTECTED]> wrote:
> From: Stephen Hemminger <[EMAIL PROTECTED]> > Date: Wed, 28 Feb 2007 17:18:46 -0800 > > > I was measuring bridging/routing performance and noticed this. > > > > The current code runs the "all packet" type handlers before calling the > > bridge hook. If an application (like some DHCP clients) is using AF_PACKET, > > this means that each received packet gets run through the Berkeley Packet > > Filter > > code in sk_run_filter (slow). > > I know we closed this out by saying that even though performance > sucks, we can't really apply this without breaking things. wrong. > What would be broken is if the DHCP client isn't specifying > a device ifindex when it binds the AF_PACKET socket. That > would be an easy way to fix this performance problem at the > application level. > > The DHCP client should only care about a particular interface's > traffic, the one it wants to listen on. My assumption is that when bridging, the normal stack path only has to receive those packets that it would receive if it was not doing bridging. A better version of the patch is: ============== The current code runs the "all packet" type handlers before calling the bridge hook. If an application (like some DHCP clients) is using AF_PACKET, this means that each received packet gets run through the Berkeley Packet Filter code in sk_run_filter. This is significant overhead. By moving the bridging hook to run first, the packets flowing through the bridge get filtered out there first. This results in a 14% improvement in performance. Signed-off-by: Stephen Hemminger <[EMAIL PROTECTED]> --- net/core/dev.c | 24 ++++++++++++------------ 1 file changed, 12 insertions(+), 12 deletions(-) --- netem.orig/net/core/dev.c +++ netem/net/core/dev.c @@ -1702,9 +1702,12 @@ struct net_bridge_fdb_entry *(*br_fdb_ge unsigned char *addr); void (*br_fdb_put_hook)(struct net_bridge_fdb_entry *ent); -static __inline__ int handle_bridge(struct sk_buff **pskb, - struct packet_type **pt_prev, int *ret, - struct net_device *orig_dev) +/* + * If bridge module is loaded call bridging hook. + * when it returns 1, this is a non-local packet + */ +int (*br_handle_frame_hook)(struct net_bridge_port *p, struct sk_buff **pskb) __read_mostly; +static int handle_bridge(struct sk_buff **pskb) { struct net_bridge_port *port; @@ -1712,15 +1715,10 @@ static __inline__ int handle_bridge(stru (port = rcu_dereference((*pskb)->dev->br_port)) == NULL) return 0; - if (*pt_prev) { - *ret = deliver_skb(*pskb, *pt_prev, orig_dev); - *pt_prev = NULL; - } - return br_handle_frame_hook(port, pskb); } #else -#define handle_bridge(skb, pt_prev, ret, orig_dev) (0) +#define handle_bridge(pskb) 0 #endif #ifdef CONFIG_NET_CLS_ACT @@ -1799,6 +1797,9 @@ int netif_receive_skb(struct sk_buff *sk } #endif + if (handle_bridge(&skb)) + goto out; + list_for_each_entry_rcu(ptype, &ptype_all, list) { if (!ptype->dev || ptype->dev == skb->dev) { if (pt_prev) @@ -1826,9 +1827,6 @@ int netif_receive_skb(struct sk_buff *sk ncls: #endif - if (handle_bridge(&skb, &pt_prev, &ret, orig_dev)) - goto out; - type = skb->protocol; list_for_each_entry_rcu(ptype, &ptype_base[ntohs(type)&15], list) { if (ptype->type == type && -- Stephen Hemminger <[EMAIL PROTECTED]> - To unsubscribe from this list: send the line "unsubscribe netdev" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html