Re: [driver-discuss] A question: can be avoid using ´bcopy´ in Tx of the NIC driver?

Garrett D'Amore Mon, 02 Mar 2009 19:39:03 -0800

Brian Xu - Sun Microsystems - Beijing China wrote:

Hi there,
I have a question here:
Why all of the NIC drivers have to bcopy the MBLKs for transmit? (someof them bcopy always, and some others bcopy under a threshold of thepacket length).
I think one of the reason is the overhead of the setup of dma on thefly is greater than the overhead of bcopy for short packets. I want toknow if this is the case and if there are any other reasons.

Yes. For any packet reasonably sized bcopy (ETHERMTU or smaller) isfaster on *all* recent hardware. (This is confirmed on even an older300MHz Via C3.) (Hmm... I've heard that for some Niagra systems thismight not be true, however. But I've not tested it myself.)


I think the situation is different with jumbo frames, though.

If what I guess is the major cause, I have a proposal and I want tohear your advice whether it makes sense.
The most time-consuming action for the dma setup is the dma bind, morespecific, calling into the VM layer to get the PFN for thevaddr(hat_getpfnum()), since it need to search the huge page table.While for the MBLKs, essentially which are slab objects, the PFN hasalready been determined in the slab layer, and for most of theirusage, we only touch the magazine layer, where the PFN is a predetermined one. That is, the PFN should be considered as a constructedstate, but we don't leverage it for dma bind.
In storage, we have a field 'b_shadow' in buf(9S) to store the pageswhich are recently used, through which the PFNs can be easily got. so inthe case that b_shadow works, ddi_dma_buf_bind_handle() is much fasterthan the ddi_dma_mem_bind_handle().Another example, moving the dma bind of the HBA driver(mpt) from Txpath to the kmem cache constrcutor, mpt driver got 26% throughputincrement. See CR6707308.
If the mblk could store the PFN info and we had addi_dma_mblk_bind_handle() like interface, then I think it willbenefit the performance of the NIC drivers. I consulted the PAE, andgot a answer that the bcopy is typically about 10-15% of a NIC TXworkload.

There are things that can do to make DMA faster, better, and simpler.In an ideal world, the GLDv3 could do most of this work, and the mblkcould just carry the ddi_dma_cookie with it.


   -- Garrett


Thanks,
Brian

_______________________________________________
driver-discuss mailing list
driver-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/driver-discuss


_______________________________________________
driver-discuss mailing list
driver-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/driver-discuss

Re: [driver-discuss] A question: can be avoid using ´bcopy´ in Tx of the NIC driver?

Reply via email to