Lines 576-610: Your method of register mapping is rather quaint. You
shouldn't need to examine the "reg" property, but instead use
ddi_regs_map_setup() and friends. (You can inquire about number of
register sets and sizing information with ddi_dev_nregs(),
ddi_dev_regsize(), etc.) I'd rather see you use the DDI convenience
routines here. (Note that config space is set up with
pci_config_setup().)
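For illustration, the convenience routines mentioned above would be used
roughly like this (a sketch with made-up names and trimmed error handling,
not the driver's actual code):

    #include <sys/ddi.h>
    #include <sys/sunddi.h>
    #include <sys/cmn_err.h>

    static int
    xx_query_regs(dev_info_t *dip)
    {
            ddi_acc_handle_t cfg_hdl;
            int nregs, i;
            off_t regsize;

            /* Config space is handled separately by the PCI nexus. */
            if (pci_config_setup(dip, &cfg_hdl) != DDI_SUCCESS)
                    return (DDI_FAILURE);

            /* How many register sets does the node export, and how big? */
            if (ddi_dev_nregs(dip, &nregs) != DDI_SUCCESS) {
                    pci_config_teardown(&cfg_hdl);
                    return (DDI_FAILURE);
            }
            for (i = 0; i < nregs; i++) {
                    if (ddi_dev_regsize(dip, i, &regsize) == DDI_SUCCESS)
                            cmn_err(CE_CONT, "?register set %d is %ld bytes\n",
                                i, (long)regsize);
            }
            pci_config_teardown(&cfg_hdl);
            return (DDI_SUCCESS);
    }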
I agree that it is quaint... but... memory space accesses don't work
for the old VT86C100A cards. (They have memory space in their BARs,
but nothing happens if you write to them via memory space.)
But you should use a different register number then... and you should
then still be able to use ddi_regs_map_setup(). Does this not work?
The issue is that the driver needs to know the access type (io space/mem
space) for a set number. It queries the reg property to make that
association. Then, ddi_regs_map_setup() is used with the 'right'
register set number.
The webpage states:
"As mentioned in "IEEE 1275 PCI Binding," IEEE 1275 PCI binding makes no
specific connection between the entries in the "reg" property and the
configuration registers in PCI configuration space. Driver developers
should check the "reg" property to ensure that the correct rnumber is
used in ddi_regs_map_setup(9F)."
So, I read this as "you can't assume that set number 1 always provides
i/o space accesses".
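In case it helps, the association the driver makes could be sketched
roughly like this (assuming pci_regspec_t and the PCI_ADDR_* masks from
<sys/pci.h>; the function and handle names are illustrative, not the
driver's own):

    #include <sys/ddi.h>
    #include <sys/sunddi.h>
    #include <sys/pci.h>

    static int
    xx_map_io_regs(dev_info_t *dip, caddr_t *basep, ddi_acc_handle_t *hdlp)
    {
            ddi_device_acc_attr_t attr;
            pci_regspec_t *regs;
            uint_t nelem, nsets, i;
            int rnumber = -1;

            if (ddi_prop_lookup_int_array(DDI_DEV_T_ANY, dip,
                DDI_PROP_DONTPASS, "reg", (int **)&regs, &nelem) !=
                DDI_PROP_SUCCESS)
                    return (DDI_FAILURE);

            /*
             * Each "reg" entry is one pci_regspec_t (5 ints); the space
             * type is encoded in pci_phys_hi. Pick the first set that
             * decodes I/O space.
             */
            nsets = nelem / (sizeof (pci_regspec_t) / sizeof (int));
            for (i = 0; i < nsets; i++) {
                    if ((regs[i].pci_phys_hi & PCI_ADDR_MASK) ==
                        PCI_ADDR_IO) {
                            rnumber = (int)i;
                            break;
                    }
            }
            ddi_prop_free(regs);
            if (rnumber == -1)
                    return (DDI_FAILURE);

            attr.devacc_attr_version = DDI_DEVICE_ATTR_V0;
            attr.devacc_attr_endian_flags = DDI_STRUCTURE_LE_ACC;
            attr.devacc_attr_dataorder = DDI_STRICTORDER_ACC;

            /* Map the whole set (offset 0, length 0) with that rnumber. */
            return (ddi_regs_map_setup(dip, rnumber, basep, 0, 0,
                &attr, hdlp));
    }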
Lines 717-760: I believe you shouldn't need to use driver.conf...
Brussels is the "new" way forward, and I believe that a driver
without legacy concerns shouldn't be using driver.conf anymore.
I added it for consistency with the current state of Solaris' drivers.
Just tell me again that it needs to go and it's gone.
Nuke it. I'm working on a different ethernet driver for a popular
commodity chip, and it will *not* have driver.conf settings. (Every
friggin' driver always had its own unique tunables, so there was no
consistency to follow. Brussels is the way forward.)
OK. I will remove it.
Line 1287: I see you're using desballoc. For a 100M driver, I think
this adds complexity that is questionable.
For modern CPUs, the benefit of "zero copy" on a 100M device does not
outweigh the advantage of an easily maintainable driver. However, I
specifically wrote this driver for use on my 600MHz VIA C3
(http://joostm.nl/solaris/onix/), and on this box, it matters. Every
cycle not spent on copying data is used for something useful.
But with normal-sized ethernet packets, you may find that the cost of
the DMA machinations approaches the cost of doing the copying,
especially when you add the locking that you're currently missing.
Even on a 600MHz VIA C3 it shouldn't matter much at 100M.
With smaller packets (e.g. typical for HTTP) the copy will always win.
The tradeoff varies from one platform to another, but any "modern" (i.e.
anything made this decade) system will probably do better copying at
least up to 1K.
Particularly, you need to
make sure that you don't have a race with your driver getting
unloaded (e.g. via modunload()) while you still have mblks that are
"loaned up". I believe your driver suffers from this race, which
could ultimately lead to a panic in certain situations. "Fixing"
this requires the driver to use a lock to guard modunload() from
completing successfully if there are outstanding mblks via
desballoc. (That also means you need a global reference counter.)
Again, for a typical 100Mbit driver, the perf. gain is questionable
anyway, so I'd not bother with this and just use bcopy. (Plus bcopy
solves a lot of extra problems with resource management...)
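For what it's worth, the guard being described would look something like
this (a sketch only; the lock, counter and structure names are made up for
illustration, not taken from the driver under review):

    #include <sys/stream.h>
    #include <sys/strsun.h>
    #include <sys/ksynch.h>

    static kmutex_t vr_loan_lock;           /* protects the counter */
    static uint32_t vr_loans_outstanding;   /* mblks loaned upstream */

    typedef struct vr_rxbuf {
            frtn_t  rb_frtn;        /* free routine handed to desballoc() */
            caddr_t rb_buf;         /* DMA buffer backing the mblk */
            size_t  rb_len;
    } vr_rxbuf_t;

    /* Called by STREAMS when the upper layer finally frees the loaned mblk. */
    static void
    vr_rxbuf_free(void *arg)
    {
            vr_rxbuf_t *rb = arg;

            /* ... put rb->rb_buf back on the rx free list here ... */

            mutex_enter(&vr_loan_lock);
            vr_loans_outstanding--;
            mutex_exit(&vr_loan_lock);
    }

    static mblk_t *
    vr_rxbuf_loan(vr_rxbuf_t *rb)
    {
            mblk_t *mp;

            rb->rb_frtn.free_func = vr_rxbuf_free;
            rb->rb_frtn.free_arg = (caddr_t)rb;

            mp = desballoc((uchar_t *)rb->rb_buf, rb->rb_len, BPRI_MED,
                &rb->rb_frtn);
            if (mp != NULL) {
                    mutex_enter(&vr_loan_lock);
                    vr_loans_outstanding++;
                    mutex_exit(&vr_loan_lock);
            }
            return (mp);
    }

    /*
     * detach(9E) then refuses to unload while anything is still loaned:
     *
     *      mutex_enter(&vr_loan_lock);
     *      if (vr_loans_outstanding != 0) {
     *              mutex_exit(&vr_loan_lock);
     *              return (DDI_FAILURE);
     *      }
     *      mutex_exit(&vr_loan_lock);
     */

Even with a counter like this, the check only works once receive is fully
quiesced, which is part of why this area is so easy to get wrong.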
This is in detach:
418 if (vrp->dmamem > 0) {
419 vr_log(vrp, CE_NOTE, "Can't detach, buffers in upper layer.");
420 return (DDI_FAILURE);
421 }
vrp->dmamem is the amount of memory allocated for DMA buffers for that
instance. Is this sufficient?
No, you need to guard against race conditions. That implies locking.
This area is really tricky to get right, and almost every driver that
has ever tried to do this has gotten it WRONG.
The extra overhead this implies is painful. I'd *really* recommend you
consider just using bcopy.
Which driver does it right, then? A correct example would be useful.
This would solve the problem described at line 1377 as well, because
you can then bcopy into a misaligned buffer, avoiding an extra data
copy in the IP stack later.
If you don't change this otherwise, you must add safeguards against
the modunload race, and you *should* conditionalize it on the length
of the packet. (Tiny packets will get dismal perf. on this path.)
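Roughly, the copy path being suggested could look like this (hypothetical
names; single-fragment receive only):

    #include <sys/stream.h>
    #include <sys/ddi.h>
    #include <sys/sunddi.h>

    /*
     * Copy a received frame out of the DMA buffer into a fresh mblk whose
     * read pointer is offset by 2, so the IP header ends up 4-byte aligned
     * and the stack does not have to copy it again.
     */
    static mblk_t *
    xx_rx_copy(uchar_t *dmabuf, size_t len)
    {
            mblk_t *mp;

            if ((mp = allocb(len + 2, BPRI_MED)) == NULL)
                    return (NULL);

            mp->b_rptr += 2;
            bcopy(dmabuf, mp->b_rptr, len);
            mp->b_wptr = mp->b_rptr + len;
            return (mp);
    }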
The TX path is changed so that only packets of 128 bytes and above are
mapped.
I suspect that 128 needs to be more like 512 or even 1024.
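Something along these lines, say (the threshold value, names and descriptor
layout are placeholders; a single-fragment message is assumed for brevity):

    #include <sys/strsun.h>
    #include <sys/ddi.h>
    #include <sys/sunddi.h>

    #define XX_TX_BCOPY_THRESH      512     /* per the discussion above */

    typedef struct xx_txdesc {
            ddi_dma_handle_t        td_dmah;        /* for on-the-fly binding */
            caddr_t                 td_copybuf;     /* prebound bounce buffer */
            ddi_dma_cookie_t        td_cookie;
            uint_t                  td_ncookies;
    } xx_txdesc_t;

    static int
    xx_tx_prepare(xx_txdesc_t *td, mblk_t *mp)
    {
            size_t len = msgsize(mp);

            if (len < XX_TX_BCOPY_THRESH) {
                    /*
                     * Small frame: copy into the prebound buffer;
                     * mcopymsg() also frees the message for us.
                     */
                    mcopymsg(mp, td->td_copybuf);
                    /* td_cookie/td_ncookies already describe td_copybuf. */
                    return (DDI_SUCCESS);
            }

            /* Large frame: bind the mblk's data buffer directly. */
            if (ddi_dma_addr_bind_handle(td->td_dmah, NULL,
                (caddr_t)mp->b_rptr, MBLKL(mp),
                DDI_DMA_WRITE | DDI_DMA_STREAMING, DDI_DMA_DONTWAIT, NULL,
                &td->td_cookie, &td->td_ncookies) != DDI_DMA_MAPPED)
                    return (DDI_FAILURE);

            return (DDI_SUCCESS);
    }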
Line 1786. Similar to the rx comments above, this approach, which is
common but IMO ugly, is better eliminated in favor of a bcopy of the
frame into prealloc'd buffers. In particular, DMA setup and teardown is
expensive, and this actually *hurts* performance worse than just using
bcopy on *many* platforms... especially if the frames are not full
MTU frames. I think I wouldn't bother with this extra effort, but go
with the simple bcopy approach. You've got to do that anyway if the
packet doesn't fit your alignment or block count requirements, so it's
simpler to just skip all the tests and do the copy unconditionally,
IMO. (For larger frames -- say more than 512 bytes -- the tradeoff may
favor DMA setup and teardown, but it's pretty close, I think. And,
for 100Mbps with reasonably recent CPUs -- anything in the last
decade or so -- you shouldn't need the special DMA tricks to get
line-speed performance.)
Note that unlike rx, you can't even get the benefit of reusing
mappings... so the tradeoff here for bcopy vs. DMA much more
strongly favors bcopy, I think.
Again, I agree. True for modern CPUs. On slower CPUs, things are
different. The driver's objective is not to get wirespeed TCP
throughput. It's more like "get decent throughput in as few cycles as
possible". TX DMA mapping supports that objective. Every cycle not spent
in the driver is used elsewhere.
See my earlier comments. If you want to do this, go ahead, but I think
you're mistaken. You should do some performance analysis to see if
you're really saving CPU cycles. I suspect you might not be saving as
much as you think. (It will vary by packet size, as well as the specific CPU and
MMU configuration, of course.)
OK. I will re-evaluate copy versus DMA mapping on the VIA C3 box,
measured using NFS. Is your objection to DMA mappings on the TX path,
the RX path, or both?
You still need to handle the DMA binding properly, using locking. The
locking will sap performance somewhat, which is one of the reasons that
I so strongly advocate just copying.
In fact, I'm probably going to do this in the Gigabit driver I'm working
on.
-- Garrett
Thanks for caring!
Joost