> -----Original Message-----
> From: Alexander Duyck [mailto:alexander.du...@gmail.com]
> Sent: Wednesday, December 10, 2014 9:29 PM
> To: Alexander Duyck; linux-a...@vger.kernel.org; net...@vger.kernel.org;
> linux-kernel@vger.kernel.org; a...@arndb.de; da...@davemloft.net
> Cc: mathieu.desnoy...@polymtl.ca; pet...@infradead.org;
> b...@kernel.crashing.org; heiko.carst...@de.ibm.com; mi...@kernel.org;
> mi...@neuling.org; li...@arm.linux.org.uk; Skidmore, Donald C; Vick,
> Matthew; ge...@linux-m68k.org; Kirsher, Jeffrey T; rom...@fr.zoreil.com;
> paul...@linux.vnet.ibm.com; nic_s...@realtek.com;
> will.dea...@arm.com; mich...@ellerman.id.au; Luck, Tony; torvalds@linux-
> foundation.org; o...@redhat.com; schwidef...@de.ibm.com;
> fweis...@gmail.com
> Subject: Re: [PATCH v7 0/4] arch: Add lightweight memory barriers for
> coherent memory access
> 
> On 11/25/2014 12:35 PM, Alexander Duyck wrote:
> > These patches introduce two new primitives for synchronizing cache
> > coherent memory writes and reads.  These two new primitives are:
> >
> >     dma_rmb()
> >     dma_wmb()
> >
> > The first patch cleans up some unnecessary overhead related to the
> > definition of read_barrier_depends, smp_read_barrier_depends, and
> > comments related to the barrier.
> >
> > The second patch adds the primitives for the applicable architectures
> > and asm-generic.
> >
> > The third patch adds the barriers to r8169 which turns out to be a
> > good example of where the new barriers might be useful as they have
> > full
> > rmb()/wmb() barriers ordering accesses to the descriptors and the
> > DescOwn bit.
> >
> > The fourth patch adds support for coherent_rmb() to the Intel fm10k,
> > igb, and ixgbe drivers.  Testing with the ixgbe driver has shown a
> > processing time reduction of at least 7ns per 64B frame on a Core i7-4930K.
> >
> > This patch series is essentially the v7 for:
> > v4-6:       Add lightweight memory barriers for coherent memory access
> > v3: Add lightweight memory barriers fast_rmb() and fast_wmb()
> > v2: Introduce load_acquire() and store_release()
> > v1: Introduce read_acquire()
> >
> > The key changes in this patch series versus the earlier patches are:
> > v7:
> >     - Dropped test/debug patch that was accidentally slipped in
> > v6:
> >     - Replaced "memory based device I/O" with "consistent memory" in
> >       docs
> >     - Added reference to DMA-API.txt to explain consistent memory
> > v5:
> >     - Renamed barriers dma_rmb and dma_wmb
> >     - Undid smp_wmb changes in x86 and PowerPC
> >     - Defined smp_rmb as __lwsync for SMP case on PowerPC
> > v4:
> >     - Renamed barriers coherent_rmb and coherent_wmb
> >     - Added smp_lwsync for use in
> smp_load_acquire/smp_store_release
> > v3:
> >     - Moved away from acquire()/store() and instead focused on barriers
> >     - Added cleanup of read_barrier_depends
> >     - Added change in r8169 to fix cur_tx/DescOwn ordering
> >     - Simplified changes to just replacing/moving barriers in r8169
> >     - Added update to documentation with code example
> > v2:
> >     - Renamed read_acquire() to be consistent with smp_load_acquire()
> >     - Changed barrier used to be consistent with smp_load_acquire()
> >     - Updated PowerPC code to use __lwsync based on IBM article
> >     - Added store_release() as this is a viable use case for drivers
> >     - Added r8169 patch which is able to fully use primitives
> >     - Added fm10k/igb/ixgbe patch which is able to test performance
> >
> > ---
> >
> > Alexander Duyck (4):
> >       arch: Cleanup read_barrier_depends() and comments
> >       arch: Add lightweight memory barriers dma_rmb() and dma_wmb()
> >       r8169: Use dma_rmb() and dma_wmb() for DescOwn checks
> >       fm10k/igb/ixgbe: Use dma_rmb on Rx descriptor reads
> >
> >
> >  Documentation/memory-barriers.txt             |   42 +++++++++++++++
> >  arch/alpha/include/asm/barrier.h              |   51 ++++++++++++++++++
> >  arch/arm/include/asm/barrier.h                |    4 +
> >  arch/arm64/include/asm/barrier.h              |    3 +
> >  arch/blackfin/include/asm/barrier.h           |   51 ++++++++++++++++++
> >  arch/ia64/include/asm/barrier.h               |   25 ++++-----
> >  arch/metag/include/asm/barrier.h              |   19 ++++---
> >  arch/mips/include/asm/barrier.h               |   61 ++--------------------
> >  arch/powerpc/include/asm/barrier.h            |   19 ++++---
> >  arch/s390/include/asm/barrier.h               |    7 ++-
> >  arch/sparc/include/asm/barrier_64.h           |    7 ++-
> >  arch/x86/include/asm/barrier.h                |   70 
> > ++++---------------------
> >  arch/x86/um/asm/barrier.h                     |   20 ++++---
> >  drivers/net/ethernet/intel/fm10k/fm10k_main.c |    6 +-
> >  drivers/net/ethernet/intel/igb/igb_main.c     |    6 +-
> >  drivers/net/ethernet/intel/ixgbe/ixgbe_main.c |    9 +--
> >  drivers/net/ethernet/realtek/r8169.c          |   29 ++++++++--
> >  include/asm-generic/barrier.h                 |    8 +++
> >  18 files changed, 258 insertions(+), 179 deletions(-)
> >
> > --
> 
> It occurs to me that I never got a sign off from any of the maintainers on
> getting this pulled in.
> 
> Since the merge window is open I was wondering which tree I should make
> sure these patches apply to and who will be the one to pull them in?
> Since I was modifying network drivers should I resubmit them for netdev, or
> should I submit them for asm-generic or some other tree?
> 
> - Alex

For at least ixgbe, it looks good to me.

Acked-by: Don Skidmore <donald.c.skidm...@intel.com>

Reply via email to