What would such network filesystems report as their blocksize? I have a
feeling it isn't going to be on the order of a MB. At least for local
filesystems, the ideal transfer block size is going to be quite a bit
larger than the filesystem block size (if the filesystem is even block
oriented; think reiser4 or cramfs). In the case of network
filesystems, they should be performing readahead in the background
between small block copies to keep the pipeline full. As long as the
copy program isn't blocked elsewhere for long periods, say in the write
to the destination, the readahead mechanism should keep the pipeline
full. Up to a point, using larger block sizes saves some CPU by
lowering the number of system calls. Past that point, though, the copy
program can start to waste enough time in the write that the readahead
stops and stalls the pipeline.
If you want really fast copies of large files, then you want to send
down multiple overlapped AIO (real kernel AIO, not the glibc threaded
implementation) O_DIRECT reads and writes, but that gets quite
complicated. Simply using blocking O_DIRECT reads into a memory-mapped
destination file buffer performs nearly as well, provided you use a
decent block size. On my system I have found that buffers of 128 KB or
more are needed to keep the pipeline full, because I'm using a two-disk
RAID 0 with a 64 KB stripe factor; blocks smaller than 128 KB keep only
one disk going at a time. That's probably getting a bit too complicated
for this conversation, though.
If we are talking about the conventional blocking cached read followed
by a blocking cached write, then I think you will find that using a
buffer size of several pages (say 32 or 64 KB) will be MUCH more
efficient than 1024 bytes (the typical local filesystem block size),
so using st_blksize for the size of the read/write buffer is not a
good choice. I think you may be ascribing meaning to st_blksize that
is not there.
Robert Latham wrote:
In local file systems, I'm sure you are correct. If you are working
with a remote file system, however, the optimal size is on the order
of megabytes, not kilobytes. For a specific example, consider the
PVFS2 file system, where the plateau in "blocksize vs. bandwidth" is
two orders of magnitude larger than 64 KB. PVFS2 is a parallel file
system for Linux clusters. I am not nearly as familiar with Lustre,
GPFS, or GFS, but I suspect those filesystems too would benefit from
block sizes larger than 64 KB.
Are you taking umbrage at the idea of using st_blksize to direct how
large the transfer size should be for I/O? I don't know what other
purpose st_blksize should have, nor are there any other fields which
are remotely valid for that purpose.
Thanks for your feedback.
==rob
_______________________________________________
Bug-coreutils mailing list
Bug-coreutils@gnu.org
http://lists.gnu.org/mailman/listinfo/bug-coreutils