On 31.08.2011, 21:12, Andrew Wiley <wiley.andre...@gmail.com> wrote:

Yes, but the disk should be able to execute read and write requests out of
order (which SCSI was designed to allow), so this should never be an issue.
My understanding is that most SCSI implementations allow this (and SCSI is
in just about every storage protocol out there), with the exception of
Bulk Only Transport, which is used with USB hard drives and flash drives.
That will soon be replaced by USB Attached SCSI
(http://en.wikipedia.org/wiki/USB_Attached_SCSI), which specifically allows
out-of-order execution.
The idea behind much of SCSI is that the disk designers know far better
than the OS developers what is fast and what is not, and if we need to
worry about it at the application level, something has gone seriously
wrong.
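
(Inline illustration: a minimal sketch, assuming POSIX AIO (link with -lrt
on Linux/glibc) and a placeholder file name, of keeping several requests
queued at once so the lower layers are free to reorder them. Note that
glibc implements POSIX AIO with user-space threads, so the actual
reordering happens in the kernel's elevator and, with TCQ/NCQ, in the
drive itself.)

/* Queue several reads at once instead of one at a time; the drive
 * may then service them in whatever order suits its head position. */
#include <aio.h>
#include <errno.h>
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

enum { NREQ = 4, CHUNK = 64 * 1024 };

int main(void)
{
    int fd = open("bigfile.dat", O_RDONLY);   /* placeholder name */
    if (fd < 0) { perror("open"); return 1; }

    static char buf[NREQ][CHUNK];
    struct aiocb cb[NREQ];
    const struct aiocb *list[NREQ];

    for (int i = 0; i < NREQ; i++) {
        memset(&cb[i], 0, sizeof cb[i]);
        cb[i].aio_fildes = fd;
        cb[i].aio_buf    = buf[i];
        cb[i].aio_nbytes = CHUNK;
        cb[i].aio_offset = (off_t)i * CHUNK;
        if (aio_read(&cb[i]) != 0) { perror("aio_read"); return 1; }
        list[i] = &cb[i];
    }

    /* Block until every request has completed, regardless of the
     * completion order the lower layers chose. */
    int done = 0;
    while (done < NREQ) {
        aio_suspend(list, NREQ, NULL);
        done = 0;
        for (int i = 0; i < NREQ; i++)
            if (aio_error(&cb[i]) != EINPROGRESS)
                done++;
    }

    for (int i = 0; i < NREQ; i++)
        printf("request %d: %zd bytes\n", i, aio_return(&cb[i]));
    close(fd);
    return 0;
}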

SCSI has internal COPY commands; under optimal conditions the data never
touches main memory, and ideally that is what would be used. But here we
have an application that wants to read a chunk from a file, say 64 KB,
while the OS quickly detects the linear read pattern and reads ahead a
larger block of, say, 512 KB. The chunks are written out piecewise, and if
all goes well the kernel merges the writes so they become a 512 KB block
of their own. These are sent to a disk with an internal 8 MB cache, which
now has to decide, at this low level, whether to write the whole 8 MB at
once or to allow some reads in between, which keeps applications
responsive but increases seek time. Depending on the scenario, either
choice is preferable. In the file-copy scenario it is better to flush the
whole cache and then switch back to reading. If, on the other hand, you
are saving a video and click on the 'start' menu, which has to load a lot
of small icons, then the long write should yield to the reads. The disk
cannot make an informed decision here.

I think the basic conflict is that we have desktop and server environments
that are heavily multi-threaded, and HDDs that are inherently
single-threaded. They work best when you perform one continuous operation,
because every "context switch" costs a seek. If the problem were easy to
solve at the low level, there wouldn't be three full-blown I/O schedulers
in the Linux kernel (noop, deadline, and CFQ), IMO.

I don't worry about it at the application level, because I know the kernel
and disk handle most use cases well, without real worst-case behavior. But
if using an HDD in a multi-threaded way actually yielded higher throughput
than the naive read-first-write-later approach (see the sketch below), I
would be seriously surprised. If I were to bet, I'd say it is at least 5%
slower with buffering and an I/O scheduler enabled, and a real disaster
without them. :D
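
For reference, the naive read-first-write-later loop I mean is just the
following; a minimal sketch assuming plain POSIX calls, placeholder file
names, and the 64 KB chunk size from above:

/* The naive copy: read a 64 KB chunk, write it back out, repeat.
 * The kernel's readahead and write merging turn this into the
 * large sequential transfers described above. */
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

enum { CHUNK = 64 * 1024 };

int main(void)
{
    static char buf[CHUNK];
    int in  = open("src.dat", O_RDONLY);
    int out = open("dst.dat", O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (in < 0 || out < 0) { perror("open"); return 1; }

    /* Hint that we read sequentially so the kernel may enlarge its
     * readahead window (it usually detects the pattern anyway). */
    posix_fadvise(in, 0, 0, POSIX_FADV_SEQUENTIAL);

    ssize_t n;
    while ((n = read(in, buf, CHUNK)) > 0) {
        for (ssize_t off = 0; off < n; ) {    /* handle short writes */
            ssize_t w = write(out, buf + off, n - off);
            if (w < 0) { perror("write"); return 1; }
            off += w;
        }
    }
    if (n < 0) perror("read");

    close(in);
    close(out);
    return n < 0 ? 1 : 0;
}

Everything clever (readahead sizing, write merging, scheduling) happens
below this loop, which is exactly why I doubt application-level threading
buys anything here.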

- Marco
