Re: etc.curl: Formal review begin

Andrei Alexandrescu Tue, 30 Aug 2011 10:40:43 -0700

On 8/30/11 12:22 PM, jdrewsen wrote:

Walter suggested that I should write an article about using the wrapper.
I've now taken the first steps on writing such an article. I will have
to get the library API rock stable before I can finish it though.


I have a suggestion for you - write and test an asynchronous copy program.

It is a continuous source of surprise to me that even seasonedprogrammers don't realize that this is an inefficient copy routine:


while (read(source, buffer))
  write(target, buffer);

If the methods are synchronous and the speeds of source and target areindependent, the net transfer rate of the routine is R1*R1/(R1+R2),where R1 and R2 are the transfer rates of the source and destinationrespectively. In the worst case R1=R2 and the net transfer rate is halfthat.

This is an equation very easy to derive from first principles but manypeople are very incredulous about it. Consequently, many classic filecopying programs (including cp; I don't know about wget or curl) use theinefficient method. As the variety of data sources increases (SSD,magnetic, networked etc) I predict async I/O will become increasinglyprevalent. In an async approach with a queue, transfer proceeds at theoptimal speed min(R1, R2). That's why I'm insisting the async rangeshould be super easy to use, encapsulated, and robust: if people reachfor the async range by default for their dealings with networked data,they'll write optimal code, sometimes even without knowing it.

If your article discusses this and shows e.g. how to copy data optimallyfrom one server to another using HTTP, or from one server to a file etc,and if furthermore you show how your API makes all that a trivialfive-liner, that would be a very instructive piece.



Andrei

Re: etc.curl: Formal review begin

Reply via email to