On Oct 2, 2007, at 9:36, Brian Aker wrote:

So I am looking at increasing the performance of libmemcached. Looking at how some of the other clients are implemented, I am finding a catch-22 that I am hoping someone can explain.

Most clients seem to be setting their IO to non-blocking, which is excellent, but I don't understand what this is really buying since:
1) Clients are not threaded

I don't quite understand why you're implying non-blocking IO and threading must go together. Many people implement threads just because non-blocking IO appears to require more thought (in reality, it seems to be the other way around, but that's a different issue).

My client is used in threaded environments, but only has one thread dedicated to IO multiplexing. It performs non-blocking IO over as many connections as it needs... sending and receiving whenever possible and completing requests when enough data has arrived.
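Roughly, that one IO thread looks like this (a minimal sketch with made-up names, not any particular client's internals; connection setup and protocol parsing are left out):

    /* One thread doing non-blocking IO over several memcached
     * connections with poll(2). */
    #include <fcntl.h>
    #include <poll.h>
    #include <unistd.h>

    #define NSERVERS 4

    struct conn {
        int fd;
        char outbuf[8192]; size_t out_len, out_off;  /* pending writes */
        char inbuf[8192];  size_t in_len;            /* partial reads  */
    };

    static void set_nonblocking(int fd)
    {
        int flags = fcntl(fd, F_GETFL, 0);
        fcntl(fd, F_SETFL, flags | O_NONBLOCK);
    }

    /* One pass of the loop: write whenever the kernel will take bytes,
     * read whenever bytes are available, and otherwise sleep in poll()
     * -- i.e. only wait when there is genuinely nothing to do. */
    static void io_once(struct conn *conns, int n)   /* n <= NSERVERS */
    {
        struct pollfd pfds[NSERVERS];
        for (int i = 0; i < n; i++) {
            pfds[i].fd = conns[i].fd;
            pfds[i].events = POLLIN;
            if (conns[i].out_off < conns[i].out_len)
                pfds[i].events |= POLLOUT;   /* only if data is queued */
        }
        if (poll(pfds, n, -1) <= 0)
            return;
        for (int i = 0; i < n; i++) {
            if (pfds[i].revents & POLLOUT) {
                ssize_t w = write(conns[i].fd,
                                  conns[i].outbuf + conns[i].out_off,
                                  conns[i].out_len - conns[i].out_off);
                if (w > 0)
                    conns[i].out_off += (size_t)w;
            }
            if (pfds[i].revents & POLLIN) {
                ssize_t r = read(conns[i].fd, conns[i].inbuf,
                                 sizeof(conns[i].inbuf));
                if (r > 0)
                    conns[i].in_len = (size_t)r;
                /* ...hand inbuf to a parser; complete whichever
                 * requests now have enough data... */
            }
        }
    }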

2) The protocol always sends an ACK of some sort.

The interface my client provides doesn't require the caller to wait for ACKs. You tend to want to do that for get requests, but you may not care in the case of deletes or sets.

That is to say, you generally don't want to be left not knowing when something has finished (in the case of quiet gets in the binary protocol, you'll want a noop or a regular get at the end), but you can't really send a quiet get and then sit waiting just in case something starts arriving. Instead, just stream requests out and stream responses in. Line them up, and you're good to go.
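Here's a sketch of the lining-up, with made-up types: each in-flight op remembers the opaque value the binary protocol echoes back, and a response arriving for a later op tells you that every quiet op still ahead of it in the queue was a miss:

    /* Sketch only: an in-flight queue (front at index 0) of ops whose
     * responses must line up with the order they were sent. */
    #include <stdbool.h>
    #include <stdio.h>

    enum op_type { OP_GETQ, OP_GET, OP_NOOP };

    struct op {
        enum op_type type;
        unsigned opaque;   /* echoed back by the binary protocol */
        bool done;
    };

    /* Called with the opaque of each response read off the wire. */
    static void on_response(struct op *inflight, int n, unsigned opaque)
    {
        for (int i = 0; i < n; i++) {
            if (inflight[i].done)
                continue;
            if (inflight[i].opaque == opaque) {
                inflight[i].done = true;   /* a hit, or the noop/get */
                return;
            }
            /* A response for a later op means the server silently
             * skipped this quiet op: it was a miss. */
            if (inflight[i].type == OP_GETQ) {
                inflight[i].done = true;
                printf("getq opaque=%u: miss\n", inflight[i].opaque);
            }
        }
    }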

        Non-blocking IO means you're only waiting when there's nothing to do.

Take "set" for example. I can do a "set" which is non-blocking, but then I have to sit and spin either in the kernel or in user space waiting for the "STORED" to be returned. This seems to defeat the point of non-blocking IO.

You don't have to at all. A set is issued, its state is changed to waiting_for_response (or something like it), and it's added to an input queue. Then you start sending the next operation from your output queue. If a server starts sending stuff back to you, it's for whatever's at the front of your input queue (in the binary protocol, you can double-check this).
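In sketch form (made-up names; buffering, short writes, and errors are ignored):

    #include <string.h>
    #include <unistd.h>

    enum op_state { OP_QUEUED, OP_WAITING_FOR_RESPONSE, OP_COMPLETE };

    struct op {
        enum op_state state;
        const char *cmd;        /* e.g. "set foo 0 0 3\r\nbar\r\n" */
        struct op *next;
    };

    struct queue { struct op *head, *tail; };

    static void q_push(struct queue *q, struct op *o)
    {
        o->next = NULL;
        if (q->tail) q->tail->next = o; else q->head = o;
        q->tail = o;
    }

    static struct op *q_pop(struct queue *q)
    {
        struct op *o = q->head;
        if (o && !(q->head = o->next))
            q->tail = NULL;
        return o;
    }

    /* Run by the event loop whenever the socket is writable: send the
     * next op and park it on the input queue -- no waiting. */
    static void send_next(struct queue *output, struct queue *input, int fd)
    {
        struct op *o = q_pop(output);
        if (!o)
            return;
        (void)write(fd, o->cmd, strlen(o->cmd));  /* assume it all fits */
        o->state = OP_WAITING_FOR_RESPONSE;
        q_push(input, o);
    }

    /* Run whenever a complete response line arrives: it belongs to
     * whatever is at the front of the input queue. */
    static void on_response_line(struct queue *input)
    {
        struct op *o = q_pop(input);
        if (o)
            o->state = OP_COMPLETE;
    }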

I must be missing something about the above, since I can't see why there is a benefit to dealing with non-blocking IO on a set, if you will just end up waiting on the read() (ok, recv()).

        Not with my client (unless you want to).  :)

On a different, related note, I've noticed another issue with "set". When I send a "set foo 0 0 20\r\n", I have to just send that message. I can't just drop the "set" and the data to be stored in the same socket. If I do that, then the server removes whatever portion of the key was contained in the "set". Maybe this is my bug (though I can demonstrate it), but that seems like a waste. AKA, if on the server it's doing a read() for the set and tossing out the rest of the packet, then it's purposely causing two round trips for the same data.

By ``socket,'' do you mean ``packet?'' My client pipelines requests in such a way that multiple gets, sets, deletes, etc... can easily get stuffed into the same packet.
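Nothing stops a client from packing the command line and the data block into one buffer and issuing a single write(). A sketch (it punts on short writes and oversized values):

    #include <stdio.h>
    #include <string.h>
    #include <unistd.h>

    /* One write() carries "set <key> 0 0 <len>\r\n" plus the data
     * block plus the trailing "\r\n", so both halves of the set can
     * share a packet. */
    static ssize_t send_set(int fd, const char *key,
                            const void *data, size_t len)
    {
        char buf[8192];
        int hdr = snprintf(buf, sizeof(buf), "set %s 0 0 %zu\r\n", key, len);
        if (hdr < 0 || (size_t)hdr + len + 2 > sizeof(buf))
            return -1;                /* fall back to two writes */
        memcpy(buf + hdr, data, len);
        memcpy(buf + hdr + len, "\r\n", 2);
        return write(fd, buf, (size_t)hdr + len + 2);
    }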

Looking through all of this, I am hoping that the binary protocol, which I eagerly await reading, has a "set" which doesn't bother to tell me what the result of the "set" was. You could pump a lot more data into memcached if this was the case.

We can create a qset, but the semantics would need to be carefully considered. qget just keeps its errors silent and only returns positive results. Should a qset do the opposite, or should it never return anything at all?

        Here's a fun exercise to do with memcached:

Write out a bunch of set commands to a text file, followed by a quit. Pipe that into nc with output to /dev/null. This will do various fun pipelining and basically show you how fast it's possible to write. The speed isn't all that much of a protocol issue.
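Something along these lines will generate the input (a throwaway sketch; the key count and value size are arbitrary, and the file name gensets.c is just for the example):

    #include <stdio.h>

    int main(void)
    {
        const char *value = "xxxxxxxxxxxxxxxxxxxx";  /* 20 bytes */
        for (int i = 0; i < 100000; i++)
            printf("set key%d 0 0 20\r\n%s\r\n", i, value);
        printf("quit\r\n");
        return 0;
    }

Then:

    cc -o gensets gensets.c
    ./gensets | nc localhost 11211 > /dev/null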

--
Dustin Sallings

