Re: Alternative Binary Protocol idea for memcached.

Brian Aker Wed, 20 Feb 2008 19:09:14 -0800

Hi!

On Feb 21, 2008, at 8:12 AM, Clint Webb wrote:

Would anyone object if I first started working on abstracting theprotocol handling routines so that they are handled by callbackinstead of the current if/else setup?

When doing this, think about how to add arbitrary new callbacks foradditional functions.

AKA for instance I can write a lock daemon set of commands, and letmemcached handle it. Or for instance I want to add new language for aqueue.

I believe either Gearman or another SA project was looking towardusing the protocol in their next redesign.


Cheers,
        -Brian

Then I was going to put all the protocol stuff in prot_ascii.h|c andprot_binary.h|c respectively.
This way we could potentially have hundreds of protocols handledwithout having a lengthy if/else comparison.
On Wed, Feb 20, 2008 at 10:06 AM, Clint Webb <[EMAIL PROTECTED]>wrote:I wasn't expecting such a quick reply. Good point about allowingmultiple protocols. I might pull out some of my old code and seehow easy it is to drop in.
I thought I'd give a little background on myself and this protocolstyle. I used to work in the controls and automation industry. Ifyou've ever checked your luggage into an airport, or sent anythingthru USPS, UPS or FedEX, or bought anything online from amazon, b&nand other large online stores, or from Walmart, kmart, target,etc... then your product has likely had some experience with mycoding somewhere along the line.
I developed this protocol for a tiny little side-project for one ofthe above mentioned companies. It was basically a connector thattook information from a large number of different systems and passedit to another. The requirements changed a lot, so I developedsomething that could be fast, but had to be flexible. If I added afeature to the server, I didn't want to be forced to update all theclients as well. Also, some of the clients were tiny littleembedded controllers, so it had to be pretty simple too.
This solution was VERY fast, as all the commands are a single byteand could be easily mapped to an array of callback routines. Thisprotocol also had to run on a real-time system also, so we had toensure that all operations preformed in a predictable fashion.
I seperated the commands by their parameter type. 0-31 had noparameter. 32-63 had a single byte parameter. 64-95 had a 32-bitparameter. 96-127 had a single-byte paramter which was the lengthof a stream of data to follow (short-string). 128-159 had a 32-bitinteger that was the length of a stream of data to follow. Thiswas our 5 different parameter types. A command could only have oneparameter.
This way, the IO handling code could retrieve all the necessary dataand then pass it off to a callback routine for that command.
Each socket connection would have a set of buffers set aside for theincoming data. In this case we would want a buffer set aside tohold the key and value data.
To speed up processing and ensure that the minimum data set has beenprovided, we used a 32-bit (or was it 64-bit?) word as flags. Eachoperation would set or clear a flag(s). So when a GO command isreceived, it can quickly determine what 'action' needs to takeplace, and which 'parameters' have been provided.
If we ran out of room having to handle more than 256 commands, wewould use a command of 0xFF which would expect that the next bytewould be another command (from a set different to the first). Inever actually implemented it though. The most commands I ever usedwas about 100 or so.
I cant imagine that a variable-length structured protocol could bemuch faster than that. Still the emphasis of this protocol is notso much on speed, but on flexibility to add functionality to theprotocol (by adding commands) without breaking existing clients (andwithout having to handle multiple versions of the protocol).
The 'noreply' stuff that I have seen around the list could probablybenefit from this protocol. I haven't looked close enough at theCAS stuff either, but I suspect that would be easy to implement too.
Also, those that want to shave off a few extra bytes in theirclient, have the option of sending a request that only includes thebits they want. If you care about expiry leave it out, same withflags, tags, cas id's, and anything else. Plus you can stream-linesome of your requests by not using the CLEAR command, and re-usingthe state.
Dang, if I had a little more time on my hands right now, I'd bereally tempted to implement it. I don't actually have a *need* forthis protocol in memcached, it was purely an intellectual itch eversince I saw people complaining about the existing protocols beingdifficult to expand.
On Feb 20, 2008 4:14 PM, Dustin Sallings <[EMAIL PROTECTED]> wrote:

On Feb 19, 2008, at 22:20, Clint Webb wrote:

> I know a considerable amount of thought and work has gone into the
> existing flavour of the binary protocol, and I dont expect that work
> to be discarded, I'm really only mentioning this new concept now as
> an alternative for the future if we ever find the current binary
> protocol to be too restrictive and inflexible.  And something to
> think about, or even use elsewhere.
The is certainly interesting. The first step of doing thebinary
protocol implementation was to create a change that allowed multiple
protocols to coexist.  It would be possible to implement this to run
in parallel with the existing protocols in the 1.3 codebase.
Intuitively, it doesn't seem as efficient to process as whatwe've
got now, but I like being proven wrong, so I'd welcome another front-
end protocol.  :)
Of course, I wrote the majority of the current binaryprotocol code
about six months ago, so I'd really like to at least have one in more
people's hands.

--
Dustin Sallings




--
"Be excellent to each other"



--
"Be excellent to each other"


--
_______________________________________________________
Brian "Krow" Aker, brian at tangent.org
Seattle, Washington
http://krow.net/                     <-- Me
http://tangent.org/                <-- Software
http://exploitseattle.com/    <-- Fun
_______________________________________________________
You can't grep a dead tree.

Re: Alternative Binary Protocol idea for memcached.

Reply via email to