Debugging session on MINA ... ouch !

Emmanuel Lecharny Fri, 19 Feb 2010 06:35:20 -0800

Hi,

here is a memo describing the debugging session we did yesterday withJulien. I have added TODO each time I think we can do better.

MINA analyze when processing one single message. The server receives"hello" and should return "HELLO".

- We start in the main IoProcessor loop, after a select() returning 1 ona channel set with OP_READ, as we have just received the message.

- the process() method is called. It does two things :
1) if the session is readable and not suspended, read the channel

2) is the session is writable and not suspended, add the session to thequeue of session ready for flush, if not already there.

TODO : at this point, we have no idea if the session has some pendingwrites. Check if the OP_WRTE flag has been set on the selectionKey

- in the read() method, we allocate a buffer with the size configuredfor the session. Usually, it's way to big if you just have hundreds ofbytes to read, and if you set the config to 65000…


TODO : use smaller buffers

- Then we call the chain's messageReceived() method

- In the ProtocolCodecFilter, we grab the decoder instance, which isstored in the session's Attributes

TODO : There is no good reason to store the instance in the attributes.It should be part of the session parameters.


- then we create a new instance of a decoderOut.

TODO : The only reason we have such an object created here is that weneed to create a queue to store the result of the decoding.

- We now decode the incoming data, until the buffer is empty. We mayhave more than one message in the buffer, so each one of them is decodedand enqueued in the encoderOut queue.

- The doDecode() method is called. This is the user provided code. Thedecoded message is enqueued, and we do that until the doDecode() methodreturns false (meaning we don't have anymore message to decode)


- We call the encodedOut.flush() method to go up in the chain

TODO : it should be done directly here

- For each decoded message in the queue, call the next filtermessageReceived() event

TODO : we went through the tail filter, which is updating some statsused by the idle session handler, but this should be done in anotherplace. Everyone does not want to deal with idle sessions … My be anIdleFilter could help ?

- In the handler, we process the message, and write the response callingsession.write( response )

- The session.write() method create a writeFuture and a WriteRequestobject containing the response, the recipient's address and the future


TODO : this WriteFuture is never used.

- We go back to the chain in reverse order, firing the FilterWrite method

TODO : This should be a separate chain

- We get the encoded and we create an encoderOut instance, containing aqueue

TODO : here, unless the encoder creates more than one buffer to be sent,there is no need to create a queue. We can also call the encoder, getback the result and flush it, doing so receptively until all the encodedpieces has been generated (but I feel like we better get back a set ofbuffers instead as a response. No need of a queue)

- The user's provided encoder method is called, the encoded buffer isenqueued. It can still be a plain Message, if we want to go thoughanother encoder

- At this point, we call the flushWithoutFuture() method which loop onthe encoderOut queue to send all the encoded buffers

- for each encoded message in the queue, we create anEncodedWriteRequest encapsulating the encoded buffer, and we go down inthe chain

- The WriteRequest is enqueued in the session WriteRequestQueue, and ifthe session is not suspended for write, we call the processor.flus() method

TODO : there is no reason to call this method now. Also thesession.getProcessor() method ask the processor pool for the session'sprocessor, which is useless, as each session is attached to a singleIoProtocol. We must store not the pool, but the processor

- If the session is not already in the queue of session waiting to beflushed, we add it into this queue, then we wake up the selector

TODO : Waking up the selector at this point is questionable. It'sprobably totally useless.

- Ok, we go back, and call the filterWrite() method once again in theprotocolCodecFilter, with a new MessageWriteRequest instance,encapsulating the response.


TODO : WTF ??? Why do we process the same message twice ???

- In the HeadFilter, we grab the message, which is empty. In fact, thisempty message is used as a marker for the 'end of message'.Nevertheless, we add this empty message to the WriteRequestQueue

- We ask the processor to flush again the session, but as the sessionhas already been added into the scheduledForFlush queue, nothing is done

- And we are done with all the process() method. We come back to themain select loop

- Time to process the session waiting to be flushed now… This is donewith a call to the flush(currentTime) method. We check that we havesessions ready to be flushed first. If so, we loop on all those sessions.

Note : a session may be marked as ready to be flushed either because wejust have some new message for it, or because a big message hasn't bewritten to the client completely n a previous loop.

At first, we remove the session from the queue of scheduled sessions. Wewill put it back if the full message hasn't been written later. If thesession is open, we call the pocessor.flushNow() method

- There, we grab he WriteRequestQueue for this session. There issomething obscure done here : we compute some number based on the maxread and write buffer.


TODO : remove all this crapity…

We reset the OP_WRITE flag (it may have been set in a previous call).

The session stores the current buffer being flushed in a special holder.if it's null, we take the first message from the queue and stores itinto this session.

TODO : This is absolutely useless. At this point, we know exactly wherewe left when we wrote it to the client. it's enough to keep the bufferin the queue, peeking it instead of polling it. We just remove it whenthe message has been completely sent.

- We now call the writeBuffer() method, responsible for the writing ofthe data to the client.

TODO : There is a totally useless loop in this method, where we try towrite to the client up to 256 times, just in case the client is slow, Iguess. This is overkilling. We have *no idea* why this loop exists,except that back in the past, some strange bug was fixed with such a'workaround'

- The buffer is written to the channel, and we get back the number ofwritten bytes. If we have written all the bytes, we call thefireMessage() method. The currentWriteRequest is cleared in the session.The WriteFuture is now set to Written state (useless, as nobody usesthis information).


TODO : Get rid of useless tasks

- In the PotocolCodecFilter, as the WriteRequest is theencodedWriteRequest, we immediately return.

TODO Why the hell are we calling the messageSent() method at all as wejust don't process the information here ??? probably because we have noclue about the fact that the WriteRequest is a message at this point. Ithas to be fixed, it's sucking useless CPU

- Then we are back, and process the ext message in the queue, which isthe empty message (used as a end-of-message marker)

- The writeBuffer() method just do nothing, as this message is empty. Itjust call the messageSent() method to inform the handler that themessage has been sent. So we go up in the chain to the handler, and back.

- There is now a stupid things done (one more …) : as we didn't wroteanything, the number of written bytes is 0, so we consider that thesocket is full, and we switch the OP_WRITE flag to true. This is a 100%guarantee that we will do a full select loop again, for nothing…


TODO : FIX ME !!!

- When we are back from the flushNow() method, we get a false, as wedidn't' flushed the empty message. But we are done with the list ofsessions scheduled for a flush.

- and we get back to the select(), which will exit immediately with atleast one selectionKey set with OP_WRITE, as wear still waiting to writethe empty message !!!. That means we will process two reads for one write.

- once we enter again in the mail select loop, the select() returns 1,we go back to the process() method, do nothing in the read part, but putback the session onto the scheduledForFlush queue, in order to beprocessed again in the flush() call.

- In the flush() method, we reset the OP_WRITE flag, so that we aren'twoke up again, and we are done.

That's it !!! Lot of hacks, lot of useless things, lot of potentialspeedup expected !





--
Regards,
Cordialement,
Emmanuel Lécharny
www.nextury.com

Debugging session on MINA ... ouch !

Reply via email to