Re: [Pvfs2-developers] bmi testcontext/testunexpected

Rob Ross Tue, 06 Jan 2009 17:51:58 -0800

Hi Sam,

My take on your email was that you were combining the two issues, so I wanted to make sure that we were in agreement that the alternative API was preferred (not that I think we should necessarily do anything about it at the moment). I'm glad we are in agreement.

The terms "scheduling" and "priority" are being tossed around here in a way that I don't think is appropriate. The current testcontext does neither prioritization nor scheduling, and neither would the proposed modified API (as described thus far). The current BMI behavior is more like a bug than anything else, although changing the behavior at this point would require some significant regression testing.

The I/O forwarding system probably ought to use the non-blocking PVFS calls so that it can better deal with this scenario anyway, right?


Rob

On Jan 6, 2009, at 5:54 PM, Sam Lang wrote:

On Jan 6, 2009, at 5:03 PM, Rob Ross wrote:
I think if we had this alternative design and one wanted to have different priorities, one would look for messages under different contexts as you say. But when you don't care about priority, it would be nice to be able to get everything in one call.
I think you're arguing for a single testcontext function, instead of the testcontext/testunexpected split. I agree with that, but Phil and I are arguing about something else. Where should scheduling decisions be made? Within a BMI method, or by the API consumer? I'm arguing for the latter. Changing the API to be more consistent or user friendly doesn't affect where we choose to set the priority.
-sam
Rob

On Jan 6, 2009, at 4:57 PM, Sam Lang wrote:
Changing the API as you describe would actually bring back the original problem. As is, the BMI_tcp_testcontext call knows that there are unexpected messages waiting, so it returns immediately (expecting a call to testunexpected to follow). This is a specific policy hard-coded in the tcp method.
With just a single testcontext call and all expected and unexpected messages going to that context, the tcp code would have to put all the unexpected messages at the top of the context to give them priority. This would fix the particular problem that Nawab has, but it's still dictating policy (which messages get priority) from within the particular BMI method.
I agree that forcing the application to define the policy (with threads or timeouts) is moving the problem elsewhere, but it's moving the problem to where it belongs. It's our pvfs server that wants unexpected messages to have priority; the bmi code itself shouldn't dictate that priority. We could define interfaces to BMI that allow the policy to be set, but that's even further from where we are now.
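A minimal sketch of what "policy in the API consumer" could look like, as a self-contained toy model rather than real BMI code. Everything here is hypothetical (toy_pending, toy_policy, toy_next); it assumes the transport only reports what is pending and the caller decides what to service first:

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical pending counts, standing in for what the transport reports. */
struct toy_pending {
    int unexpected;  /* unexpected messages waiting */
    int expected;    /* completed expected operations waiting */
};

/* The policy lives in the consumer, not in the transport method. */
enum toy_policy {
    TOY_POLICY_UNEXPECTED_FIRST, /* e.g. a server that favors new requests */
    TOY_POLICY_EXPECTED_ONLY     /* e.g. a client blocked on one response */
};

/* Pick the next item to service: 'U' (unexpected), 'E' (expected), or 0. */
static char toy_next(struct toy_pending *p, enum toy_policy policy)
{
    assert(p != NULL);
    if (policy == TOY_POLICY_UNEXPECTED_FIRST && p->unexpected > 0) {
        p->unexpected--;
        return 'U';
    }
    if (p->expected > 0) {
        p->expected--;
        return 'E';
    }
    return 0; /* nothing to service under this policy */
}
```

With one unexpected and one expected item pending, a caller using TOY_POLICY_UNEXPECTED_FIRST services the unexpected message first, while a caller using TOY_POLICY_EXPECTED_ONLY takes the expected completion and leaves the unexpected one queued.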
-sam

On Jan 6, 2009, at 2:52 PM, Rob Ross wrote:
Yeah, a special named context for unexpected messages would be a clean way to have done things... -- Rob
On Jan 6, 2009, at 2:49 PM, Phil Carns wrote:
Yeah, I don't particularly like adding special cases either.
I feel like making the consumer play with timeouts or use an extra thread would be just as much of a hack/workaround, though. It's just moving the problem elsewhere.
Fundamentally it seems more like a BMI API flaw. It would have made more sense (for example) if unexpected messages were assigned to a specific context and the testunexpected() and testcontext() functions were combined. The consumer could then use a single test call to retrieve both unexpected and normal messages at once if they are in the same context (as in the pvfs2-server use case). Testing on a different context would ignore the presence of unexpected messages (as in the problem-triggering use case here).
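A rough sketch of that combined-call idea, as a self-contained toy model and not the BMI API. All names (toy_msg, toy_test) are hypothetical; it assumes every completion, expected or unexpected, carries the context it was assigned to:

```c
#include <assert.h>
#include <stddef.h>

enum toy_kind { TOY_EXPECTED, TOY_UNEXPECTED };

/* Hypothetical completion record: both kinds are tagged with a context. */
struct toy_msg {
    enum toy_kind kind;
    int context_id;  /* context the completion was assigned to */
    int tag;         /* payload stand-in */
    int consumed;    /* nonzero once handed back to the caller */
};

/*
 * Single test call: hand back up to max completions of either kind that
 * belong to ctx, in arrival order. Messages in other contexts are ignored.
 */
static int toy_test(struct toy_msg *queue, size_t len, int ctx,
                    int *out_tags, int max)
{
    assert(queue != NULL && out_tags != NULL);
    int n = 0;
    for (size_t i = 0; i < len && n < max; i++) {
        if (!queue[i].consumed && queue[i].context_id == ctx) {
            queue[i].consumed = 1;
            out_tags[n++] = queue[i].tag;
        }
    }
    return n;
}
```

A server that places unexpected messages in its main context gets both kinds from one call. A caller testing a different context never sees the unexpected messages, so their presence cannot cut its test short. Note that this sketch returns completions in arrival order and applies no priority.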
There are other ways to deal with it, that's just an example. We just need the API to better express the intention of the caller (preferably in one function) so that BMI doesn't have to optimize by guessing about what else is going on.
That is more work than just adding a flag, though :) It probably depends on if we think the use case is going to be around long enough to justify tweaking the API.
-Phil

Sam Lang wrote:
I've committed the set_info fix for this. I'm not crazy about it, but it should work for now. In the long term, we should probably move away from method specific hacks like this. I.e. it should be up to the API consumer (our server) to adjust timeouts or call testunexpected in a separate thread.
Nawab, in the zoidfs init code after initializing BMI you need to call:
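/* 0 presumably turns off the tcp method's check for waiting unexpected
 * messages in testcontext (the behavior discussed in this thread) */
int check = 0;
BMI_set_info(0, BMI_TCP_CHECK_UNEXPECTED, &check);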
-sam
On Dec 23, 2008, at 2:01 PM, Phil Carns wrote:
Sam Lang wrote:
Hi All,
I think Nawab has found a bug (or untested code path) in the BMI tcp method. He's running a daemon that both receives unexpected requests (as a server), and receives expected responses (as a client).
In the BMI_testcontext call, if there aren't any completed (expected) operations, and there are completed unexpected receives, we return immediately, assuming that BMI_testunexpected will be called in turn. I think the idea here is that we want to keep our latency down for unexpected messages, instead of doing work on expected messages while unexpected messages are waiting in the hopper. But the daemon is single threaded, and making blocking PVFS_sys_* calls, so we essentially spin forever calling BMI_testcontext over and over.
I'm not sure of the best way to fix this. Easy fixes would be to remove the check for completed unexpected receives, and/or do tcp_do_work for a shorter timeout.
It seems like we have a special case for blocking PVFS_sys_* calls. We want to ignore unexpected receives just in that case, and actually call tcp_do_work. In other contexts, I think we want the behavior that we have now, where we assume that a BMI_testunexpected call will follow a BMI_testcontext call. We could modify the testcontext call to take a separate parameter, but that seems messy. We might also be able to handle this with separate BMI contexts somehow...
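The spin described above can be reproduced in a small self-contained toy model. This is a sketch under stated assumptions, not the actual tcp method: every name (toy_method, toy_do_work, toy_testcontext, toy_blocking_wait) is hypothetical, and the check_unexpected field only stands in for the early-return check:

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for the tcp method's state. */
struct toy_method {
    int completed_expected;    /* finished expected operations */
    int completed_unexpected;  /* unexpected receives not yet collected */
    int expected_in_flight;    /* operations that network work would finish */
    bool check_unexpected;     /* models the early-return check */
    int work_calls;            /* how often the do-work stand-in ran */
};

/* Stand-in for tcp_do_work: each call completes one in-flight operation. */
static void toy_do_work(struct toy_method *m)
{
    m->work_calls++;
    if (m->expected_in_flight > 0) {
        m->expected_in_flight--;
        m->completed_expected++;
    }
}

/* Stand-in for testcontext: returns the number of expected completions. */
static int toy_testcontext(struct toy_method *m)
{
    assert(m != NULL);
    if (m->completed_expected == 0) {
        /* Early return: assume the caller will call testunexpected next. */
        if (m->check_unexpected && m->completed_unexpected > 0)
            return 0;
        toy_do_work(m);
    }
    int n = m->completed_expected;
    m->completed_expected = 0;
    return n;
}

/* A blocking call that only ever polls testcontext, never testunexpected.
 * Returns the number of polls needed, or -1 if it gave up. */
static int toy_blocking_wait(struct toy_method *m, int max_polls)
{
    for (int i = 0; i < max_polls; i++) {
        if (toy_testcontext(m) > 0)
            return i + 1;
    }
    return -1; /* no progress: the real daemon would spin forever */
}
```

With one unexpected receive waiting and the check enabled, toy_blocking_wait never reaches toy_do_work, so the in-flight operation never completes. With the check disabled, the first poll does the work and the wait finishes.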
I haven't dug in the code yet to see if I see any more elegant way to handle it, but I wanted to mention that if you want to add a special flag to toggle the behavior, it might be better to just set it globally with the set_info() function rather than modifying the testcontext() api. That way you don't have to change any of the other BMI methods. There are already a couple of similar set_info() calls to toggle BMI behavior for different use cases.
-Phil
_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers


