Re: [Pvfs2-developers] terminating state machines

Walter B. Ligon III Thu, 27 Jul 2006 07:57:33 -0700


Sam Lang wrote:

On Jul 26, 2006, at 6:16 PM, Phil Carns wrote:
I think I'm getting voted down here, so I should probably justshutup, but I don't think in practice we're going to have that manychild state machines that iterating through the list is at allcostly. I'm arguing for simpler mechanisms that fit in with thejob subsystem over something more fancy and possibly slightlybetter performing.
Well, as far as the number of SMs goes, I would rather not risk it.I still hope this is lightweight enough that we could eventually useit in more places that would generate a lot of children (like are-architected sys-io implementation), though I don't know if thatwill pan out in practice. I got bitten by a similar assumption inthe flow protocol- it used to track all of its posted operations fortesting rather than relying on someone to notify it of completion.Admittedly the flow protocol is a more obvious case and I should haveknown better, but at the time it seemed reasonable :)
Hmm...I had been thinking about a flow implementation that used the newconcurrent state machine code...it sounds like that's a bad ideabecause the testing and restarting would take too long to switchbetween bmi and trove? We use the post/test model through pvfs2though, so maybe I don't understand the issue.
I think that the way that you describe would work fine too, but itwould require a little more active work to check the status of thearray of child SMs and would require more code to keep track of them.
Probably a bit more code yes, but it seems cleaner than keepingaround backpointers and checking for parents. Instead of drivingall state machines from one place, this event notification schemeessentially replaces the last child state machine with the parent,which seems like a bit of hack and harder to debug.
I think I'm lost now. What do you mean by replace? The states arestill isolated, jobs trigger the transitions, only one state actiongets executed at a time, there still may be a time gap betweencompletion of any given child and when the parent picks up processingagain, and there are still frames. I think both approaches will lookthe same when running unless I missed something. If Walt puts alongjmp() in there we can both hit him over the head.
Heh.  Don't give him ideas! ;-)
I was operating under the constraint that a state machine can only posta job for itself. If I understand the current plan correctly, usingjob_null in the child state machine to post a job for the parent breaksthat constraint, and so in some sense is a replace (the job_nullactually takes the parent smcb pointer). I think you're probably rightthat its not a big difference either way, its just cleaner in my headto only have state machines posting jobs for themselves.
I think having a pointer to the parent actually improves debugability(though I'm not sure this approach actually requires it, all youreally need is either a job descriptor or a pointer to a counter).If I have a state machine that does something bad or gets stuck itwould be nice to be able to work backwards to find out who invokedit, without having to search for it in a seperate data structure.
I don't mean to keep struggling with this issue- I honestly thinkthat both approaches are pretty good, and if Walt implements it theway I think he is going to, then 95% of developers won't notice thedifference anyway. At this point I am mostly hammering away to makesure I am not missing a larger issue...
Walt probably got more discussion than he bargained for, but at theleast, lively discussion keeps me awake in the afternoon ;-).
-sam
-Phil

Good discussion. Phil has convinced me the level of dependency is low,and unless I completely misunderstand Sam, the complexity of the parentpointer/job_null approach is a lot less than the alternative, and I likelow complexity. I also think debugging will be simpler. So that'swhere I'm going.

I'll hae to think of other topics to get you guys going form time totime! ;-)


Now off to figure out a way to use setjmp/longjmp in my implementation!

Walt
--
Dr. Walter B. Ligon III
Associate Professor
ECE Department
Clemson University
_______________________________________________
Pvfs2-developers mailing list
Pvfs2-developers@beowulf-underground.org
http://www.beowulf-underground.org/mailman/listinfo/pvfs2-developers

Re: [Pvfs2-developers] terminating state machines

Reply via email to