Re: A new implementation of TaskManager

Peter Firmstone Tue, 06 Jul 2010 16:30:30 -0700

Patricia Shanahan wrote:

Peter Firmstone wrote:
Peter Firmstone wrote:
Patricia Shanahan wrote:
On 7/5/2010 8:39 PM, Peter Firmstone wrote:
Patricia Shanahan wrote:
Peter Firmstone wrote:
...
This is where you have to be careful of distributed computing. I
think you'd have to branch out in both directions, checking older and younger tasks, as the tasks' arrival may be a combination of processing
remote and local calls. In fact a task might arrive so far out of
sequence that it could be at the opposite end of the queue.
...
So what happens if Task x arrives so late that we have already done
the runAfter tests for Task y, but y should run after x?

Patricia
Hmm, yes I was just thinking that, need to look at the implementations
again...
I've checked both the comments and the code. The TaskManager class Javadoc comment talks about "not required to run after any of the tasks that precede it in the queue", and I believe that is the way it is implemented. For example, takeTask sets the size to i in the runAfter call for a candidate at index i in the list.
It seems to be the caller's responsibility to make sure that a task is not added to a TaskManager until after any task it needs to run after.
Patricia
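To illustrate the behaviour described above, here is a minimal sketch, not the River implementation. The nested Task interface is only a stand-in modelled on the runAfter(List, int) call mentioned in the message; SeqTask, eligible and the sequence numbers are hypothetical names invented for this example. It assumes a task is eligible to start when runAfter, given only the tasks ahead of it in the queue, returns false.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class RunAfterSketch {
    /** Hypothetical stand-in for a task with a runAfter(List, int) check. */
    public interface Task extends Runnable {
        /** True if this task must run after some task at an index below size. */
        boolean runAfter(List<Task> tasks, int size);
    }

    /** Hypothetical task that must follow the task numbered seq - 1. */
    public static final class SeqTask implements Task {
        final int seq;

        public SeqTask(int seq) { this.seq = seq; }

        @Override
        public boolean runAfter(List<Task> tasks, int size) {
            // Only tasks that precede the candidate in the queue are inspected.
            for (int i = 0; i < size; i++) {
                Task t = tasks.get(i);
                if (t instanceof SeqTask && ((SeqTask) t).seq == seq - 1) {
                    return true;
                }
            }
            return false;
        }

        @Override
        public void run() { /* work would go here */ }
    }

    /** Sequence numbers of the tasks that could be started right now. */
    public static List<Integer> eligible(List<Task> queue) {
        List<Integer> result = new ArrayList<>();
        for (int i = 0; i < queue.size(); i++) {
            Task candidate = queue.get(i);
            // size == i: the candidate is checked against earlier entries only.
            if (!candidate.runAfter(queue, i) && candidate instanceof SeqTask) {
                result.add(((SeqTask) candidate).seq);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<Task> inOrder = Arrays.<Task>asList(new SeqTask(1), new SeqTask(2));
        List<Task> outOfOrder = Arrays.<Task>asList(new SeqTask(2), new SeqTask(1));
        System.out.println(eligible(inOrder));    // prints [1]
        System.out.println(eligible(outOfOrder)); // prints [2, 1]
    }
}
```

With x = task 1 and y = task 2, adding x first holds y back, but adding y first leaves both eligible, so y may start before x. That is the case where the caller, not the queue, has to guarantee the ordering.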
Hmm, yes you have a point, that is the current behaviour.
It'll be interesting to see if we can simulate RemoteEvents arriving out of order with a time delay between them. I suspect that the state would just get muddled without any complaint. We'd need to test that with a current implementation, then decide which party should be responsible.
I'm interested in getting River to run on the Internet. Currently River / Jini is at home on local intranets, where there is probably a very low likelihood that RemoteEvents will be received out of order. I suspect this behaviour was overlooked or missed by the designer: the caller cannot always know; if it did, I wouldn't need a TaskManager that manages dependencies, just a FIFO queue. But I could be wrong. It's a pity the original author isn't around to comment. I find my understanding improves as I implement things; we don't have to know the right answer up front, so experiment away. I'm confident you'll work out a good solution, based on your comments to date.
My assumption (and that's all it is) is that tasks take long enough, and that the queue is long enough and the current locking scales poorly enough, for all RemoteEvents and thus tasks to arrive on the queue on a low latency network, which is why it has not been an issue. I suspect that you'll fix it so well that the queue will be empty, waiting on the network, and therefore the dependencies won't get checked at all.
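As a rough sketch of the simulation idea above, the following models events with a sequence number and an artificial network delay, then reports the order in which they would arrive. Everything here (the Event class, its fields, arrivalOrder, countLate) is a hypothetical illustration and does not use any River or Jini API; a real test would need to inject the delays into actual RemoteEvent delivery.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

public class OutOfOrderSketch {
    /** Hypothetical event: sequence number, send time and simulated delay. */
    public static final class Event {
        final long seq;
        final long sentAtMillis;
        final long delayMillis;

        public Event(long seq, long sentAtMillis, long delayMillis) {
            this.seq = seq;
            this.sentAtMillis = sentAtMillis;
            this.delayMillis = delayMillis;
        }

        long arrivesAt() { return sentAtMillis + delayMillis; }
    }

    /** Sequence numbers in simulated arrival order (stable for equal times). */
    public static List<Long> arrivalOrder(List<Event> sent) {
        List<Event> copy = new ArrayList<>(sent);
        copy.sort(Comparator.comparingLong(Event::arrivesAt));
        List<Long> order = new ArrayList<>();
        for (Event e : copy) {
            order.add(e.seq);
        }
        return order;
    }

    /** Counts events that arrive after an event with a higher sequence number. */
    public static int countLate(List<Long> arrival) {
        int late = 0;
        long highest = Long.MIN_VALUE;
        for (long seq : arrival) {
            if (seq < highest) {
                late++;
            } else {
                highest = seq;
            }
        }
        return late;
    }

    public static void main(String[] args) {
        List<Event> sent = new ArrayList<>();
        sent.add(new Event(1, 0, 50));  // slow path, e.g. over the Internet
        sent.add(new Event(2, 10, 5));
        sent.add(new Event(3, 20, 5));
        List<Long> order = arrivalOrder(sent);
        System.out.println(order);            // prints [2, 3, 1]
        System.out.println(countLate(order)); // prints 1
    }
}
```

A receiver that silently processes event 1 after events 2 and 3 is the "muddled state" case; counting late arrivals is one way a test could at least detect it.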
Thanks for your confidence.
My last job (I was a graduate student from 2002 to late last year) was as a large SPARC server platform architect. To improve prototype system testing, I wrote an extremely silly but extremely useful program called "parstore". It just block stores the floating point registers on a specified number of processors to memory, repeatedly, as fast as it can. The effect is to fill queues, and generally disturb and stress the interconnect. It never detected any errors, but prototypes were more likely to crash and operating system stress tests were more likely to fail while it was running.
If one of the River developers has an intranet test environment it may be possible to simulate the effect of running over the Internet by a similar trick. Create some workload that keeps the network very busy, and run it in parallel with a quality assurance test.
In some cases it may not matter which of two transactions is done first, but it is important to make sure there is a consistent order between them.
Patricia

Cool & Wow!

Still do most of my development on SPARC, but will migrate to Linux x86 shortly. I don't think Oracle or Fujitsu intend to support developers with SPARC workstations; mine's still going, but it's getting old.

Cheers,

Peter.
