Re: TaskManager progress

Peter Firmstone Wed, 21 Jul 2010 17:27:37 -0700

Hi Patricia,

Instead of Task passing an Iterable<Task>, it could pass anEnumerator<Task>, which is inherently immutable. One more comment inline.


Patricia Shanahan wrote:

On 7/21/2010 12:58 PM, Gregg Wonderly wrote:
Here are some thoughts/questions as I look at the code.  Thanks for
taking this task on!
Thanks for your comments. After several years of working on individualprojects with no code review, it is refreshing to be back in anenvironment in which other programmers will look at my code andcomment on it.
One general conclusion from your comments is that I had design rulesin my mind that I forgot to write down. That may be a bad habit I gotinto while I was the only reader of my code. It is not reasonable toexpect you to read my mind, rather than the code and its comments, tofind out how it is intended to work.
in add(Task)
is the order of taskToWrapper.put() vs addTasks.add() important
compared to the use of the contents of those? Is this an atomic
relationship that needs to involve some logic to make the 'view'
atomic in nature, such as reordering these two statements?
In TaskWrapper.endTask() the order is reversed and this implies
that the order of these statements is important to your design,
so I just want to make sure I understand their relationship.
Both structures are supposed to be kept consistent, and only accessedor modified in code that is synchronized on the TaskManager. I feel itis a bit tidier, even in single thread code, to unwind thingsbackwards from setting them up, but I don't feel strongly about thatand will change the order in one place or the other if you feel itwould enhance readability.
in remove(TaskWrapper,boolean)
Can there be a transition of states between the time the switch
statement is started and the time the specific case executes?
Do we need a lock here to hold the thread in its current state
so that WAITING and READY threads are correctly "stopped"?
A TaskWrapper's state should only be accessed or changed in code thatis synchronized on the TaskManager.
--runnable and ++runnable
This is not concurrency safe. I'd suggest an AtomicInteger instead,
especially if there is no other reason to use "synchronized" where
this is done. Visibility needs to be guaranteed using some happens
before as well.
runnableTasks should only be accessed or changed in code that issynchronized on the TaskManager.
When I write code of this nature, attempting to remove all contention, I
try
to list every "step" that changes the "view" of the world, and thinkabouthow that "view" can be made atomic by using explicit ordering ofstatements
rather than synchronized{} blocks. Visibility still has to work, so one
also
needs to worry about "happens before" as well. This looks like a really
good
start on the algorithmic steps. Hopefully, some others can look things
over and contribute any other issues or improvements they have.
I feel that avoiding "synchronized" often makes reasoning aboutconcurrency and visibility more complicated than it needs to be. Ifirst want to make TaskManager operations reasonably efficient. Onlyafter that, I'll consider making them more parallel, if benchmarkingindicates it is useful to do so.
For a TaskManager holding n tasks, there are only two O(n) operationsleft. This should reduce the need for parallelism, compared to theArrayList based design. One of them, construction of a list of pendingtasks, is inherently O(n) and probably infrequent. The other is therunAfter scan. Everything else is no worse than O(log n). The areawhere parallelism is most likely to be useful is in the runAfter scans.
My intent is to make this a very simple case for concurrency andvisibility issues. With one exception, all changes to the shared datastructures should be in blocks that are synchronized on theTaskManager, with all steps needed for a logical change done withoutreleasing synchronization. Anything that is done in a block that issynchronized on a given TaskManager happens-before any subsequentaction that is synchronized on the same TaskManager instance.
The exception is the physical removal of a TaskWrapper from thereadyTasks queue. The poll call has to be outside any block that isTaskManager synchronized, so the TaskRunner has to recheck terminatedand the status of the TaskWrapper after obtaining TaskManagersynchronization but before changing the status of the TaskWrapper toRUNNING. The PriorityBlockingQueue looks after its own concurrency andvisibility issues.
In considering your comments I found a bug in this area. If the statusof a TaskWrapper is changed to REMOVED in the timing window betweengetting it from the queue and getting synchronization, it has alreadybeen wrapped up by the remove code. Logically, the task is out of theTaskManager. The TaskWrapper is just hanging on by the reference inthe TaskRunner.
Which would be better, slapping "synchronized" on a private methodthat I only intend to call from synchronized code, or adding a commentdocumenting the intent?


Add a comment documenting the intent ;)

Thanks again for your comments.

Patricia
Gregg Wonderly

Patricia Shanahan wrote:
I've uploaded a new version with tabs for indentation.

Patricia


On 7/21/2010 9:33 AM, Patricia Shanahan wrote:
No problem. That sort of thing I'll handle later by reformatting after
setting up the project settings in Eclipse.

Patricia



On 7/21/2010 8:55 AM, Gregg Wonderly wrote:
One of my most desired attributes of code is that tabs be used inplaceof spaces. The reason for this, is so that I can change tabexpansion,
on the fly, to narrow or widen the view of nested blocks to help me
better see what is there.

This is a religious kind of issue, and I know there are countless
people
who think otherwise. As a VI user, I, countless times, have typed':set
ts=4 sw=4'
and ':set ts=8 sw=8' in code to change my viewpoint.
I know that others have reasons why they prefer spaces. I've justneverbeen able to find any override factors that make spaces a goodchoice,
especially when you are in an editor without a mouse.

Gregg Wonderly

Peter Firmstone wrote:
Thanks Patricia, looking good, will take some time to digest it
further.

We don't have a set of coding conventions, unless someone wants to
write a tool, there used to be one in com.sun.jini.tool, asevidenced
by one of the jtreg tests

trunk/qa/jtreg/com/sun/jini/tool/CheckCodeStyle
Perhaps there are some widely available tool that we could settleon?
I like to follow Kent Beck's style in his book ImplementationPatternsISBN-10 0-321-41309-1, it's quite a small book and makes easyreading,
but that's just my personal preference.

Cheers,

Peter.

Patricia Shanahan wrote:
I've uploaded my current work-in-progress code as
http://www.patriciashanahan.com/apache/NewTaskManager.java

Please send me any comments, questions, or suggestions for
improvement.

The change of name is temporary, to allow a smoother transition. I
plan to work through the callers, changing them one at a time touse
the new Task interface. When they have all been changed, and there
are no more TaskManager references, the name can be changed to
TaskManager.

I'll need to set up the correct formatting in Eclipse, but once I
find the rules that won't take long. Any other coding conventions I
need to watch out for?

Meanwhile, I'm working on more testing and benchmarking. It
definitely improves performance when there are a lot of tasks or
runAfter dependencies, but I need to do more testing for shorttasksin simple cases, the case in which it is most likely to be worsethan
the current code.

Patricia



On 7/20/2010 2:48 PM, Peter Firmstone wrote:
Looking forward to seeing some code. SVN builds clean again.


Patricia Shanahan wrote:
I did the first tests of my new TaskManager today. I can't
benchmark
very accurately because of a QA test running on the samecomputer,
but
it seems to be about the same without dependencies, and
significantly
faster with dependencies. Specifically, it removes the singletask
bottleneck.

I'll next do more testing, benchmarking, and tuning in my own
environment.

Patricia

Re: TaskManager progress

Reply via email to