Patricia Shanahan wrote:
On 7/21/2010 12:58 PM, Gregg Wonderly wrote:
...
When I write code of this nature, attempting to remove all contention, I try to list every "step" that changes the "view" of the world, and think about how that "view" can be made atomic by using explicit ordering of statements rather than synchronized{} blocks.  ...
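(An illustrative sketch of the quoted technique, not Gregg's actual code: one way to "order statements" so a new view is published atomically is to build a replacement off to the side and publish it with a single volatile write. The class and field names here are hypothetical, and it assumes a single writer.)

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;

public class ViewPublisher {
    // volatile guarantees that once a reader sees the new reference,
    // it also sees the fully built map (happens-before on the write).
    private volatile Map<String, Integer> view =
        Collections.emptyMap();

    // The writer builds a complete replacement, then publishes it with
    // one reference assignment -- the only step readers can observe.
    // NOTE: this copy-on-write update assumes a single writer thread;
    // concurrent writers would still need their own coordination.
    public void update(String key, int value) {
        Map<String, Integer> next = new HashMap<String, Integer>(view);
        next.put(key, value);
        view = next; // atomic publication point
    }

    // Readers never block; they see either the old view or the new one,
    // but never a half-built map.
    public Integer lookup(String key) {
        return view.get(key);
    }
}
```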

I would like to discuss how to approach performance improvement, and especially scaling improvement. We seem to have different philosophies, and I'm interested in understanding other people's approaches to programming.

I try to first find the really big wins, which are almost always data structure and algorithm changes. That should result in code that is efficient in terms of total CPU time and memory. During that part of the process, I prefer to keep the concurrency design as simple as possible, which in Java often means using synchronization at a coarse level, such as synchronization on a TaskManager instance.

I don't think we are too far apart. The big wins are the ones to go for. What I've learned over the years, debugging Java performance issues in Jini applications and elsewhere, is that "synchronized", while the most "correct" choice in many cases, is also the "slowest" form of concurrency control.

In a "service" application in particular, the total CPU time needed to perform the actual work is often a small fraction of the latency that "synchronized" injects into the execution path. I think you understand this issue, but I want to illustrate it to make sure.

File I/O and network I/O latency can create similar circumstances. Consider some numbers (I use ms, but any magnitude shows the same behavior) such as the following:

2ms to enter server (Security/Permission stuff)
1ms CPU to arrive at synchronized
3ms through synchronized lock
1ms to return result

Then if there are 10 such threads running through the server, eventually all of them will be standing in line at the synchronized block (monitor entry), because they can get through all the other stuff in short order. As the total number of threads increases, the "synchronized" section time must be multiplied by the number of threads, so with 10 threads it becomes 30ms, because each thread must wait its turn. Thus, 30ms becomes the minimum latency through that part of the system, instead of the 3ms there would be with 1 thread.
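(The queuing arithmetic above can be written down as a tiny model. The numbers are the assumed figures from the text, not measurements, and the class is purely illustrative: the parallel parts count once, but the serialized critical section multiplies by the number of threads waiting.)

```java
public class ContentionModel {
    static final int ENTER_MS  = 2; // security/permission stuff
    static final int ARRIVE_MS = 1; // CPU to arrive at synchronized
    static final int LOCKED_MS = 3; // time inside the synchronized block
    static final int RETURN_MS = 1; // returning the result

    // Worst-case latency for the last of n threads that all arrive at
    // the monitor together: only the locked section serializes.
    static int worstCaseLatencyMs(int n) {
        return ENTER_MS + ARRIVE_MS + n * LOCKED_MS + RETURN_MS;
    }
}
```

With n = 1 the lock contributes 3ms; with n = 10 it contributes 30ms, dominating everything else in the path.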

So, eliminating synchronized as a "global" encounter is what I always consider first.

In my earlier email, I talked about using ConcurrentHashMap instead of HashSet specifically because of this issue. ConcurrentHashMap distributes locks amongst groups of key values, so there is nowhere near the amount of "standing in line" that there would be if you used HashSet with synchronized access to it.
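(A minimal sketch of that substitution, with hypothetical names: since Java 6, Collections.newSetFromMap can wrap a ConcurrentHashMap to give a Set with striped locking, so threads touching different keys rarely queue behind one another the way they would on one synchronized HashSet.)

```java
import java.util.Collections;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

public class RegisteredKeys {
    // Set view backed by ConcurrentHashMap: writes to different lock
    // stripes proceed in parallel, reads do not block at all.
    private final Set<String> keys =
        Collections.newSetFromMap(new ConcurrentHashMap<String, Boolean>());

    public boolean register(String key) {
        return keys.add(key); // false if already present
    }

    public boolean isRegistered(String key) {
        return keys.contains(key);
    }
}
```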

Once that is done, I review the performance. If it is fast and scalable I stop there. If that is not the case, I look for the bottlenecks, and consider whether parallelism, or some other strategy, will best improve them. Any increase in concurrency complication has to be justified by a demonstrated improvement in performance.

My big picture objective is to find the simplest implementation that meets the performance requirements (or cannot reasonably be made significantly faster, if the requirement is just "make it fast"). I value simplicity in concurrency design over simplicity in data structures or algorithms for two reasons:

1. Making the code more parallel does nothing to reduce the total resources it uses. Better algorithms, on the other hand, can significantly reduce total resources.

2. Reasoning about data structures and algorithms is generally easier than reasoning about concurrency.

It sounds as though you are advocating almost the opposite approach - aim for maximum concurrency from the start, without analysis or measurement to see what it gains, or even having a baseline implementation for comparison. Is that accurate? If so, could you explain the thinking and objectives behind your approach? Or maybe I'm misunderstanding, and you can clarify a bit?

I think we are thinking alike, but just have some slightly different order of attention to the details.

Because I've been affected by so many concurrency issues, it is one of the top things that I look at. I do consider, first, how common the execution path is before spending inordinate amounts of time there.

I've found ConcurrentHashMap to be a good solution to a number of things, because it does so well at removing concurrency issues.

Thanks,

Patricia


Gregg
