I have a similar mindset to Gregg: memory and disk are relatively
inexpensive these days, so if I can avoid locks by using atomic
operations, immutable objects or concurrent utilities, I'm happy,
since that's one less possible deadlock or livelock bug I haven't
thought about.
If updated state doesn't depend on previous state, I'll go for an
immutable object with a volatile reference. If the object is not
immutable and it can be defensively copied, I do that before updating
the volatile reference and I defensively copy it again before returning
it to a caller.
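Something roughly like this, for example; the class and field names are
just made up for illustration:

    import java.util.Date;

    class ConfigHolder {
        // Immutable object published through a volatile reference:
        // readers always see a fully constructed Config, no locking.
        private volatile Config config = new Config("default");

        Config getConfig() {
            return config;      // safe to hand out, Config is immutable
        }

        void setConfig(Config c) {
            config = c;         // atomic reference write
        }

        // A mutable object (Date) is handled by defensive copying.
        private volatile Date lastUpdated = new Date(0L);

        void setLastUpdated(Date d) {
            lastUpdated = new Date(d.getTime());  // copy before publishing
        }

        Date getLastUpdated() {
            return new Date(lastUpdated.getTime()); // copy before returning
        }
    }

    final class Config {        // immutable: final field, no setters
        private final String name;
        Config(String name) { this.name = name; }
        String name() { return name; }
    }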
If updated state depends on previous state, I might use an immutable
object with an AtomicReference, where the update is only made when no
other update was received in the interim. If I can, I try to make
objects effectively immutable, with defensive copying.
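A compareAndSet retry loop along these lines is the sort of thing I
mean; Counts and HitCounter are invented names, just to show the shape
of it:

    import java.util.concurrent.atomic.AtomicReference;

    final class Counts {        // immutable snapshot of the state
        final long hits;
        Counts(long hits) { this.hits = hits; }
        Counts increment() { return new Counts(hits + 1); }
    }

    class HitCounter {
        private final AtomicReference<Counts> ref =
                new AtomicReference<Counts>(new Counts(0));

        void hit() {
            Counts expect;
            Counts update;
            do {
                expect = ref.get();          // previous state
                update = expect.increment(); // new state derived from it
                // compareAndSet only succeeds if no other update arrived
                // in the interim; otherwise loop and recompute from the
                // latest state.
            } while (!ref.compareAndSet(expect, update));
        }

        long hits() {
            return ref.get().hits;
        }
    }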
If internal accessor methods don't need to concern themselves with a
reference update during a routine, I copy the object's reference into a
local variable rather than synchronize on it; the local copy will still
refer to the old object when the volatile reference is updated. If the
routine is in a loop and I want to restart it when the reference is
updated, I'll use while (a == b) (or something similar), where b is a
local reference to the object referred to by a until a is changed.
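Something like the following sketch; Worker, Item, items and process()
are all invented names:

    import java.util.Collections;
    import java.util.List;

    class Worker {
        // The whole list is replaced on update, never mutated in place.
        private volatile List<Item> items = Collections.emptyList();

        void setItems(List<Item> newItems) {
            items = newItems;
        }

        // Routine that doesn't care if the reference changes part-way
        // through: copy the reference once and work on that snapshot.
        int countLarge() {
            List<Item> snapshot = items; // local copy of the reference
            int n = 0;
            for (Item i : snapshot) {
                if (i.isLarge()) n++;
            }
            return n;                    // consistent view of the old list
        }

        // Routine that restarts when the reference is updated, using the
        // while (a == b) idiom: 'items' is a, 'snapshot' is b.
        void processAll() {
            boolean done = false;
            while (!done) {
                List<Item> snapshot = items;
                int index = 0;
                while (snapshot == items && index < snapshot.size()) {
                    process(snapshot.get(index++));
                }
                // Finished cleanly only if the reference never changed
                // underneath us; otherwise go around with the fresh list.
                done = (snapshot == items);
            }
        }

        private void process(Item item) { /* ... */ }
    }

    interface Item { boolean isLarge(); }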
I try to keep synchronized blocks as small as possible, not so much for
performance as to avoid bugs, and not even necessarily my own bugs but
concurrency bugs in client code. Inside a synchronized block I don't
call methods on objects that may be accessible from outside the object
I'm calling from. State that needs to be updated atomically I group
together under the same lock, and I also consider a ReadWriteLock if
reads will outnumber writes. If multiple objects must be updated
atomically, I might group them together into an encapsulating object
with the methods needed to make the update atomic; that's better than
holding multiple locks.
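A rough sketch of what I mean, with made-up names: two fields that must
change together, kept behind a single ReadWriteLock:

    import java.util.concurrent.locks.ReadWriteLock;
    import java.util.concurrent.locks.ReentrantReadWriteLock;

    class Bounds {
        private final ReadWriteLock lock = new ReentrantReadWriteLock();
        private int lower;
        private int upper;

        // Reads are expected to outnumber writes, so many readers can
        // hold the read lock at once.
        int range() {
            lock.readLock().lock();
            try {
                return upper - lower;
            } finally {
                lock.readLock().unlock();
            }
        }

        // Both fields are updated atomically under the one write lock.
        void set(int newLower, int newUpper) {
            if (newLower > newUpper) throw new IllegalArgumentException();
            lock.writeLock().lock();
            try {
                lower = newLower;
                upper = newUpper;
            } finally {
                lock.writeLock().unlock();
            }
        }
    }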
On some occasions I find a simple class that isn't threadsafe at all is
the best approach, letting something else handle the concurrency or
ensuring it's only used by one thread.
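A common case of that is confining a non-threadsafe class to a single
thread at a time, e.g. SimpleDateFormat held in a ThreadLocal (just a
sketch):

    import java.text.SimpleDateFormat;
    import java.util.Date;

    class Timestamps {
        // SimpleDateFormat isn't threadsafe, so each thread gets its
        // own instance instead of sharing one behind a lock.
        private static final ThreadLocal<SimpleDateFormat> FORMAT =
                new ThreadLocal<SimpleDateFormat>() {
                    @Override
                    protected SimpleDateFormat initialValue() {
                        return new SimpleDateFormat("yyyy-MM-dd HH:mm:ss");
                    }
                };

        static String format(Date d) {
            return FORMAT.get().format(d);
        }
    }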
For me it basically comes down to avoiding bugs first, followed by scale.
Obviously memory consumption can be an impediment to scale, so there are
occasions where this is the wrong approach, but it's a generalisation,
to be taken with a grain of salt.
If memory is an issue, there usually isn't much concurrency to be had;
if that's the case, then good old-fashioned synchronization, or none at
all, might be the best way to go.
In that case, I might consider an interface with separate
implementations for different platforms: one tuned for memory, the
other for concurrency.
It's true that concurrency is harder; people often forget to check the
return value of putIfAbsent on ConcurrentMap.
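The usual get-or-create idiom depends on that return value to tell you
whose object actually went into the map; Registry and Session here are
made-up names, just to show the check:

    import java.util.concurrent.ConcurrentHashMap;
    import java.util.concurrent.ConcurrentMap;

    class Registry {
        private final ConcurrentMap<String, Session> sessions =
                new ConcurrentHashMap<String, Session>();

        Session getOrCreate(String key) {
            Session s = sessions.get(key);
            if (s == null) {
                Session candidate = new Session(key);
                s = sessions.putIfAbsent(key, candidate);
                if (s == null) {
                    s = candidate;  // our candidate won the race
                }
                // If s != null here, another thread got in first: use
                // its Session and discard our candidate.
            }
            return s;
        }
    }

    class Session {
        private final String key;
        Session(String key) { this.key = key; }
    }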
Horses for courses, I suppose; everyone has their style, and you don't
have to adopt mine, I'm just happy to have some help. There's plenty of
code in River that uses synchronized and has no issues. You probably
have enough experience to avoid the locking bugs by now, and I'm happy
with your approach. It's probably more performant than mine ;) Some
concurrency utilities can chew up some memory.
Maybe it's a reflection of my debugging abilities ;)
Cheers,
Peter.
Patricia Shanahan wrote:
On 7/21/2010 12:58 PM, Gregg Wonderly wrote:
...
When I write code of this nature, attempting to remove all contention,
I try to list every "step" that changes the "view" of the world, and
think about how that "view" can be made atomic by using explicit
ordering of statements rather than synchronized{} blocks. ...
I would like to discuss how to approach performance improvement, and
especially scaling improvement. We seem to have different
philosophies, and I'm interested in understanding other people's
approaches to programming.
I try to first find the really big wins, which are almost always data
structure and algorithm changes. That should result in code that is
efficient in terms of total CPU time and memory. During that part of
the process, I prefer to keep the concurrency design as simple as
possible, which in Java often means using synchronization at a coarse
level, such as synchronization on a TaskManager instance.
Once that is done, I review the performance. If it is fast and
scalable I stop there. If that is not the case, I look for the
bottlenecks, and consider whether parallelism, or some other strategy,
will best improve them. Any increase in concurrency complication has
to be justified by a demonstrated improvement in performance.
My big picture objective is to find the simplest implementation that
meets the performance requirements (or cannot reasonably be made
significantly faster, if the requirement is just "make it fast"). I
value simplicity in concurrency design over simplicity in data
structures or algorithms for two reasons:
1. Making the code more parallel does nothing to reduce the total
resources it uses. Better algorithms, on the other hand, can
significantly reduce total resources.
2. Reasoning about data structures and algorithms is generally easier
than reasoning about concurrency.
It sounds as though you are advocating almost the opposite approach -
aim for maximum concurrency from the start, without analysis or
measurement to see what it gains, or even having a baseline
implementation for comparison. Is that accurate? If so, could you
explain the thinking and objectives behind your approach? Or maybe I'm
misunderstanding, and you can clarify a bit?
Thanks,
Patricia