On 8/9/10 4:16 PM, Matthew Swift wrote:


On 28/07/10 13:07, Emmanuel Lecharny wrote:
 On 7/28/10 11:31 AM, Stefan Seelmann wrote:
I was thinking lately about the DN class. I know that OpenDS (and probably UnboundId, but I'm not sure) has a DN.valueOf( "<a DN>" ) factory that returns a
DN, potentially leveraging a cache associated with a ThreadLocal.

...
I don't think it's such a good idea:
- first, as it's ThreadLocal based, you will have as many caches as you have threads processing requests. I'm not sure it competes with a unique cache, nor
that we can't use the memory in a better way...
An advantage of using ThreadLocal is that you don't need to synchronize
access to the cache. It could be worth measuring the performance.
Using a ConcurrentHashMap should not incur a major performance penalty. I mean, it *will* be more costly than not having any synchronization, but it sounds acceptable.


Unfortunately, a CHM won't help either, since you need to manage cache eviction, assuming that you want the cache to have a finite size. LinkedHashMap has an eviction strategy that can be defined by overriding the removeEldestEntry method, but unfortunately LHM is not thread-safe.
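For reference, the removeEldestEntry hook mentioned above can be used to build a bounded LRU cache in just a few lines. This is only an illustrative sketch (class name is made up), and, as noted, it is not thread-safe on its own:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical sketch: a bounded LRU cache built on LinkedHashMap's
// removeEldestEntry hook. NOT thread-safe -- a shared DN cache would
// need external synchronization around every access.
class LruDnCache<K, V> extends LinkedHashMap<K, V> {
    private final int maxEntries;

    LruDnCache(int maxEntries) {
        // accessOrder = true gives true LRU ordering, not just
        // insertion order, so get() refreshes an entry's position.
        super(16, 0.75f, true);
        this.maxEntries = maxEntries;
    }

    @Override
    protected boolean removeEldestEntry(Map.Entry<K, V> eldest) {
        // Evict the least recently used entry once the cache is full.
        return size() > maxEntries;
    }
}
```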

Doh !!! I should have thought about it immediately... That's the problem when you are pushing random thoughts on the ML instead of *really* coding them.

difference, I wonder if the OpenDS team did some performance analysis?


I did some testing a while back and I have forgotten the exact figures I got. I do remember finding a substantial performance improvement when parsing DNs with caching enabled - something like 30ns with caching vs 300ns without for DNs containing 4 RDN components (i.e. about an order of magnitude, IIRC).

We implement our DNs using a recursive RDN + parent DN structure, so we are usually able to fast-track the decoding process down to a single RDN for DNs having a common ancestor (which is pretty common).
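The recursive structure described above could be sketched roughly as follows. All names here are hypothetical (this is not the actual OpenDS code), the RDN split is deliberately naive (a real parser must honor escaped commas per the DN syntax), and normalization is omitted:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Illustrative sketch: a DN as an RDN plus a reference to its parent DN.
// When the parent's string form is already cached, decoding
// "uid=user,<parent>" only needs to parse the first RDN.
final class Dn {
    final String rdn;   // e.g. "uid=user.1" (normalization omitted)
    final Dn parent;    // null for a single-RDN DN

    Dn(String rdn, Dn parent) { this.rdn = rdn; this.parent = parent; }

    @Override public String toString() {
        return parent == null ? rdn : rdn + "," + parent;
    }
}

final class DnDecoder {
    private final Map<String, Dn> cache = new ConcurrentHashMap<>();

    Dn decode(String s) {
        Dn cached = cache.get(s);
        if (cached != null) return cached;       // full DN already seen
        int comma = s.indexOf(',');              // naive split: a real
        if (comma >= 0) {                        // parser handles escapes
            // Fast track: the recursive call is a cache hit whenever
            // the parent DN was decoded before (the common case).
            Dn parent = decode(s.substring(comma + 1));
            Dn dn = new Dn(s.substring(0, comma), parent);
            cache.put(s, dn);
            return dn;
        }
        Dn dn = new Dn(s, null);
        cache.put(s, dn);
        return dn;
    }
}
```

Note how two sibling entries under the same parent end up sharing the very same parent Dn instance.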

Our idea was that the first step would be to quickly compute the hashcode of the DN (as a String) and check whether it has already been parsed. If not, we fall back to plain parsing. But having the low-level DN stored in the cache is a good idea.

There are definitely many options; we should conduct some perf tests based on real-world DNs to see which is best.

We opted for the ThreadLocal approach due to the synchronization limitations of using a single global cache. However, I have often worried about this approach, as it will not scale for applications with large numbers of threads, resulting in OOM exceptions.
Yes, true. But this is also a fast-track solution, bringing immediate benefits.
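The ThreadLocal approach could be sketched like this. Everything here is illustrative (made-up names, and a trivial lower-casing stand-in for real DN parsing); the point is that each thread owns a small, unsynchronized LRU map, so lookups need no locking, at the cost of one cache per thread:

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hedged sketch of the ThreadLocal approach: lock-free per-thread
// caches, with the memory/OOM trade-off discussed above.
final class ThreadLocalDnCache {
    private static final int MAX_ENTRIES = 1000;

    // One bounded LRU map per thread; no synchronization needed.
    private static final ThreadLocal<Map<String, String>> CACHE =
        ThreadLocal.withInitial(() ->
            new LinkedHashMap<String, String>(16, 0.75f, true) {
                @Override
                protected boolean removeEldestEntry(Map.Entry<String, String> e) {
                    return size() > MAX_ENTRIES;
                }
            });

    // Returns the cached parsed form, computing and caching it on a miss.
    static String valueOf(String rawDn) {
        return CACHE.get().computeIfAbsent(rawDn, ThreadLocalDnCache::parse);
    }

    private static String parse(String rawDn) {
        // Stand-in for real DN parsing/normalization.
        return rawDn.toLowerCase();
    }
}
```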

Another approach I have thought about is to use a single global two-level cache comprising a fixed-size array of LinkedHashMaps (think of it as a Map of Maps), each one having its own synchronization. We then distribute the DNs over the LHMs and amortize the synchronization costs across multiple locks (in a similar manner to CHM).
Another aspect we are interested in is the pinning of frequently used DNs (cn=schema, etc.). Not sure it's worth the effort though...

This idea needs testing. In particular, we'd need to figure out the optimal array size (i.e. the number of locks / LHMs). For example, distributing the cache over 16 LHMs is not going to help much for really big multi-threaded apps with 16000+ threads (1000 threads contending per lock).
But are you going to have 16K threads anyway?
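For the record, the striped two-level cache could look something like the sketch below (illustrative names, not the OpenDS code): a DN's hash selects one of a fixed number of synchronized LHM stripes, so contention is spread across the stripes much as CHM spreads it across its segments.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hedged sketch of a two-level, lock-striped DN cache: a fixed array of
// synchronized bounded LinkedHashMaps, indexed by the DN's hash.
final class StripedDnCache {
    private static final int STRIPES = 16;          // must be a power of two
    private static final int MAX_PER_STRIPE = 1000;

    @SuppressWarnings("unchecked")
    private final Map<String, String>[] stripes = new Map[STRIPES];

    StripedDnCache() {
        for (int i = 0; i < STRIPES; i++) {
            stripes[i] = new LinkedHashMap<String, String>(16, 0.75f, true) {
                @Override
                protected boolean removeEldestEntry(Map.Entry<String, String> e) {
                    return size() > MAX_PER_STRIPE;
                }
            };
        }
    }

    private Map<String, String> stripeFor(String dn) {
        // Spread the hash bits, then mask down to a stripe index.
        int h = dn.hashCode();
        h ^= (h >>> 16);
        return stripes[h & (STRIPES - 1)];
    }

    String get(String dn) {
        Map<String, String> m = stripeFor(dn);
        synchronized (m) { return m.get(dn); }   // lock only this stripe
    }

    void put(String dn, String parsed) {
        Map<String, String> m = stripeFor(dn);
        synchronized (m) { m.put(dn, parsed); }
    }
}
```

This also makes the contention problem below concrete: every DN whose cache key hashes to the same stripe (e.g. a shared ancestor) serializes on that one stripe's lock.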

A major problem with this approach, if we choose to use it in the OpenDS SDK, is that common ancestor DNs (e.g. "dc=com") are going to end up in a single LHM, so, with our current design (RDN + parent DN), all decoding attempts will usually end up contending on the same lock anyway :-( So we may need to change our DN implementation to better cope with this caching strategy.

We are not alone though: a concurrent Map implementation that can be used for caching in a similar manner to LHM is one of the most frequently requested enhancements to the java.util.concurrent library.
There might be some other data structure available; we may need to do some research in this area...


--
Regards,
Cordialement,
Emmanuel Lécharny
www.iktek.com
