Re: Query Performance and Optimization

Marcel Reutegger Wed, 14 Mar 2007 05:08:01 -0800

Christoph Kiehl wrote:

Christoph Kiehl wrote:
I was digging a bit into Jackrabbit today and found another placewhere some caching did provide a substantial performance gain toqueries which check one attribute for more than one value (like/foo/[EMAIL PROTECTED]:bar='john' or foo:bar='doe']). The BitSet incalculateDocFilter() is right now created twice for the query above.On large repositories this takes about 200ms per BitSet on my machinefor a particular field. Caching these BitSets per IndexReader andfield in a WeakHashMap with the IndexReader as a key gave me some realimprovements.

agreed, this should definitively be cached per index segment and is doable withreasonable effort.


I've created a jira issue: http://issues.apache.org/jira/browse/JCR-791

Replying to myself ;):
- I was referring to calculateDocFilter() inorg.apache.jackrabbit.core.query.lucene.MatchAllScorer- The achieved performance improvement varied between 30-60% dependingon the actual query


but that means your query is rather:

/foo/[EMAIL PROTECTED]:bar]

right?

@foo:bar='john' should be translated into a term query.

regards
 marcel

Re: Query Performance and Optimization

Reply via email to