Re: [jira] Commented: (LUCENE-855) MemoryCachedRangeFilter to boost performance of Range queries

robert engels Thu, 04 Dec 2008 14:04:46 -0800

The biggest benefit I see of using the field cache to do filtercaching, is that the same cache can be used for sorting - therebyimproving the performance and memory usage.

The downside I see is that if you have a common filter that is builtfrom many fields, you are going to use a lot more memory, as everyfield used needs to be cached. With my code you would only have asingle "bitset" for the filter.


On Dec 4, 2008, at 4:00 PM, robert engels wrote:

Lucene-831 is far more comprehensive.
I also think that by exposing access to the sub-readers it can befar simpler (closer to what I have provided).
In the mean-time, you should be able to use the provided class witha few modifications.
The "reload the entire cache" was a deal breaker for us, so I cameup the attached. Works very well.
On Dec 4, 2008, at 3:54 PM, Uwe Schindler wrote:
I am looking all the time to LUCENE-831, which is a new version of
FieldCache that is compatible with IndexReader.reopen() andinvalidates onlyreloaded segments. In each release of Lucene I am very unhappy,because itis still not in. The same problem like yours is if you have a onemilliondocuments index that is updated by adding a few documents eachhalf hour. Ifyou use sorting by a field, whenever the index is reopened and youreallyonly a very small segment is added, nevertheless the completeFieldCache is
rebuild, very bad :(.
So I think the ultimative fix would be to hopefully applyLUCENE-831 soon
and also use LUCENE-1461 as RangeFilter cache.

-----
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: [EMAIL PROTECTED]
________________________________________
From: robert engels [mailto:[EMAIL PROTECTED]
Sent: Thursday, December 04, 2008 9:39 PM
To: java-dev@lucene.apache.org
Subject: Re: [jira] Commented: (LUCENE-855)MemoryCachedRangeFilter to boost
performance of Range queries

I can't seem to post to Jira, so I am attaching here...

I attached QueryFilter.java.
In reading this patch, and other similar ones, the problem seemsto be thatif the index is modified, the cache is invalidated, causing acomplete
reload of the cache. Do I have this correct?
The attached patch works really well in a highly interactiveenvironment, as
the cache is only invalidated at the segment level.

The MyMultiReader is a subclass that allows access to the underlying
SegmentReaders.
The patch cannot be applied, but I think the implementation worksfar betterin many cases - it is also far less memory intensive. Scanning thebitset
could also be optimized very easily using internal skip values.
Maybe this is completely off-base, but the solution has workedvery well forus. Maybe this is a completely different issue and separateincident should
be opened ?

is there any interest in this?



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: [jira] Commented: (LUCENE-855) MemoryCachedRangeFilter to boost performance of Range queries

Reply via email to