Re: Ability to sort integer fields on large index

Mark Miller Fri, 04 Apr 2008 10:07:33 -0700

You should use DateTools to break up your time stamp into multiple
fields. This can work a lot faster than using a field with so many
different terms.


Are you using a RangeQuery? If you are, ditch it and use a
ConstantScoreRangeQuery. The former will expand the query to a boolean
that contains each matching unique term while the latter will just score
1 for each matching doc.


> My questions:
> - Is Lucene sorting then search or is it doing search then sort the  
> results?
> - Is there a way to get this type of searches to return faster with  
> heavier weight on the timestamp not on relevancy?

I believe that Lucene is scoring docs, and as it scores them, dropping
them into a priority queue that sorts...normally by the score, but
optionally by fields as well.



- Mark

On Fri, 2008-04-04 at 09:49 -0700, Fleming Shi wrote:
> Here is the problem:
> 
> - Single Large index with upto 200 million documents
> - Each document contains field using epoch timestamp format (padding  
> is required when creating range requests)
> - One of the frequently used search query,  a range request on the  
> timestamp field (10 digits)
> - Other searches are fine, but the search using range request  
> (1207190000 to 1207190258), even small result set, still slow when  
> "sort" is requested
> 
> My questions:
> - Is Lucene sorting then search or is it doing search then sort the  
> results?
> - Is there a way to get this type of searches to return faster with  
> heavier weight on the timestamp not on relevancy?
> 
> Any insight would be greatly appreciated
> 
> /Regards
> 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Ability to sort integer fields on large index

Reply via email to