OK, thanks Otis.  Any gut feeling on the best approach to get this
collapsed data?  I hate to ask you to do my homework, but I'm coming
to the end of my Solr/Lucene knowledge.  I don't code Java too well - I
used to, but switched to Python a while back.

gene




On Wed, Sep 17, 2008 at 12:47 PM, Otis Gospodnetic
<[EMAIL PROTECTED]> wrote:
> Gene,
>
> The latest patch from Bojan for SOLR-236 works with whatever revision of Solr 
> he used when he made the patch.
>
> I didn't follow this thread closely enough to know your original
> requirements, but running 1+10 queries doesn't sound good to me from a
> scalability/performance point of view.
>
> Otis
> --
> Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
>
>
>
> ----- Original Message ----
>> From: ristretto.rb <[EMAIL PROTECTED]>
>> To: solr-user@lucene.apache.org
>> Sent: Tuesday, September 16, 2008 6:45:02 PM
>> Subject: Re: Filtering results
>>
>> thanks.  very interesting.  The plot thickens.  And, yes, I think
>> field collapsing is exactly what I'm after.
>>
>> I'm now considering trying this patch.  I have a Solr 1.2 instance
>> on Jetty.  It looks like I need to install the patch.
>> Does anyone use that patch?  Recommend it?  The wiki page
>> (http://wiki.apache.org/solr/FieldCollapsing) says
>> "This patch is not complete, but it will be useful to keep this page
>> updated while the interface evolves."  And the page
>> was last updated over a year ago, so I'm not sure if that is a good
>> sign.  I'm trying to read through all the comments now.
>>
>> .....  I'm also considering creating a second index of just the
>> categories, which contains all the content from the main index
>> collapsed down into the corresponding categories - basically a
>> complete collapsed index.
>> Initial searches will be done against this collapsed category index,
>> and then the first 10 results will be used to do 10 field queries
>> against the main index to get the "top" records to return with each
>> Category.
>>
>> Haven't decided which path to take yet.
>>
>> cheers
>> gene
>>
>>
>> On Wed, Sep 17, 2008 at 9:42 AM, Chris Hostetter
>> wrote:
>> >
>> > : 1.  Identify all records that would match search terms.  (Suppose I
>> > : search for 'dog', and get 450,000 matches)
>> > : 2.  Of those records, find the distinct list of groups over all the
>> > : matches.  (Suppose there are 300.)
>> > : 3.  Now get the top ranked record from each group, as if you search
>> > : just for docs in the group.
>> >
>> > this sounds similar to "Field Collapsing" although i don't really
>> > understand it or your specific use case enough to be certain that it's the
>> > same thing.  You may find the patch, and/or the discussions about the
>> > patch useful starting points...
>> >
>> > https://issues.apache.org/jira/browse/SOLR-236
>> > http://wiki.apache.org/solr/FieldCollapsing
>> >
>> >
>> > -Hoss
>> >
>> >
>
>

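For reference, the three steps quoted above (match, find the distinct
groups, keep the top-ranked record per group) can be sketched in plain
Python.  This is only an in-memory illustration of the idea, not how the
SOLR-236 patch implements it; the (doc_id, group, score) record shape and
the function name are assumptions made up for the example.  With
something like 450,000 matches you would want the search engine to do
this internally rather than pulling every hit back to the client.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical record shape for illustration: (doc_id, group, score).
Hit = Tuple[str, str, float]


def collapse_by_group(hits: Iterable[Hit]) -> List[Hit]:
    """Keep only the highest-scoring hit for each group."""
    best: Dict[str, Hit] = {}
    for hit in hits:
        _, group, score = hit
        current = best.get(group)
        # First hit seen for this group, or a strictly better score.
        if current is None or score > current[2]:
            best[group] = hit
    # Return one hit per group, best score first.
    return sorted(best.values(), key=lambda h: h[2], reverse=True)


# Made-up sample data: five matches spread over three groups.
hits = [
    ("d1", "pets", 0.9),
    ("d2", "pets", 0.4),
    ("d3", "vets", 0.7),
    ("d4", "toys", 0.7),
    ("d5", "vets", 0.8),
]

for doc_id, group, score in collapse_by_group(hits):
    print(doc_id, group, score)
```

The single pass keeps memory proportional to the number of groups (300
in the quoted example) rather than the number of matches.
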