OK thanks Otis. Any gut feeling on the best approach to get this collapsed data? I hate to ask you to do my homework, but I'm coming to the end of my Solr/Lucene knowledge. I don't code java too well - used to, but switched to Python a while back.
gene On Wed, Sep 17, 2008 at 12:47 PM, Otis Gospodnetic <[EMAIL PROTECTED]> wrote: > Gene, > > The latest patch from Bojan for SOLR-236 works with whatever revision of Solr > he used when he made the patch. > > I didn't follow this thread to know your original requirements, but running > 1+10 queries doesn't sound good to me from scalability/performance point of > view. > > Otis > -- > Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch > > > > ----- Original Message ---- >> From: ristretto.rb <[EMAIL PROTECTED]> >> To: solr-user@lucene.apache.org >> Sent: Tuesday, September 16, 2008 6:45:02 PM >> Subject: Re: Filtering results >> >> thanks. very interesting. The plot thickens. And, yes, I think >> field collapsing is exactly what I'm after. >> >> I'm am considering now trying this patch. I have a solr 1.2 instance >> on Jetty. I looks like I need to install the patch. >> Does anyone use that patch? Recommend it? The wiki page >> (http://wiki.apache.org/solr/FieldCollapsing) says >> "This patch is not complete, but it will be useful to keep this page >> updated while the interface evolves." And the page >> was last updated over a year ago, so I'm not sure if that is a good. >> I'm trying to read through all the comments now. >> >> ..... I'm also considering creating a second index of just the >> categories which contains all the content from the main index >> collapsed >> down in to the corresponding categories - basically a complete >> collapsed index. >> Initial searches will be done against this collapsed category index, >> and then the first 10 results >> will be used to do 10 field queries against the main index to get the >> "top" records to return with each Category. >> >> Haven't decided which path to take yet. >> >> cheers >> gene >> >> >> On Wed, Sep 17, 2008 at 9:42 AM, Chris Hostetter >> wrote: >> > >> > : 1. Identify all records that would match search terms. (Suppose I >> > : search for 'dog', and get 450,000 matches) >> > : 2. Of those records, find the distinct list of groups over all the >> > : matches. (Suppose there are 300.) >> > : 3. Now get the top ranked record from each group, as if you search >> > : just for docs in the group. >> > >> > this sounds similar to "Field Collapsing" although i don't really >> > understand it or your specific use case enough to be certain that it's the >> > same thing. You may find the patch, and/or the discussions about the >> > patch useful starting points... >> > >> > https://issues.apache.org/jira/browse/SOLR-236 >> > http://wiki.apache.org/solr/FieldCollapsing >> > >> > >> > -Hoss >> > >> > > >