[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12753335#action_12753335
 ] 

Paul Nelson edited comment on SOLR-236 at 9/9/09 5:07 PM:
----------------------------------------------------------

Hey All:  Just upgraded to 1.4 to get the new patch (many thanks, Martijn). The 
new algorithm appears to be sensitive to the size and complexity of the query 
(rather than simply the count of documents) - should this be the case? 
Unfortunately, we have rather large and complex queries with dozens of terms 
and several phrases, and while these queries are <0.5sec without collapsing, 
they are 3-4sec with collapsing. Meanwhile, collapse using *:* or other simple 
queries come back in <0.5sec - so it appears to be primarily a query-complexity 
issue.

I'm wondering if the filter cache (or some other cache) might be able to help 
with this situation?

      was (Author: pnelsoncomposer):
    Hey All:  Just upgraded to 1.4 to get the new patch (many thanks, Martijn). 
The new algorithm appears to be sensitive to the size and complexity of the 
query (rather than simply the count of documents) - should this be the case? 
Unfortunately, we have rather large and complex queries with dozens of terms 
and several phrases, and while these queries are <0.5sec without collapsing, 
they are 3-4sec with collapsing.

I'm wondering if the filter cache (or some other cache) might be able to help 
with this situation?
  
> Field collapsing
> ----------------
>
>                 Key: SOLR-236
>                 URL: https://issues.apache.org/jira/browse/SOLR-236
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>    Affects Versions: 1.3
>            Reporter: Emmanuel Keller
>             Fix For: 1.5
>
>         Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch, 
> field-collapse-4-with-solrj.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, solr-236.patch, SOLR-236_collapsing.patch, 
> SOLR-236_collapsing.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to