[jira] [Issue Comment Edited] (SOLR-236) Field collapsing

2011-03-27 Thread George P. Stathis (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13011812#comment-13011812
 ] 

George P. Stathis edited comment on SOLR-236 at 3/27/11 4:00 PM:
-

Bump on Yuriy's last question:
- Are performance issues around the number of documents matched, the size of 
the index, or both?

E.g. our index contains over 12 million documents already. Should we even 
consider using this feature?

Adding a few more questions:
- Are performance concerns around the 1.4 patch, the current Solr 4.0 branch or 
both?
- Is sharding an option to alleviate some of these issues? Reading the comments 
in this ticket, it seems there are caveats getting this to work with shards?

  was (Author: gstathis):
Bump on Yuriy's last question:
- Are performance issues around the number of documents matched, the size of 
the index, or both?

E.g. our index contains over 12 million documents already. Should we even 
consider using this feature?

Adding a few more questions:
- Are performance concerns around the 1.4 patch, the current Solr 4.0 branch or 
both?
- Is sharding an option to alleviate some of these issues? Reading the comments 
in this ticket, it seems there are caveats getting this to work with shards?
- Would this work with distributed solr and if so, would it have the same 
issues?
  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: DocSetScoreCollector.java, 
> NonAdjacentDocumentCollapser.java, NonAdjacentDocumentCollapserTest.java, 
> SOLR-236-1_4_1-NPEfix.patch, SOLR-236-1_4_1-paging-totals-working.patch, 
> SOLR-236-1_4_1.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-branch_3x.patch, SOLR-236-distinctFacet.patch, SOLR-236-trunk.patch, 
> SOLR-236-trunk.patch, SOLR-236-trunk.patch, SOLR-236-trunk.patch, 
> SOLR-236-trunk.patch, SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236_collapsing.patch, SOLR-236_collapsing.patch, 
> collapsing-patch-to-1.3.0-dieter.patch, collapsing-patch-to-1.3.0-ivan.patch, 
> collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch, 
> field-collapse-4-with-solrj.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> quasidistributed.additional.patch, solr-236.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Issue Comment Edited: (SOLR-236) Field collapsing

2010-08-02 Thread cruz fernandez (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12894129#action_12894129
 ] 

cruz fernandez edited comment on SOLR-236 at 8/2/10 5:35 PM:
-

I'm having an issue with the facet exclude filter parameters 
(http://wiki.apache.org/solr/SimpleFacetParameters#Tagging_and_excluding_Filters).
 I have added this exclude tags and the facet result I'm getting is without 
collapsing (it's counting the uncollapsed items).

For example, in my first page it shows something like this (the facet result 
gives something like this):

* book (11)
* website (20)
* journal (5)

after clicking on book it shows 11 results correctly, but the faceting with the 
exclude applied shows:

* book (230)
* website (25)
* journal (5)

I am using the parameter collapse.facet=after

The collapsed count of books is 11, and the uncollapsed count is 230, I 
verified it.

  was (Author: cruzf):
I'm having an issue with the facet exclude filter parameters 
(http://wiki.apache.org/solr/SimpleFacetParameters#Tagging_and_excluding_Filters).
 I have added this exclude tags and the facet result I'm getting is without 
collapsing (it's counting the uncollapsed items).

For example, in my first page it shows something like this (the facet result 
gives something like this):

* book (11)
* website (20)
* journal (5)

after clicking on book it shows 11 results correctly, but the faceting with the 
exclude applied shows:

* book (230)
* website (25)
* journal (5)

The collapsed count of books is 11, and the uncollapsed count is 230, I 
verified it.
  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, DocSetScoreCollector.java, 
> field-collapse-3.patch, field-collapse-4-with-solrj.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> NonAdjacentDocumentCollapser.java, NonAdjacentDocumentCollapserTest.java, 
> quasidistributed.additional.patch, SOLR-236-1_4_1.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-trunk.patch, SOLR-236-trunk.patch, 
> SOLR-236-trunk.patch, SOLR-236-trunk.patch, SOLR-236-trunk.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, solr-236.patch, 
> SOLR-236_collapsing.patch, SOLR-236_collapsing.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] Issue Comment Edited: (SOLR-236) Field collapsing

2010-11-17 Thread peterwang (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932905#action_12932905
 ] 

peterwang edited comment on SOLR-236 at 11/17/10 6:21 AM:
--

SOLR-236-1_4_1-paging-totals-working.patch patch failed with following errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit (delete 6 lines without fix diff hunk number) patch 
files, possible fix:

 $ diff -u SOLR-236-1_4_1.patch SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1.patch2010-11-17 18:22:25.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
@@ -2939,12 +2939,6 @@
 +collapseDoc = new NonAdjacentCollapseGroup(0, 0, documentComparator, 
collapseThreshold, currentValue);
 +collapsedDocs.put(currentValue, collapseDoc);
 +collapsedGroupPriority.add(collapseDoc);
-+
-+if (collapsedGroupPriority.size() > maxNumberOfGroups) {
-+  NonAdjacentCollapseGroup inferiorGroup = 
collapsedGroupPriority.first();
-+  collapsedDocs.remove(inferiorGroup.fieldValue);
-+  collapsedGroupPriority.remove(inferiorGroup);
-+}
 +  }
 +  // dropoutId has a value smaller than the smallest value in the queue 
and therefore it was removed from the queue
 +  Integer dropOutId = (Integer) 
collapseDoc.priorityQueue.insertWithOverflow(currentId);



  was (Author: peterwang):
SOLR-236-1_4_1-paging-totals-working.patch patch failed with following 
errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit (delete 6 lines without fix diff hunk number) patch 
files, possible fix:

# diff -u SOLR-236-1_4_1.patch SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1.patch2010-11-17 18:22:25.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
@@ -2939,12 +2939,6 @@
 +collapseDoc = new NonAdjacentCollapseGroup(0, 0, documentComparator, 
collapseThreshold, currentValue);
 +collapsedDocs.put(currentValue, collapseDoc);
 +collapsedGroupPriority.add(collapseDoc);
-+
-+if (collapsedGroupPriority.size() > maxNumberOfGroups) {
-+  NonAdjacentCollapseGroup inferiorGroup = 
collapsedGroupPriority.first();
-+  collapsedDocs.remove(inferiorGroup.fieldValue);
-+  collapsedGroupPriority.remove(inferiorGroup);
-+}
 +  }
 +  // dropoutId has a value smaller than the smallest value in the queue 
and therefore it was removed from the queue
 +  Integer dropOutId = (Integer) 
collapseDoc.priorityQueue.insertWithOverflow(currentId);


  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, DocSetScoreCollector.java, 
> field-collapse-3.patch, field-collapse-4-with-solrj.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-c

[jira] Issue Comment Edited: (SOLR-236) Field collapsing

2010-11-17 Thread peterwang (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932905#action_12932905
 ] 

peterwang edited comment on SOLR-236 at 11/17/10 6:23 AM:
--

SOLR-236-1_4_1-paging-totals-working.patch patch failed with following errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit (delete 6 lines without fix diff hunk number) patch 
files, possible fix:

{code}
$ diff -u SOLR-236-1_4_1.patch SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1.patch2010-11-17 18:22:25.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
@@ -2939,12 +2939,6 @@
 +collapseDoc = new NonAdjacentCollapseGroup(0, 0, documentComparator, 
collapseThreshold, currentValue);
 +collapsedDocs.put(currentValue, collapseDoc);
 +collapsedGroupPriority.add(collapseDoc);
-+
-+if (collapsedGroupPriority.size() > maxNumberOfGroups) {
-+  NonAdjacentCollapseGroup inferiorGroup = 
collapsedGroupPriority.first();
-+  collapsedDocs.remove(inferiorGroup.fieldValue);
-+  collapsedGroupPriority.remove(inferiorGroup);
-+}
 +  }
 +  // dropoutId has a value smaller than the smallest value in the queue 
and therefore it was removed from the queue
 +  Integer dropOutId = (Integer) 
collapseDoc.priorityQueue.insertWithOverflow(currentId);
{code} 

  was (Author: peterwang):
SOLR-236-1_4_1-paging-totals-working.patch patch failed with following 
errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit (delete 6 lines without fix diff hunk number) patch 
files, possible fix:

 $ diff -u SOLR-236-1_4_1.patch SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1.patch2010-11-17 18:22:25.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
@@ -2939,12 +2939,6 @@
 +collapseDoc = new NonAdjacentCollapseGroup(0, 0, documentComparator, 
collapseThreshold, currentValue);
 +collapsedDocs.put(currentValue, collapseDoc);
 +collapsedGroupPriority.add(collapseDoc);
-+
-+if (collapsedGroupPriority.size() > maxNumberOfGroups) {
-+  NonAdjacentCollapseGroup inferiorGroup = 
collapsedGroupPriority.first();
-+  collapsedDocs.remove(inferiorGroup.fieldValue);
-+  collapsedGroupPriority.remove(inferiorGroup);
-+}
 +  }
 +  // dropoutId has a value smaller than the smallest value in the queue 
and therefore it was removed from the queue
 +  Integer dropOutId = (Integer) 
collapseDoc.priorityQueue.insertWithOverflow(currentId);


  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, DocSetScoreCollector.java, 
> field-collapse-3.patch, field-collapse-4-with-solrj.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.p

[jira] Issue Comment Edited: (SOLR-236) Field collapsing

2010-11-17 Thread peterwang (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12932905#action_12932905
 ] 

peterwang edited comment on SOLR-236 at 11/17/10 6:28 AM:
--

SOLR-236-1_4_1-paging-totals-working.patch patch failed with following errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit SOLR-236-1_4_1.patch to produce 
SOLR-236-1_4_1-paging-totals-working.patch (delete 6 lines without fix diff 
hunk number) 
possible fix:

{code}
diff -u SOLR-236-1_4_1-paging-totals-working.patch.orig 
SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1-paging-totals-working.patch.orig 2010-11-17 
19:26:05.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
{code} 

  was (Author: peterwang):
SOLR-236-1_4_1-paging-totals-working.patch patch failed with following 
errors:

patch:  malformed patch at line 3348: Index: 
src/test/org/apache/solr/search/fieldcollapse/DistributedFieldCollapsingIntegrationTest.java

seems caused by hand edit (delete 6 lines without fix diff hunk number) patch 
files, possible fix:

{code}
$ diff -u SOLR-236-1_4_1.patch SOLR-236-1_4_1-paging-totals-working.patch
--- SOLR-236-1_4_1.patch2010-11-17 18:22:25.0 +0800
+++ SOLR-236-1_4_1-paging-totals-working.patch  2010-11-17 19:17:20.0 
+0800
@@ -2834,7 +2834,7 @@
 ===
 --- 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
 +++ 
src/java/org/apache/solr/search/fieldcollapse/NonAdjacentDocumentCollapser.java 
   (revision )
-@@ -0,0 +1,517 @@
+@@ -0,0 +1,511 @@
 +/**
 + * Licensed to the Apache Software Foundation (ASF) under one or more
 + * contributor license agreements.  See the NOTICE file distributed with
@@ -2939,12 +2939,6 @@
 +collapseDoc = new NonAdjacentCollapseGroup(0, 0, documentComparator, 
collapseThreshold, currentValue);
 +collapsedDocs.put(currentValue, collapseDoc);
 +collapsedGroupPriority.add(collapseDoc);
-+
-+if (collapsedGroupPriority.size() > maxNumberOfGroups) {
-+  NonAdjacentCollapseGroup inferiorGroup = 
collapsedGroupPriority.first();
-+  collapsedDocs.remove(inferiorGroup.fieldValue);
-+  collapsedGroupPriority.remove(inferiorGroup);
-+}
 +  }
 +  // dropoutId has a value smaller than the smallest value in the queue 
and therefore it was removed from the queue
 +  Integer dropOutId = (Integer) 
collapseDoc.priorityQueue.insertWithOverflow(currentId);
{code} 
  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, DocSetScoreCollector.java, 
> field-collapse-3.patch, field-collapse-4-with-solrj.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> NonAdjacentDocumentCollapser.java, NonAdjacentDocumentCollapserTest.java, 
> quasidistributed.additional.patch, 
> SOLR-236-1_4_1-paging-totals-working.patch, SOLR-236-1_4_1.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-trunk.patch, SOLR-236-trunk.patch, 
> SOLR-236-trunk.patch, SOLR-236-trunk.patch, SOL

[jira] Issue Comment Edited: (SOLR-236) Field collapsing

2010-11-23 Thread Luke Bochsler (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12934825#action_12934825
 ] 

Luke Bochsler edited comment on SOLR-236 at 11/23/10 9:03 AM:
--

"Is anyone working on the ability to calculate facets AFTER the group?"

This would be great to have that possibility! Sorry I'm not a Java Programmer 
so I cannot contribute a solution, instead I contribute to other open source 
systems. However, would that be a big deal for you guys to implement it? I'm 
using Solr in a web project as search solution and desperately need this 
feature along with the great grouping functionality. The grouping in general 
has made my life so much easier so far, so it seems we are just one step away 
from having it all covered by Solr!

Thank you so much!

Luke

  
> Field collapsing
> 
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
>  Issue Type: New Feature
>  Components: search
>Affects Versions: 1.3
>Reporter: Emmanuel Keller
>Assignee: Shalin Shekhar Mangar
> Fix For: Next
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch, 
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch, 
> collapsing-patch-to-1.3.0-ivan_3.patch, DocSetScoreCollector.java, 
> field-collapse-3.patch, field-collapse-4-with-solrj.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch, 
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch, 
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch, 
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff, 
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff, 
> NonAdjacentDocumentCollapser.java, NonAdjacentDocumentCollapserTest.java, 
> quasidistributed.additional.patch, 
> SOLR-236-1_4_1-paging-totals-working.patch, SOLR-236-1_4_1.patch, 
> SOLR-236-distinctFacet.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch, 
> SOLR-236-trunk.patch, SOLR-236-trunk.patch, SOLR-236-trunk.patch, 
> SOLR-236-trunk.patch, SOLR-236-trunk.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, SOLR-236.patch, 
> SOLR-236.patch, SOLR-236.patch, solr-236.patch, SOLR-236_collapsing.patch, 
> SOLR-236_collapsing.patch
>
>
> This patch include a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given 
> field to a single entry in the result set. Site collapsing is a special case 
> of this, where all results for a given web site is collapsed into one or two 
> entries in the result set, typically with an associated "more documents from 
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation add 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before 
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling correction are welcome ;-)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org