[
https://issues.apache.org/jira/browse/SOLR-236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shalin Shekhar Mangar updated SOLR-236:
---------------------------------------
Attachment: SOLR-236.patch
Patch in sync with trunk.
# CollapseComponent is now PluginInfoInitialized. Removed changes to SolrConfig.
Note that the collapseCollectorFactories array and the separate fieldCollapsing
element have been removed from the configuration. This patch uses the following
configuration:
{code:xml}
<searchComponent name="collapse"
                 class="org.apache.solr.handler.component.CollapseComponent">
  <collapseCollectorFactory name="groupDocumentsCounts"
      class="solr.fieldcollapse.collector.DocumentGroupCountCollapseCollectorFactory" />
  <collapseCollectorFactory name="groupFieldValue"
      class="solr.fieldcollapse.collector.FieldValueCountCollapseCollectorFactory" />
  <collapseCollectorFactory name="groupDocumentsFields"
      class="solr.fieldcollapse.collector.DocumentFieldsCollapseCollectorFactory" />
  <collapseCollectorFactory name="groupAggregatedData"
      class="org.apache.solr.search.fieldcollapse.collector.AggregateCollapseCollectorFactory">
    <lst name="aggregateFunctions">
      <str name="sum">org.apache.solr.search.fieldcollapse.collector.aggregate.SumFunction</str>
      <str name="avg">org.apache.solr.search.fieldcollapse.collector.aggregate.AverageFunction</str>
      <str name="min">org.apache.solr.search.fieldcollapse.collector.aggregate.MinFunction</str>
      <str name="max">org.apache.solr.search.fieldcollapse.collector.aggregate.MaxFunction</str>
    </lst>
  </collapseCollectorFactory>
  <fieldCollapseCache
      class="solr.FastLRUCache"
      size="512"
      initialSize="512"
      autowarmCount="128"/>
</searchComponent>
{code}
# I couldn't find where the fieldCollapseCache was being regenerated. It seems
it is not being thrown away after commits? I have changed it to be re-created
on the newSearcher event.
# Removed changes to JettySolrRunner, CoreContainer and SolrDispatchFilter for
the distributed test case. We will refactor it to use
BaseDistributedSearchTestCase (not implemented yet).
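
To illustrate the idea behind re-creating the cache on a newSearcher event, here is a minimal conceptual sketch. It is not the code in the patch and does not use Solr's cache classes: the class PerSearcherCacheSketch, the onNewSearcher() hook and the LinkedHashMap-based LRU are all assumptions made for illustration. The point is only that entries computed against the old index view are dropped wholesale when a new searcher is opened.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Conceptual sketch only. None of these names come from the patch or from Solr.
class PerSearcherCacheSketch {
    private final int maxSize;
    private Map<String, Object> cache;

    PerSearcherCacheSketch(int maxSize) {
        this.maxSize = maxSize;
        this.cache = newCache();
    }

    // Builds an access-ordered LinkedHashMap that evicts the least recently
    // used entry once the configured size is exceeded.
    private Map<String, Object> newCache() {
        final int limit = maxSize;
        return new LinkedHashMap<String, Object>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, Object> eldest) {
                return size() > limit;
            }
        };
    }

    // Hypothetical hook: called when a new searcher is opened after a commit.
    // The old cache is discarded so stale collapse results are never served.
    synchronized void onNewSearcher() {
        cache = newCache();
    }

    synchronized void put(String key, Object value) { cache.put(key, value); }

    synchronized Object get(String key) { return cache.get(key); }

    synchronized int size() { return cache.size(); }
}
```

In this sketch the cache starts empty after each new searcher; warming it from the old cache (as autowarmCount suggests) is left out.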
> Field collapsing
> ----------------
>
> Key: SOLR-236
> URL: https://issues.apache.org/jira/browse/SOLR-236
> Project: Solr
> Issue Type: New Feature
> Components: search
> Affects Versions: 1.3
> Reporter: Emmanuel Keller
> Assignee: Shalin Shekhar Mangar
> Fix For: 1.5
>
> Attachments: collapsing-patch-to-1.3.0-dieter.patch,
> collapsing-patch-to-1.3.0-ivan.patch, collapsing-patch-to-1.3.0-ivan_2.patch,
> collapsing-patch-to-1.3.0-ivan_3.patch, field-collapse-3.patch,
> field-collapse-4-with-solrj.patch, field-collapse-5.patch,
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch,
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch,
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch,
> field-collapse-5.patch, field-collapse-5.patch, field-collapse-5.patch,
> field-collapse-5.patch, field-collapse-5.patch,
> field-collapse-solr-236-2.patch, field-collapse-solr-236.patch,
> field-collapsing-extended-592129.patch, field_collapsing_1.1.0.patch,
> field_collapsing_1.3.patch, field_collapsing_dsteigerwald.diff,
> field_collapsing_dsteigerwald.diff, field_collapsing_dsteigerwald.diff,
> quasidistributed.additional.patch, SOLR-236-FieldCollapsing.patch,
> SOLR-236-FieldCollapsing.patch, SOLR-236-FieldCollapsing.patch,
> SOLR-236.patch, solr-236.patch, SOLR-236_collapsing.patch,
> SOLR-236_collapsing.patch
>
>
> This patch includes a new feature called "Field collapsing".
> "Used in order to collapse a group of results with similar value for a given
> field to a single entry in the result set. Site collapsing is a special case
> of this, where all results for a given web site is collapsed into one or two
> entries in the result set, typically with an associated "more documents from
> this site" link. See also Duplicate detection."
> http://www.fastsearch.com/glossary.aspx?m=48&amid=299
> The implementation adds 3 new query parameters (SolrParams):
> "collapse.field" to choose the field used to group results
> "collapse.type" normal (default value) or adjacent
> "collapse.max" to select how many continuous results are allowed before
> collapsing
> TODO (in progress):
> - More documentation (on source code)
> - Test cases
> Two patches:
> - "field_collapsing.patch" for current development version
> - "field_collapsing_1.1.0.patch" for Solr-1.1.0
> P.S.: Feedback and misspelling corrections are welcome ;-)
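
For reference, the three parameters listed in the description could be combined into a request query string roughly as follows. This is a sketch using only the JDK, not SolrJ and not code from any of the attached patches: the class name CollapseQueryExample is made up, and the query "ipod" and field "site" in the usage note are example values. Only the parameter names collapse.field, collapse.type and collapse.max come from the description; q is the standard Solr query parameter.

```java
import java.io.UnsupportedEncodingException;
import java.net.URLEncoder;
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: builds a query string from the three collapse
// parameters named in the issue description. The class name is hypothetical.
class CollapseQueryExample {

    static String buildQuery(String q, String field, String type, int max) {
        // LinkedHashMap keeps the parameters in insertion order.
        Map<String, String> params = new LinkedHashMap<String, String>();
        params.put("q", q);
        params.put("collapse.field", field);              // field used to group results
        params.put("collapse.type", type);                // "normal" (default) or "adjacent"
        params.put("collapse.max", Integer.toString(max)); // results allowed before collapsing

        StringBuilder sb = new StringBuilder();
        for (Map.Entry<String, String> e : params.entrySet()) {
            if (sb.length() > 0) {
                sb.append('&');
            }
            sb.append(encode(e.getKey())).append('=').append(encode(e.getValue()));
        }
        return sb.toString();
    }

    private static String encode(String s) {
        try {
            return URLEncoder.encode(s, "UTF-8");
        } catch (UnsupportedEncodingException ex) {
            // UTF-8 is always available on a compliant JVM.
            throw new IllegalStateException(ex);
        }
    }
}
```

Calling CollapseQueryExample.buildQuery("ipod", "site", "normal", 1) yields q=ipod&collapse.field=site&collapse.type=normal&collapse.max=1, which would be appended to the select handler URL of whatever Solr instance has the patch applied.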