[
https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789607#action_12789607
]
Andrzej Bialecki commented on SOLR-1632:
-----------------------------------------
I believe the API that I propose would support such implementation as well.
Please note that it's usually not feasible to compute and distribute the
complete IDF table for all terms - you would have to replicate a union of all
term dictionaries across the cluster. In practice, you limit the amount of
information by various means, e.g. only distributing data related to the
current request (this implementation) or reducing the frequency of updates
(e.g. LRU caching), or approximating global DF with a constant for frequent
terms (where the contribution of their IDF to the score would be negligible
anyway).
> Distributed IDF
> ---------------
>
> Key: SOLR-1632
> URL: https://issues.apache.org/jira/browse/SOLR-1632
> Project: Solr
> Issue Type: New Feature
> Components: search
> Affects Versions: 1.5
> Reporter: Andrzej Bialecki
> Attachments: distrib.patch
>
>
> Distributed IDF is a valuable enhancement for distributed search across
> non-uniform shards. This issue tracks the proposed implementation of an API
> to support this functionality in Solr.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.