[ https://issues.apache.org/jira/browse/SOLR-1632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195632#comment-13195632 ]
Robert Muir commented on SOLR-1632: ----------------------------------- {quote} Multi-term queries like range query, prefix query, etc, do not depend on term stats, and can consist of millions of terms. {quote} No, they cannot. it can't be millions of terms because a million exceeds the boolean max clause count, in which it will always use a filter. {quote} Ideally, we wouldn't even do a rewrite in order to collect terms {quote} You don't have to, Lucene's test case (ShardSearchingTestBase) doesn't do an extra rewrite to collect terms. {code} @Override public Query rewrite(Query original) throws IOException { final Query rewritten = super.rewrite(original); final Set<Term> terms = new HashSet<Term>(); rewritten.extractTerms(terms); // Make a single request to remote nodes for term // stats: ... return rewritten; } {code} {quote} - rewrite itself has gotten much more expensive in some circumstances (i.e. iterating the first 350 terms to determine what style of rewrite should be used) {quote} Got any benchmarks to back this up with? Its incorrect to say rewrite has gotten more expensive? More expensive than what? Its the opposite: its actually much faster when rewriting to boolean queries in 4.0 because it always works per-segment. > Distributed IDF > --------------- > > Key: SOLR-1632 > URL: https://issues.apache.org/jira/browse/SOLR-1632 > Project: Solr > Issue Type: New Feature > Components: search > Affects Versions: 1.5 > Reporter: Andrzej Bialecki > Attachments: 3x_SOLR-1632_doesntwork.patch, SOLR-1632.patch, > SOLR-1632.patch, SOLR-1632.patch, distrib-2.patch, distrib.patch > > > Distributed IDF is a valuable enhancement for distributed search across > non-uniform shards. This issue tracks the proposed implementation of an API > to support this functionality in Solr. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org