[
https://issues.apache.org/jira/browse/LUCENE-1424?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12642615#action_12642615
]
Mark Harwood commented on LUCENE-1424:
--------------------------------------
>> Are the score differences caused by the rewrite-to-BooleanQuery
>> implementations ever "useful"?
So we need to consider what we are losing - TF, IDF, coordination, length norm,
doc boosts.
I can only think of one use case which relates to coordination factor.
If you have a "category" field for a product e.g. given Lucene docs for these
books:
Title: Lucene in Action
Category: /Books/Computing/Languages/Java
/Books/Computing/InformationRetrieval
Title: The Long Tail
Category: /Books/Business/Internet
/Books/Computing
You might then use a wildcard search of /Books/Computing/* and "Lucene in
Action" would rank higher than "The Long Tail" because a BooleanQuery would
score a higher coordination factor suggesting LIA got more hits under this
"/Books/Computing.." category. There would still be the issue of IDF
potentially skewing results but the coordination factor is potentially useful
here.
I think in general IDF tends to be useless for "auto-expanded" terms e.g.
Wildcard, fuzzy etc. Incidentally, we still see that IDF issue in fuzzy queries
ranking rare mis-spellings higher but that's another issue (one I resolved in
contrib's FuzzyLikeThisQuery).
I suppose one other consideration is for people who have created any doc boosts
e.g. trying to use this to boost by date.
I don't think any of these cases necessarily outweigh the benefit to be
obtained from switching "wildcard/prefix to constant score queries"
Cheers,
Mark
> Add ConstantScorePrefixQuery and ConstantScoreWildcardQuery
> -----------------------------------------------------------
>
> Key: LUCENE-1424
> URL: https://issues.apache.org/jira/browse/LUCENE-1424
> Project: Lucene - Java
> Issue Type: New Feature
> Reporter: Mark Miller
> Assignee: Michael McCandless
> Priority: Minor
> Attachments: LUCENE-1424.patch
>
>
> If we want to be able to highlight these queries, they need to be added to
> Lucene core or contrib (solr's WildCardFilter can be used to create the
> ConstantScoreWildcardQuery). They are very useful anyway.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]