[ https://issues.apache.org/jira/browse/LUCENE-6283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14333365#comment-14333365 ]
Robert Muir commented on LUCENE-6283:
-------------------------------------
I am also curious: what is the difference between skipTerms and MoreLikeThis's
setStopWords? MoreLikeThis already has its own stoplist, separate from the
Analyzer's (usually you want something more aggressive here):
{code}
/**
* Set the set of stopwords.
* Any word in this set is considered "uninteresting" and ignored.
* Even if your Analyzer allows stopwords, you might want to tell the MoreLikeThis code to ignore them, as
* for the purposes of document similarity it seems reasonable to assume that "a stop word is never interesting".
*
* @param stopWords set of stopwords, if null it means to allow stop words
* @see #getStopWords
*/
public void setStopWords(Set<?> stopWords) {
{code}
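To make the comparison concrete, here is a minimal plain-Java sketch of what a stoplist of this kind does while candidate terms are collected. It does not use Lucene and is not how MoreLikeThis is implemented; StopWordSketch and interestingTerms are hypothetical names chosen for illustration. The only behavior taken from the Javadoc above is that words in the set are ignored and that a null set allows every word.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical illustration only; this is not Lucene code.
final class StopWordSketch {

    // Counts term frequencies, ignoring any token found in stopWords.
    // A null set means "allow all words", as the quoted Javadoc describes.
    static Map<String, Integer> interestingTerms(List<String> tokens, Set<String> stopWords) {
        Map<String, Integer> freqs = new LinkedHashMap<>();
        for (String token : tokens) {
            if (stopWords != null && stopWords.contains(token)) {
                continue; // treated as "uninteresting"
            }
            freqs.merge(token, 1, Integer::sum);
        }
        return freqs;
    }

    public static void main(String[] args) {
        // Assumed input: tokens that already passed through an analyzer
        // whose own (smaller) stoplist removed words such as "the" or "of".
        List<String> tokens = List.of(
            "lucene", "also", "scores", "very", "similar", "lucene", "documents");

        // A more aggressive, similarity-specific stoplist.
        Set<String> aggressive = Set.of("also", "very");

        System.out.println(interestingTerms(tokens, aggressive));
        System.out.println(interestingTerms(tokens, null));
    }
}
```

In this sketch the second stoplist is applied independently of whatever the analyzer already removed, which is the behavior skipTerms is being compared against in the question above.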
> More Like This: skip terms, like Fields and lenient defaults
> ------------------------------------------------------------
>
> Key: LUCENE-6283
> URL: https://issues.apache.org/jira/browse/LUCENE-6283
> Project: Lucene - Core
> Issue Type: Improvement
> Affects Versions: Trunk
> Reporter: Alex Ksikes
> Priority: Minor
> Attachments: LUCENE-6283.patch
>
>
> - added skip terms: list of terms not to be considered as interesting
> - added like on Fields and Terms objects
> - made defaults more lenient