[ 
https://issues.apache.org/jira/browse/LUCENE-1491?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12715849#action_12715849
 ] 

viobade commented on LUCENE-1491:
---------------------------------

I think is better to keep the main goal of ngram: groups of characters between 
min and max.  If is need in any practical situation for minimum ngram equals 
with one or two characters, this can be done setting the minimum....otherwise 
the filter must  work in the way that is expected.. If I expect subword with 
minimum 3 length why do I get a token with two characters while it is not 
accomplish the condition?

> EdgeNGramTokenFilter stops on tokens smaller then minimum gram size.
> --------------------------------------------------------------------
>
>                 Key: LUCENE-1491
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1491
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Analysis
>    Affects Versions: 2.4, 2.4.1, 2.9, 3.0
>            Reporter: Todd Feak
>            Assignee: Otis Gospodnetic
>             Fix For: 2.9
>
>         Attachments: LUCENE-1491.patch
>
>
> If a token is encountered in the stream that is shorter in length than the 
> min gram size, the filter will stop processing the token stream.
> Working up a unit test now, but may be a few days before I can provide it. 
> Wanted to get it in the system.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to