[ 
https://issues.apache.org/jira/browse/LUCENE-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12783932#action_12783932
 ] 

Robert Muir commented on LUCENE-2094:
-------------------------------------

bq. But if this is so, you should have initialized the stop filter in the Persian 
analyzer with a fixed "false". But it also used 
StopFilter.getEnablePositionIncrementsVersionDefault() and used the version 
default. Should we fix this?

I don't think so. I think it's up to the user to decide how they want the search 
to work, even in this example.
If they don't like the defaults for how PhraseQuery works, they can create an 
analyzer that uses the StopFilter differently.

I don't think the issue is clear-cut for any given language; I think it always 
depends on how your application works.
I mean, we add a hole for "the" in English, but in Bulgarian (LUCENE-2062) this 
is a suffix attached to the end of a noun.
With Arabic it's always a prefix. I don't think we need to have options to add a 
posinc gap if we stem a leading ال off an Arabic word.

I'm just trying to show some examples of why a user might want to change the 
defaults.
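
To make the "hole" concrete, here is a minimal sketch in plain Java. It does not use 
Lucene's StopFilter; the class PositionIncrementSketch and its positions() method 
are hypothetical names made up for this illustration. It only shows how the 
positions of the surviving tokens differ depending on whether a removed stop word 
leaves a gap.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical illustration only; this is not Lucene's StopFilter.
public class PositionIncrementSketch {

    /**
     * Returns the position assigned to each token that survives stop word removal.
     * With enablePositionIncrements set, every removed stop word leaves a hole,
     * so the next surviving token is pushed one position further along.
     */
    static List<Integer> positions(List<String> tokens,
                                   Set<String> stopWords,
                                   boolean enablePositionIncrements) {
        List<Integer> result = new ArrayList<>();
        int position = -1; // the first surviving token lands on position 0 (plus any gap)
        int skipped = 0;   // stop words removed since the last surviving token
        for (String token : tokens) {
            if (stopWords.contains(token)) {
                skipped++;
                continue;
            }
            position += 1 + (enablePositionIncrements ? skipped : 0);
            skipped = 0;
            result.add(position);
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("over", "the", "lazy", "dog");
        Set<String> stopWords = new HashSet<>(Arrays.asList("the"));
        System.out.println(positions(tokens, stopWords, true));  // [0, 2, 3]
        System.out.println(positions(tokens, stopWords, false)); // [0, 1, 2]
    }
}
```

With the gap, an exact phrase query for "over lazy" would not match, because the two 
terms are no longer adjacent. Without it, it would. Which behaviour is right depends 
on the application, which is the point above.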


> Prepare CharArraySet for Unicode 4.0
> ------------------------------------
>
>                 Key: LUCENE-2094
>                 URL: https://issues.apache.org/jira/browse/LUCENE-2094
>             Project: Lucene - Java
>          Issue Type: Bug
>          Components: Analysis
>    Affects Versions: 3.0
>            Reporter: Simon Willnauer
>            Assignee: Uwe Schindler
>             Fix For: 3.1
>
>         Attachments: LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.patch, 
> LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.patch, LUCENE-2094.txt, 
> LUCENE-2094.txt, LUCENE-2094.txt
>
>
> CharArraySet does lowercasing if created with the corresponding flag. As a 
> result, a String / char[] containing Unicode 4 chars that are in the set cannot 
> be retrieved in "ignorecase" mode.
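
The kind of problem described in the issue can be sketched in plain Java. This is 
not CharArraySet's code and not the attached patch; the class and method names 
below are hypothetical. It assumes the failure comes from lowercasing one UTF-16 
char at a time, which leaves supplementary characters (stored as surrogate pairs) 
unchanged, whereas lowercasing by code point maps them correctly.

```java
// Hypothetical illustration only; not taken from CharArraySet or the LUCENE-2094 patch.
public class CodePointLowercaseSketch {

    /** Lowercases one UTF-16 char at a time; surrogate halves pass through unchanged. */
    static String lowerByChar(String s) {
        char[] buf = s.toCharArray();
        for (int i = 0; i < buf.length; i++) {
            buf[i] = Character.toLowerCase(buf[i]);
        }
        return new String(buf);
    }

    /** Lowercases one code point at a time, so surrogate pairs are handled as a unit. */
    static String lowerByCodePoint(String s) {
        StringBuilder sb = new StringBuilder(s.length());
        for (int i = 0; i < s.length(); ) {
            int codePoint = s.codePointAt(i);
            sb.appendCodePoint(Character.toLowerCase(codePoint));
            i += Character.charCount(codePoint);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // U+10400 DESERET CAPITAL LETTER LONG I; its lowercase form is U+10428.
        String upper = new String(Character.toChars(0x10400));
        String lower = new String(Character.toChars(0x10428));
        System.out.println(lowerByChar(upper).equals(lower));      // false
        System.out.println(lowerByCodePoint(upper).equals(lower)); // true
    }
}
```

A set that normalizes its keys with the per-char variant would store and look up 
the uppercase form unchanged, so a case-insensitive lookup with the lowercase form 
would miss.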
