[
https://issues.apache.org/jira/browse/LUCENE-2015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12771496#action_12771496
]
Cédrik LIME commented on LUCENE-2015:
-------------------------------------
Robert,
All I did is refactor the big switch(c) into its own method:
public static final int foldToASCII(char c, char[] output, int outputPos)
and change the caller (public void foldToASCII(char[] input, int length))
accordingly.
I can submit a patch without formatting changes, but that means the source
won't be nicely indented...
Please advise.
As for the ISOLatin1AccentFilter patch, it really is to enable us to remove a
workaround for an issue we had with some special (yet frequent) chars. Feel
free to ignore it should you think this part is not relevant.
> ASCIIFoldingFilter: expose folding logic + small improvements to
> ISOLatin1AccentFilter
> --------------------------------------------------------------------------------------
>
> Key: LUCENE-2015
> URL: https://issues.apache.org/jira/browse/LUCENE-2015
> Project: Lucene - Java
> Issue Type: Improvement
> Components: Analysis
> Reporter: Cédrik LIME
> Priority: Minor
> Attachments: Filters.patch
>
>
> This patch adds a couple of non-ascii chars to ISOLatin1AccentFilter (namely:
> left & right single quotation marks, en dash, em dash) which we very
> frequently encounter in our projects. I know that this class is now
> deprecated; this improvement is for legacy code that hasn't migrated yet.
> It also enables easy access to the ascii folding technique use in
> ASCIIFoldingFilter for potential re-use in non-Lucene-related code.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]