[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Attachment: LUCENE-1466-TestCharFilter.patch an additional test for CharFilter that I forgot to move from Solr... Mike, can you commit this? Thank you. :) CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Assignee: Michael McCandless Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466-back.patch, LUCENE-1466-TestCharFilter.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Attachment: LUCENE-1466.patch Added TestMappingCharFilter test case (copied from Solr). CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Assignee: Michael McCandless Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466-back.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1466: --- Attachment: LUCENE-1466-back.patch LUCENE-1466.patch I think we should make an exception to back-compat here, and simply change TokenStream.input from Reader to CharStream (subclasses Reader). Properly respecting back-compat will be alot of work, and, if external subclasses are directly assigning to input, they really ought to be using reaset(Reader) instead. I updated the patch with the above issues, fixed some whitespace issues, added Tokenizer.reset(CharStream) and patched back-compat. CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Assignee: Michael McCandless Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466-back.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Attachment: LUCENE-1466.patch updated patch attached. - sync trunk (smart chinese analyzer(LUCENE-1629), etc.) - added a useful idiom to get ChatStream and make private CharReader constructor CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Assignee: Michael McCandless Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Attachment: LUCENE-1466.patch renamed correctPosition() to correct() because it is for correcting token offset, not for token position. CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466.patch, LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Michael McCandless updated LUCENE-1466: --- Lucene Fields: [New, Patch Available] (was: [Patch Available, New]) Fix Version/s: 2.9 CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Priority: Minor Fix For: 2.9 Attachments: LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Description: This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html was: This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: SOLR-822 http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html Lucene Fields: [New, Patch Available] (was: [New]) CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Priority: Minor Attachments: LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: - SOLR-822 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org For additional commands, e-mail: java-dev-h...@lucene.apache.org
[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer
[ https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Sekiguchi updated LUCENE-1466: --- Attachment: LUCENE-1466.patch a patch attached. CharFilter - normalize characters before tokenizer -- Key: LUCENE-1466 URL: https://issues.apache.org/jira/browse/LUCENE-1466 Project: Lucene - Java Issue Type: New Feature Components: Analysis Affects Versions: 2.4 Reporter: Koji Sekiguchi Priority: Minor Attachments: LUCENE-1466.patch This proposes to import CharFilter that has been introduced in Solr 1.4. Please see for the details: SOLR-822 http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]