[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-06-23 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

Attachment: LUCENE-1466-TestCharFilter.patch

an additional test for CharFilter that I forgot to move from Solr... Mike, can 
you commit this? Thank you. :)

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466-back.patch, 
 LUCENE-1466-TestCharFilter.patch, LUCENE-1466.patch, LUCENE-1466.patch, 
 LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-06-22 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

Attachment: LUCENE-1466.patch

Added TestMappingCharFilter test case (copied from Solr).

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466-back.patch, LUCENE-1466.patch, 
 LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-06-20 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-1466:
---

Attachment: LUCENE-1466-back.patch
LUCENE-1466.patch

I think we should make an exception to back-compat here, and simply
change TokenStream.input from Reader to CharStream (subclasses
Reader).  Properly respecting back-compat will be alot of work, and,
if external subclasses are directly assigning to input, they really
ought to be using reaset(Reader) instead.

I updated the patch with the above issues, fixed some whitespace
issues, added Tokenizer.reset(CharStream) and patched back-compat.


 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466-back.patch, LUCENE-1466.patch, 
 LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-06-18 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

Attachment: LUCENE-1466.patch

updated patch attached.
- sync trunk (smart chinese analyzer(LUCENE-1629), etc.)
- added a useful idiom to get ChatStream and make private CharReader constructor

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Assignee: Michael McCandless
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466.patch, LUCENE-1466.patch, LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-03-19 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

Attachment: LUCENE-1466.patch

renamed correctPosition() to correct() because it is for correcting token 
offset, not for token position.

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466.patch, LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2009-02-28 Thread Michael McCandless (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael McCandless updated LUCENE-1466:
---

Lucene Fields: [New, Patch Available]  (was: [Patch Available, New])
Fix Version/s: 2.9

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Priority: Minor
 Fix For: 2.9

 Attachments: LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2008-12-12 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

  Description: 
This proposes to import CharFilter that has been introduced in Solr 1.4.

Please see for the details:
- SOLR-822
- http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

  was:
This proposes to import CharFilter that has been introduced in Solr 1.4.

Please see for the details:
SOLR-822
http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

Lucene Fields: [New, Patch Available]  (was: [New])

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Priority: Minor
 Attachments: LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 - SOLR-822
 - http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org



[jira] Updated: (LUCENE-1466) CharFilter - normalize characters before tokenizer

2008-11-23 Thread Koji Sekiguchi (JIRA)

 [ 
https://issues.apache.org/jira/browse/LUCENE-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Koji Sekiguchi updated LUCENE-1466:
---

Attachment: LUCENE-1466.patch

a patch attached.

 CharFilter - normalize characters before tokenizer
 --

 Key: LUCENE-1466
 URL: https://issues.apache.org/jira/browse/LUCENE-1466
 Project: Lucene - Java
  Issue Type: New Feature
  Components: Analysis
Affects Versions: 2.4
Reporter: Koji Sekiguchi
Priority: Minor
 Attachments: LUCENE-1466.patch


 This proposes to import CharFilter that has been introduced in Solr 1.4.
 Please see for the details:
 SOLR-822
 http://www.nabble.com/Proposal-for-introducing-CharFilter-to20327007.html

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]