[ 
https://issues.apache.org/jira/browse/LUCENE-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Grant Ingersoll updated LUCENE-1077:
------------------------------------

    Attachment: LUCENE-1077.patch

Adds to the sinks package:
DateRecognizerSinkTokenizer, which adds a token to the sink only if it can be 
parsed as a date by a DateFormat object.

TokenRangeSinkTokenizer, as described earlier.

TokenTypeSinkTokenizer, which adds a token to the sink only if its type 
matches a specified value.
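
To illustrate the two filtering rules above, here is a minimal standalone 
sketch. It does not use the Lucene classes from the patch: SinkFilterSketch, 
TypedTerm, keepDates and keepType are hypothetical names, and a plain list 
stands in for the sink. It only shows the "add to the sink if the token 
passes a test" idea, assuming the date test is a DateFormat.parse call that 
either succeeds or throws.

```java
import java.text.DateFormat;
import java.text.ParseException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for the sink logic; not the classes in the patch.
final class SinkFilterSketch {

    // Minimal token: the term text plus a type label.
    static final class TypedTerm {
        final String term;
        final String type;

        TypedTerm(String term, String type) {
            this.term = term;
            this.type = type;
        }
    }

    // True if the DateFormat can parse the term as a date.
    static boolean isDate(String term, DateFormat format) {
        try {
            format.parse(term);
            return true;
        } catch (ParseException e) {
            return false;
        }
    }

    // Date rule: keep only tokens whose term parses as a date.
    static List<TypedTerm> keepDates(List<TypedTerm> tokens, DateFormat format) {
        List<TypedTerm> sink = new ArrayList<>();
        for (TypedTerm token : tokens) {
            if (isDate(token.term, format)) {
                sink.add(token);
            }
        }
        return sink;
    }

    // Type rule: keep only tokens whose type equals the wanted type.
    static List<TypedTerm> keepType(List<TypedTerm> tokens, String wantedType) {
        List<TypedTerm> sink = new ArrayList<>();
        for (TypedTerm token : tokens) {
            if (wantedType.equals(token.type)) {
                sink.add(token);
            }
        }
        return sink;
    }
}
```

With a non-lenient SimpleDateFormat("yyyy-MM-dd"), keepDates would keep a 
term such as "2007-12-05" and drop ordinary words.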

Adds the payloads package, which contains:
NumericPayloadTokenFilter, which assigns a predefined float-based payload to a 
Token if the Token's type matches a specified type.  As a use case, this could 
be used to assign a payload to all tokens that are marked as "bold" or some 
other type.
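
The payload idea can be sketched in the same standalone way. The class 
below is not the filter from the patch: NumericPayloadSketch, PayloadToken 
and assignPayload are hypothetical names, and storing the float as four 
big-endian bytes is an assumption, since this message does not say how the 
patch encodes it.

```java
import java.nio.ByteBuffer;
import java.util.List;

// Hypothetical stand-in for the payload logic; not the filter in the patch.
final class NumericPayloadSketch {

    // Minimal token: term text, type label, and an optional payload.
    static final class PayloadToken {
        final String term;
        final String type;
        byte[] payload; // null until a payload is assigned

        PayloadToken(String term, String type) {
            this.term = term;
            this.type = type;
        }
    }

    // Assumed encoding: the float's 4 bytes in big-endian order.
    static byte[] encodeFloat(float value) {
        return ByteBuffer.allocate(4).putFloat(value).array();
    }

    static float decodeFloat(byte[] bytes) {
        return ByteBuffer.wrap(bytes).getFloat();
    }

    // Give every token of the matching type the same predefined payload.
    static void assignPayload(List<PayloadToken> tokens, String typeToMatch, float value) {
        byte[] encoded = encodeFloat(value);
        for (PayloadToken token : tokens) {
            if (typeToMatch.equals(token.type)) {
                token.payload = encoded.clone();
            }
        }
    }
}
```

Calling assignPayload(tokens, "bold", 2.0f) would leave non-bold tokens 
without a payload and give each bold token bytes that decode back to 2.0.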

> New Analysis  Contributions
> ---------------------------
>
>                 Key: LUCENE-1077
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1077
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Analysis, contrib/*
>            Reporter: Grant Ingersoll
>            Assignee: Grant Ingersoll
>            Priority: Minor
>             Fix For: 2.3
>
>         Attachments: LUCENE-1077.patch, LUCENE-1077.patch
>
>
> With the advent of the new TeeTokenFilter and SinkTokenizer, there are now 
> some interesting new things that can be done in the analysis phase of 
> indexing.  See LUCENE-1058.
> This patch provides some new implementations of SinkTokenizer that may be 
> useful.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


