[ 
https://issues.apache.org/jira/browse/SOLR-211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ryan McKinley updated SOLR-211:
-------------------------------

    Attachment: SOLR-211-RegexSplitTokenizer.patch

Using a Matcher to generate the tokens makes it easy enough to return the match 
as token -- not just the split()

* Updated to take a "group" argument - if the group is less then zero, it 
behaves as a split, otherwise it uses the matched group as the token.

* Changed the name to PatternTokenizerFactory as it is more general then just 
split

> regex split() Tokenizer
> -----------------------
>
>                 Key: SOLR-211
>                 URL: https://issues.apache.org/jira/browse/SOLR-211
>             Project: Solr
>          Issue Type: New Feature
>          Components: search
>            Reporter: Ryan McKinley
>         Attachments: SOLR-211-RegexSplitTokenizer.patch, 
> SOLR-211-RegexSplitTokenizer.patch, SOLR-211-RegexSplitTokenizer.patch
>
>
> A TokenizerFactory that makes tokens from:
>   string.split( regex );

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to