[
https://issues.apache.org/jira/browse/SOLR-211?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ryan McKinley updated SOLR-211:
-------------------------------
Attachment: SOLR-211-RegexSplitTokenizer.patch
Using a Matcher to generate the tokens makes it easy enough to return the match
as token -- not just the split()
* Updated to take a "group" argument - if the group is less then zero, it
behaves as a split, otherwise it uses the matched group as the token.
* Changed the name to PatternTokenizerFactory as it is more general then just
split
> regex split() Tokenizer
> -----------------------
>
> Key: SOLR-211
> URL: https://issues.apache.org/jira/browse/SOLR-211
> Project: Solr
> Issue Type: New Feature
> Components: search
> Reporter: Ryan McKinley
> Attachments: SOLR-211-RegexSplitTokenizer.patch,
> SOLR-211-RegexSplitTokenizer.patch, SOLR-211-RegexSplitTokenizer.patch
>
>
> A TokenizerFactory that makes tokens from:
> string.split( regex );
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.