[ https://issues.apache.org/jira/browse/LUCENE-3880?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234341#comment-13234341 ]
Kai Gülzau commented on LUCENE-3880: ------------------------------------ That was fast! Thank _you_ :-) > UAX29URLEmailTokenizer fails to recognize emails as such when the mailto: > scheme is prepended > --------------------------------------------------------------------------------------------- > > Key: LUCENE-3880 > URL: https://issues.apache.org/jira/browse/LUCENE-3880 > Project: Lucene - Java > Issue Type: Bug > Affects Versions: 3.5, 4.0 > Reporter: Steven Rowe > Assignee: Steven Rowe > Priority: Minor > Fix For: 3.6, 4.0 > > Attachments: LUCENE-3880.patch > > > As [reported by Kai Gülzau on > solr-user|http://markmail.org/message/n32kji3okqm2c5qn]: > UAX29URLEmailTokenizer seems to split at the wrong place: > {noformat}mailto:t...@example.org{noformat} -> > {noformat}mailto:test{noformat} > {noformat}example.org{noformat} > As a workaround I use > {code:xml} > <charFilter class="solr.PatternReplaceCharFilterFactory" pattern="mailto:" > replacement="mailto: "/> > {code} -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org