Hi Kai,

I have created an issue for this: 
https://issues.apache.org/jira/browse/LUCENE-3880

Thanks for reporting!

Steve

-----Original Message-----
From: Kai Gülzau [mailto:kguel...@novomind.com] 
Sent: Friday, March 16, 2012 9:59 AM
To: solr-user@lucene.apache.org
Subject: mailto: scheme aware tokenizer

Is there any analyzer out there which handles the mailto: scheme?

UAX29URLEmailTokenizer seems to split at the wrong place:

mailto:t...@example.org ->
mailto:test
example.org

As a workaround I use

<charFilter class="solr.PatternReplaceCharFilterFactory" pattern="mailto:"
            replacement="mailto: "/>
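
A minimal sketch of where such a char filter might sit in a schema.xml field type, so that it runs before the tokenizer. The field type name "text_mailto" is hypothetical, and the rest of the analyzer chain is an assumption rather than the configuration actually in use:

```xml
<!-- Hypothetical field type; the name "text_mailto" is an assumption -->
<fieldType name="text_mailto" class="solr.TextField">
  <analyzer>
    <!-- Char filters run before the tokenizer: insert a space after "mailto:" -->
    <charFilter class="solr.PatternReplaceCharFilterFactory"
                pattern="mailto:" replacement="mailto: "/>
    <!-- The tokenizer then sees the address separated from the scheme -->
    <tokenizer class="solr.UAX29URLEmailTokenizerFactory"/>
  </analyzer>
</fieldType>
```

With the space inserted, the tokenizer should be able to treat the scheme and the e-mail address as separate tokens instead of splitting at the "@".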

Regards,

Kai Gülzau

novomind AG
__________________________________

Bramfelder Straße 121 • 22305 Hamburg

phone +49 (0)40 808071138 • fax +49 (0)40 808071-100 email 
kguel...@novomind.com • http://www.novomind.com

Executive Board: Peter Samuelsen (Chair) • Stefan Grieben • Thomas Köhler
Chairman of the Supervisory Board: Werner Preuschhof
Registered office: Hamburg • HR B93508 Amtsgericht Hamburg
