The behaviour of the StandardTokenizerFactory changed with solr 3.1. The actual output is now: "Please", "email", "john.doe", "foo.com", "by", "03", "09", "re", "m37","xq"
Viele Grüße aus Augsburg Markus Klose SHI Elektronische Medien GmbH -----Ursprüngliche Nachricht----- Von: Alok Bhandari [mailto:alokomprakashbhand...@gmail.com] Gesendet: Dienstag, 19. Juni 2012 07:33 An: solr-user@lucene.apache.org Betreff: Re: StandardTokenizerFactory behaviour Just to make sure that there is no ambiguity the In: "Please, email john....@foo.com by 03-09, re: m37-xq." is the input given to this field for indexing and the Expected Out: "Please", "email", "john....@foo.com", "by", "03-09", "re", "m37-xq" is expected output tokens. -- View this message in context: http://lucene.472066.n3.nabble.com/StandardTokenizerFactory-behaviour-tp3990215p3990216.html Sent from the Solr - User mailing list archive at Nabble.com.