On Thu, Aug 9, 2012 at 6:49 AM, Michael McCandless <luc...@mikemccandless.com> wrote:
> The text_general field type is meant to be a good default for all languages.

What many of us who are not familiar with the tokenizing rules of the standard tokenizer just realized is that it's not a good default for English, and probably not for most other European languages.

> If you want English-specific behavior, you should use one of the
> English field types (text_en, text_en_splitting,
> text_en_splitting_tight).

Seems like we should be showing best practice and using these English fields in our English examples.

-Yonik
http://lucidimagination.com