On Thu, Aug 9, 2012 at 6:49 AM, Michael McCandless <[email protected]> wrote: > The text_general field type is meant to be a good default for all languages.
What many of us not familiar with the tokenizing rules of the standard tokenizer just realized is that it's not a good default for english and probably most other european languages. > If you want English-specific behavior, you should use one of the > English field types (text_en, text_en_splitting, > text_en_splitting_tight). Seems like we should be showing best-practice and using these english fields in our english examples. -Yonik http://lucidimagination.com --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
