Hello,
I have a field that is using the whitespace tokenizer, but I also want to
tokenize on hyphens (-) like the standard analyzer does. I'm having
trouble figuring out what additional custom settings I would have to put in
there in order to be able to tokenize off of hyphens as well.
Thanks,
Thanks! i'll go ahead and try the pattern tokenizer route.
On Mon, Oct 27, 2014 at 1:22 PM, Ivan Brusic i...@brusic.com wrote:
You can either use a pattern tokenizer with your patterns being whitespace
+ hypen, or further decompose your token post tokenization with the word
delimiter token
Or you could cheat and use a character filter to turn the hyphen into
spaces. Lots of ways to skin a cat.
On Mon, Oct 27, 2014 at 7:07 PM, Mike Topper top...@gmail.com wrote:
Thanks! i'll go ahead and try the pattern tokenizer route.
On Mon, Oct 27, 2014 at 1:22 PM, Ivan Brusic