analyzer settings for breaking up words on hyphens

2014-10-27 Thread Mike Topper
Hello, I have a field that is using the whitespace tokenizer, but I also want to tokenize on hyphens (-) like the standard analyzer does. I'm having trouble figuring out what additional custom settings I would have to put in there in order to be able to tokenize off of hyphens as well. Thanks,

Re: analyzer settings for breaking up words on hyphens

2014-10-27 Thread Mike Topper
Thanks! i'll go ahead and try the pattern tokenizer route. On Mon, Oct 27, 2014 at 1:22 PM, Ivan Brusic i...@brusic.com wrote: You can either use a pattern tokenizer with your patterns being whitespace + hypen, or further decompose your token post tokenization with the word delimiter token

Re: analyzer settings for breaking up words on hyphens

2014-10-27 Thread Nikolas Everett
Or you could cheat and use a character filter to turn the hyphen into spaces. Lots of ways to skin a cat. On Mon, Oct 27, 2014 at 7:07 PM, Mike Topper top...@gmail.com wrote: Thanks! i'll go ahead and try the pattern tokenizer route. On Mon, Oct 27, 2014 at 1:22 PM, Ivan Brusic