Currently I'm using StandardTokenizerFactory, which tokenizes words based on spaces. For "Toy Story" it creates the tokens toy and story. Ideally, I would like to extend the functionality of StandardTokenizerFactory so that it creates the tokens toy, story, and toy story. How do I do that?
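To make the desired output concrete, here is a minimal plain-Java sketch of it. It does not use or extend StandardTokenizerFactory; the class name PhraseTokens, the method tokensWithShingles, and the maxShingleSize parameter are hypothetical names made up for illustration. It assumes simple whitespace splitting and lowercasing, and emits the single words first, then the multi-word combinations. In Solr this kind of output is more commonly produced by a shingle filter (ShingleFilterFactory) placed after the tokenizer than by changing the tokenizer itself; that configuration is not shown here.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration class; not part of Lucene or Solr.
class PhraseTokens {

    // Returns lowercase word tokens, followed by every run of adjacent
    // words up to maxShingleSize words long (joined with a single space).
    static List<String> tokensWithShingles(String text, int maxShingleSize) {
        List<String> out = new ArrayList<>();
        String normalized = text.trim().toLowerCase(Locale.ROOT);
        if (normalized.isEmpty()) {
            return out;
        }
        // Assumption: words are separated by whitespace only.
        String[] words = normalized.split("\\s+");
        for (int size = 1; size <= maxShingleSize; size++) {
            for (int start = 0; start + size <= words.length; start++) {
                String[] run = Arrays.copyOfRange(words, start, start + size);
                out.add(String.join(" ", run));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // prints [toy, story, toy story]
        System.out.println(tokensWithShingles("Toy Story", 2));
    }
}
```

With maxShingleSize set to 2, only pairs of adjacent words are added, so a longer title would produce its two-word combinations but not the whole title as one token.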