On Nov 17, 10:15 am, labwor...@gmail.com wrote: > That's probably an OpenNLP question, but here it goes. Is there a way to > tell the tokenizer to make tokens of more than one word according to a > multi-word lexicon? > > Thanks for any ideas. > melipone
Not sure I understand what you're trying to get at 100%, but you should be able to train the tokenizer to split words however you'd like, take a look at the training documentation[1] and feel free to email me if you run into any snags. - Lee Hinman [1]: https://github.com/dakrone/clojure-opennlp/blob/master/TRAINING.markdown -- You received this message because you are subscribed to the Google Groups "Clojure" group. To post to this group, send email to clojure@googlegroups.com Note that posts from new members are moderated - please be patient with your first post. To unsubscribe from this group, send email to clojure+unsubscr...@googlegroups.com For more options, visit this group at http://groups.google.com/group/clojure?hl=en