On Nov 17, 10:15 am, labwor...@gmail.com wrote:
> That's probably an OpenNLP question, but here it goes. Is there a way to
> tell the tokenizer to make tokens of more than one word according to a
> multi-word lexicon?
>
> Thanks for any ideas.
> melipone

I'm not sure I fully understand what you're after, but you should be
able to train the tokenizer to split words however you'd like. Take a
look at the training documentation[1], and feel free to email me if
you run into any snags.
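
A separate option, not taken from the training documentation, is to
leave the tokenizer as it is and merge tokens afterwards against the
multi-word lexicon. Below is a minimal sketch of that idea in plain Java
(callable from Clojure via interop), assuming the lexicon is a set of
space-separated phrases and matching is exact and case-sensitive. The
class name MultiWordMerger, the method merge, and the maxWords parameter
are hypothetical names for illustration; none of them are part of OpenNLP
or clojure-opennlp.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper for illustration; not part of OpenNLP or clojure-opennlp.
final class MultiWordMerger {

    // Greedy longest-match merge: at each position, try the longest window
    // of tokens first and join it into one token if the phrase is in the lexicon.
    static List<String> merge(List<String> tokens, Set<String> lexicon, int maxWords) {
        List<String> out = new ArrayList<>();
        int i = 0;
        while (i < tokens.size()) {
            int matched = 1; // default: emit the single token unchanged
            int longest = Math.min(maxWords, tokens.size() - i);
            for (int n = longest; n > 1; n--) {
                String phrase = String.join(" ", tokens.subList(i, i + n));
                if (lexicon.contains(phrase)) {
                    matched = n;
                    break;
                }
            }
            out.add(String.join(" ", tokens.subList(i, i + matched)));
            i += matched;
        }
        return out;
    }

    public static void main(String[] args) {
        // Assumed example data: the token list stands in for tokenizer output.
        Set<String> lexicon = new HashSet<>(Arrays.asList("New York", "New York City"));
        List<String> tokens = Arrays.asList("I", "live", "in", "New", "York", "City", ".");
        System.out.println(merge(tokens, lexicon, 3));
    }
}
```

Because the longest window is tried first, "New York City" wins over
"New York" when both are in the lexicon. maxWords should be at least the
word count of the longest lexicon entry, otherwise longer entries can
never match.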

- Lee Hinman

[1]: https://github.com/dakrone/clojure-opennlp/blob/master/TRAINING.markdown

-- 
You received this message because you are subscribed to the Google
Groups "Clojure" group.
To post to this group, send email to clojure@googlegroups.com
Note that posts from new members are moderated - please be patient with your 
first post.
To unsubscribe from this group, send email to
clojure+unsubscr...@googlegroups.com
For more options, visit this group at
http://groups.google.com/group/clojure?hl=en
