Hi all!

I encountered a small problem (as I think), with POSTaggerTrainer.

Train file contains russian and english words, ex. "бежать_action" in UTF-8
encoding. So in training (with or without -encoding UTF-8 option) I have
following:

opennlp.tools.postag.WordTagSampleStream read
WARNING: Error during parsing, ignoring sentence: ъєяшы_action ....(the
rest of sentence)

Where can be the problem?

Reply via email to