Hi all!
I encountered a small problem (as I think), with POSTaggerTrainer. Train file contains russian and english words, ex. "бежать_action" in UTF-8 encoding. So in training (with or without -encoding UTF-8 option) I have following: opennlp.tools.postag.WordTagSampleStream read WARNING: Error during parsing, ignoring sentence: ъєяшы_action ....(the rest of sentence) Where can be the problem?
