Hi Everyone, please, can someone indicate me what is the current postagger dictionary format ? I would like to use the POSTaggerTrainer in command line.
First I attempt to generate it based on http://www.mail-archive.com/[email protected]/msg01352.html Second (since the previous attempt failed) I looked at the code http://svn.apache.org/viewvc/incubator/opennlp/trunk/opennlp-tools/src/main/java/opennlp/tools/postag/POSTaggerTrainer.java?view=markup But I still have errors... [Fatal Error] :1:1: Content is not allowed in prolog. IO error while reading training data or indexing data: The profile data stream has an invalid format! Thank you for your help /Nicolas
