Hi, what if i want to train new dataset ? i tried to train new dictionary with new dataset, but always found error like this :
in thread "main" java.lang.IllegalArgumentException: opennlp.tools.util.InvalidFormatException: Tag dictionary contains tags which are unknown by the model! The unknown tags are: Pr C Adj Knj Em M N Prp Adv ADv v Ng So so Kh how can i train my own dataset for different language? please give me proof of code. Big Thanks. On Thu, Sep 17, 2015 at 9:41 PM, Aliaksandr Autayeu <[email protected]> wrote: > Penn Treebank: https://www.cis.upenn.edu/~treebank/ > > On 16 September 2015 at 21:26, Nishant Kelkar <[email protected]> > wrote: > > > Hi all, > > > > Just wanted to know: what is the data set used to train the default POS > > tagger en-pos-maxent.bin, and where can I download it? > > > > Thanks! > > > > Best Regards, > > Nishant Kelkar > > >
