Hi, what if I want to train on a new dataset?
I tried to train with a new dictionary and a new dataset, but I always get an
error like this:

Exception in thread "main" java.lang.IllegalArgumentException:
opennlp.tools.util.InvalidFormatException: Tag dictionary contains tags
which are unknown by the model! The unknown tags are: Pr C Adj Knj Em M N
Prp Adv ADv v Ng So so Kh
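
The message suggests that the tag dictionary lists tags that never occur in
the training data. Below is a minimal sketch of how that mismatch could be
checked before training. It uses only plain Java, not the OpenNLP API. It
assumes the training data is in the "word_tag" format, one sentence per line;
the class name, method names, and sample tokens (w1, w2, ...) are hypothetical.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class TagMismatchCheck {

    // Collects the tag of every token in sentences written as
    // "word_tag word_tag ..." (assumed training format).
    static Set<String> tagsInTrainingData(List<String> sentences) {
        Set<String> tags = new TreeSet<>();
        for (String sentence : sentences) {
            for (String token : sentence.trim().split("\\s+")) {
                // The tag is whatever follows the last underscore.
                int sep = token.lastIndexOf('_');
                if (sep >= 0 && sep < token.length() - 1) {
                    tags.add(token.substring(sep + 1));
                }
            }
        }
        return tags;
    }

    // Returns the dictionary tags that never appear in the training data.
    // The comparison is case-sensitive, so "Adv" and "ADv" are different tags.
    static Set<String> unknownTags(Set<String> dictionaryTags,
                                   Set<String> trainingTags) {
        Set<String> unknown = new TreeSet<>(dictionaryTags);
        unknown.removeAll(trainingTags);
        return unknown;
    }

    public static void main(String[] args) {
        // Hypothetical sample data; replace with the real corpus and dictionary.
        List<String> sentences = Arrays.asList("w1_Pr w2_v w3_N", "w4_Pr w5_v");
        Set<String> dictionaryTags =
                new TreeSet<>(Arrays.asList("Pr", "v", "N", "Adv", "ADv"));

        Set<String> trainingTags = tagsInTrainingData(sentences);
        System.out.println(unknownTags(dictionaryTags, trainingTags));
        // prints [ADv, Adv]
    }
}
```

Any tag this reports would either have to appear in the training data or be
removed from the dictionary. Pairs such as Adv/ADv and So/so in the error
above may simply be inconsistent capitalization.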


How can I train on my own dataset for a different language? Please share a
code example.

Many thanks.

On Thu, Sep 17, 2015 at 9:41 PM, Aliaksandr Autayeu <[email protected]>
wrote:

> Penn Treebank: https://www.cis.upenn.edu/~treebank/
>
> On 16 September 2015 at 21:26, Nishant Kelkar <[email protected]>
> wrote:
>
> > Hi all,
> >
> > Just wanted to know: what is the data set used to train the default POS
> > tagger en-pos-maxent.bin, and where can I download it?
> >
> > Thanks!
> >
> > Best Regards,
> > Nishant Kelkar
> >
>
