Hello again,

I believe I can wrap my head around the theoretical part, but the English
and German corpora in the Moses factored model tutorial (
http://www.statmt.org/moses/?n=Moses.FactoredTutorial) look beautifully
factored, so my question is how were the original corpora processed? Was a
specific tagger used and was there any manual/script postprocessing done?

And since I am already bugging everyone, how is the language model pos.lm
created? Is it extracted from a file, created manually or in another way?

Thank you in advance for all the replies.

Best regards,

Sašo

2016-05-02 19:45 GMT+02:00 Marwa Refaie <basmal...@hotmail.com>:

> Corpus for translation model should be on 2 parallel files in the format
> Word | pos | Lema .... For example , by a file for each language. You can
> prepare files using word net , Stanford , or any tagger & stemmer  as can
> deal with your language pairs. May be before enter the files to moses you
> should adjust the text files by a python script (write it your self)
>
> For language model ... You must build it as follows
> Verb noun noun
> Noun Det adj
> ....... Depending on the target language only ,, Then build it as usual
> n-gram lm.
>
> Sent from my iPad
>
> > On May 2, 2016, at 10:11, Sašo Kuntaric <saso.kunta...@gmail.com> wrote:
> >
> > Hi all,
> >
> > I am having some issues producing the corpora in the correct format for
> Moses to execute factored training.
> >
> > I am looking at the factored tutorial on the Moses website and I am
> wondering, how to get such consistent corpora for two languages. What tools
> are being used and can they be trained for specific languages (Slovenian in
> my example). Are such tools available for download or is such data produced
> with custom scripts?
> >
> > --
> > Best regards,
> >
> > Sašo
> > _______________________________________________
> > Moses-support mailing list
> > Moses-support@mit.edu
> > http://mailman.mit.edu/mailman/listinfo/moses-support
>



-- 
lp,

Sašo
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to