date:20160502

Re: [Moses-support] Data for building a factored model

2016-05-02 Thread Marwa Refaie

Corpus for translation model should be on 2 parallel files in the format Word | pos | Lema For example , by a file for each language. You can prepare files using word net , Stanford , or any tagger & stemmer as can deal with your language pairs. May be before enter the files to moses you sh

[Moses-support] Data for building a factored model

2016-05-02 Thread Sašo Kuntaric

Hi all, I am having some issues producing the corpora in the correct format for Moses to execute factored training. I am looking at the factored tutorial on the Moses website and I am wondering, how to get such consistent corpora for two languages. What tools are being used and can they be traine