Corpus for translation model should be on 2 parallel files in the format
Word | pos | Lema For example , by a file for each language. You can
prepare files using word net , Stanford , or any tagger & stemmer as can deal
with your language pairs. May be before enter the files to moses you sh
Hi all,
I am having some issues producing the corpora in the correct format for
Moses to execute factored training.
I am looking at the factored tutorial on the Moses website and I am
wondering, how to get such consistent corpora for two languages. What tools
are being used and can they be traine