Hi,

I can't find a sort of "tutorial " on domain adaptation path to follow.
I read this in the doc :
The language model should be trained on a corpus that is suitable to the 
domain. If the translation model is trained on a parallel corpus, then 
the language model should be trained on the output side of that corpus, 
although using additional training data is often beneficial.

And in the training section of the EMS, there is a sub section with 
domain-features=....

What is the best practice ?

Let's say for instance that I would like to specialize my modem in 
finance translation, with specific corpus.

Should I train the Language model with finance stuff ?
Should I include parallel corpus in the translation model training ?
Should I tune with financial data sets ?

Please help me to understand.
Vincent

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to