Hello,
I am trying to build a baseline SMT system for English to German. I have
a "strange" problem, as this is my first time working with German:
I get a BLEU score of 16 on the dev corpus but only 9 on the test corpus!
I prepared the corpus before training and MERT as usual, like this:
tokenization + cleaning + lowercasing
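
To be concrete about those three steps, here is a rough Python sketch of what they amount to. It is only an illustration, not the actual scripts that were run: the function names, the simple regex tokenizer, and the cleaning limits (1 to 80 tokens, maximum length ratio of 9) are all assumptions.

```python
import re


def tokenize(line):
    # Crude stand-in for a real tokenizer (assumption): words stay together,
    # every punctuation mark becomes its own token.
    return re.findall(r"\w+|[^\w\s]", line)


def keep_pair(src_tokens, tgt_tokens, min_len=1, max_len=80, max_ratio=9.0):
    # Cleaning step: drop empty or overly long sentences and pairs whose
    # lengths are badly mismatched. The limits are illustrative assumptions.
    ls, lt = len(src_tokens), len(tgt_tokens)
    if not (min_len <= ls <= max_len and min_len <= lt <= max_len):
        return False
    return max(ls, lt) / min(ls, lt) <= max_ratio


def preprocess(pairs):
    # Apply tokenization, cleaning, then lowercasing to (source, target) pairs.
    result = []
    for src, tgt in pairs:
        src_tokens, tgt_tokens = tokenize(src), tokenize(tgt)
        if keep_pair(src_tokens, tgt_tokens):
            result.append((" ".join(src_tokens).lower(),
                           " ".join(tgt_tokens).lower()))
    return result
```

The same steps are applied to both language sides, and the cleaning step applies to parallel sentence pairs; nothing in this sketch is specific to German.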

Is there something special about German?
PS: the training, dev and test corpora are all from the same domain (Europarl).

Thank you in advance

Best regards

Cyrine
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
