Hello, I'am trying to build the baseline SMT for english to german language. I've got a "strange" problem because i work for the 1st time on german language and i've got a score Bleu on Dev corpus = 16 and on the test corpus = 9 ! i prepared the corpus before training and mert as usual like this: tokenization + cleaning + lowercase
Is there somthing special for german language? PS: training, dev and test corpus are in the same domain (Europarl) Thank you in advance Best regards Cyrine
_______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support