Yes. Each tuning run, even with the same test set, will give you small variations in the final BLEU, because MERT's search involves random restarts. Your scores look like they're in a normal range.
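If you want to put a number on that run-to-run variation, a minimal sketch along these lines may help. It only summarizes the two BLEU scores you reported (21.95 and 21.82); the function name `summarize` is made up for illustration, and with just two runs the standard deviation is a very rough estimate, so treat it as a starting point and add more runs if you can.

```python
import statistics

# BLEU scores from the two runs reported in the original message.
# Append further runs here as you repeat tuning.
bleu_runs = [21.95, 21.82]


def summarize(scores):
    """Return (mean, sample standard deviation, max - min) for BLEU scores.

    Hypothetical helper for illustration; not part of Moses.
    """
    mean = statistics.mean(scores)
    # The sample standard deviation needs at least two values.
    stdev = statistics.stdev(scores) if len(scores) > 1 else 0.0
    spread = max(scores) - min(scores)
    return mean, stdev, spread


mean, stdev, spread = summarize(bleu_runs)
print(f"runs={len(bleu_runs)} mean={mean:.3f} stdev={stdev:.3f} spread={spread:.3f}")
```

When you later compare two system configurations, it is safer to compare the averages over several tuning runs of each than to compare a single run against a single run.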
Date: Sun, 11 Oct 2015 04:23:56 +0000
From: Davood Mohammadifar <davood...@hotmail.com>
Subject: [Moses-support] BLEU score difference about 0.13 for one dataset is normal?
To: Moses Support <moses-support@mit.edu>

Hello everyone,

I noticed different BLEU scores for the same dataset. The difference is small, about 0.13. I trained on my dataset and tuned on the development set for Persian-English translation. After testing, the score was 21.95. The second time I did the same process and obtained 21.82. (My tools were mgiza, mert, ...)

Is this difference normal?

My system:
CPU: Core i7-4790K
RAM: 16GB
OS: Ubuntu 12.04

Thanks