Hi,

I would like to better understand the analysis.perl script that
generates the bleu-annotation file.

Is there an easy way to get the uncased BLEU score of each line instead
of the cased one?
Am I right that this script recomputes its own BLEU score without calling
either of the external Nist-Bleu or Multi-Bleu scripts?


Also, the scores sometimes look strange to me when a line has only one or two words:

Translation / reference / score
Contents / Content / 0.8409
Ireland / Irish / 0.8409
Issuer / Italie / 0.8409
PT / US / 0.8409
.....
and so on. A one-word translation that does not match its one-word
reference always gets the same score of 0.8409.

For two-word lines (2-grams):
Very strong / Very high / 0.7598
Public sector / Public Sector / 0.7598
However : / But : / 0.7598

So for two-word lines, when only one of the two words is correct, the
score is always 0.7598.
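
Both values can be reproduced by a sentence-level BLEU over 1- to 4-grams
in which every n-gram precision is add-one smoothed, (matches + 1) /
(total + 1). This is only an assumption that happens to fit the numbers
above; it is not confirmed against the analysis.perl source. The function
name smoothed_sentence_bleu and its lowercase flag are hypothetical, and
the brevity penalty is the standard BLEU one, which the examples above
cannot confirm because translation and reference have equal length.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count the n-grams of order n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def smoothed_sentence_bleu(hyp, ref, max_n=4, lowercase=False):
    """Sentence-level BLEU with add-one smoothing (ASSUMED, not taken from analysis.perl)."""
    if lowercase:  # hypothetical switch for an uncased score
        hyp, ref = hyp.lower(), ref.lower()
    h, r = hyp.split(), ref.split()
    if not h:
        return 0.0
    product = 1.0
    for n in range(1, max_n + 1):
        hyp_counts = ngram_counts(h, n)
        ref_counts = ngram_counts(r, n)
        # Clipped matches: an n-gram counts at most as often as it occurs in the reference.
        matched = sum(min(c, ref_counts[g]) for g, c in hyp_counts.items())
        total = sum(hyp_counts.values())
        # Add-one smoothing: an order with no n-grams at all contributes 1/1 = 1.
        product *= (matched + 1) / (total + 1)
    score = product ** (1.0 / max_n)
    if len(h) < len(r):  # standard brevity penalty, also an assumption here
        score *= math.exp(1 - len(r) / len(h))
    return score


print(round(smoothed_sentence_bleu("Contents", "Content"), 4))
print(round(smoothed_sentence_bleu("Very strong", "Very high"), 4))
print(round(smoothed_sentence_bleu("Public sector", "Public Sector", lowercase=True), 4))
```

Under this assumption a one-word mismatch gives (1/2 * 1 * 1 * 1)^(1/4) =
0.8409, and a two-word line with one correct word gives
(2/3 * 1/2 * 1 * 1)^(1/4) = 0.7598, which would explain why unrelated
pairs end up with identical scores. Lowercasing both sides before
counting turns the "Public sector / Public Sector" line into a full match.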


Thanks,

Vincent

