Hi, these numbers are computed of the merged n-best lists, so the actual BLEU score my differ due to search error.
-phi On Tue, Jul 2, 2019 at 1:46 PM rmogla <rmogl...@gmail.com> wrote: > hello, > I am training moses , I ran it twice with the same database using the > baseline system but both the time mert.out file show different numbers of > run. > As in mert.out file: > > (8) BEST at 8: 0.118606 0.00894043 0.0616433 0.0270803 -0.00458744 > 0.0636065 0.0394991 0.121296 -0.0393199 -0.349773 0.0781373 0.00882911 > 0.0719328 0.00674891 => 0.54342 at Fri... > > (7) BEST at 7: 0.042261 0.0543911 0.0644467 0.0409684 -0.114557 0.0567117 > 0.0807067 0.099067 -0.0280285 -0.277286 0.0593325 0.0104217 0.0663609 > -0.00546069 => 0.546516 at Tue... > > though the BLEU score of finally trained moses varies by .01, > > Please suggest if I am doing something wrong or is it normal to have > various number of mert runs for the same database. > > Thanking in advance. > Best Regards > > >
_______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support