Hi all,
Why are MT test sets the sizes they are? Most contain between 1,200 and
3,000 sentences, usually with a single reference translation, though a few
have four. How are these sizes justified? I am sure they are not arbitrary,
but I could not find an answer in most conference proceedings. What is the

I'm trying to decode using two factors and two generation models. This used
to work (with a Moses repository from approximately February 2015), but it
no longer does: it now generates blank translations.
While debugging, I found that the first generation model works fine, but the
second one seems to try