Native Moses components (MGIZA++, lmplz, train-model.perl, mert-moses.pl, and other scripts and binaries) currently require the training corpora (parallel and LM) to use POSIX newlines (\n) only. Is this a legacy of the toolkit's POSIX origins, a matter of limited resources to update the system to support both conventions, or both?
Is there some reason why they should NOT be updated to accept Windows newlines (\r\n) as well? Would anyone object if we did the work and contributed changes that transparently support both Linux and Windows newlines?
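
As a rough illustration of what accepting both conventions could mean at the point where a corpus is read, here is a minimal Python sketch. It is not existing Moses code: the function name normalize_newlines is hypothetical, and it assumes the corpus is processed as a binary stream so that the platform performs no newline translation of its own.

```python
import io


def normalize_newlines(src, dst):
    """Copy the binary stream src to dst, rewriting a CRLF line ending as LF.

    Hypothetical helper for illustration only; it is not part of Moses.
    Bare carriage returns inside a line are left untouched.
    """
    for line in src:  # binary streams split lines on b"\n" only
        if line.endswith(b"\r\n"):
            line = line[:-2] + b"\n"
        dst.write(line)


if __name__ == "__main__":
    # One line with a Windows newline, one with a POSIX newline.
    src = io.BytesIO(b"das ist ein Test\r\nnoch eine Zeile\n")
    dst = io.BytesIO()
    normalize_newlines(src, dst)
    print(dst.getvalue())
```

Working on bytes rather than decoded text keeps the sketch independent of the corpus encoding and leaves every byte other than the CR of a CRLF pair unchanged.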