Hi, I'm using the compound-splitting script (/mosesdecoder/scripts/generic/compound-splitter.perl) on the German side of my parallel corpus. The corpus contains around 4M sentences and may contain a few English sentences (as I just noticed). The script has now been running for 14 hours on a 4-core 3 GHz machine with 16 GB of RAM and seems to be stuck where these English sentences appear.
Is it normal for it to run this long? Could the English sentences in the corpus be causing the problem? Thanks
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support