Hi,

I'm running the compound-splitting script
(/mosesdecoder/scripts/generic/compound-splitter.perl) on the German side of my
parallel corpus. The corpus contains around 4M sentences and may contain a
few English sentences (as I just noticed). The script has now been running
for 14 hours on a 4-core 3 GHz machine with 16 GB of RAM and seems to be stuck
where these English sentences appear.

Is it normal for it to run for such a long time? Could the English sentences
in the corpus be causing the problem?

Thanks