Hi, If this happens in scripts/training/clean-corpus-n.perl then you should check whether a parallel corpus with the same number of lines on source and target side is passed to that script. Maybe there's an issue with your training data or something went wrong in a previous step of the preprocessing pipeline if the line numbers differ.
Cheers, Matthias On Fri, 2014-11-28 at 21:51 +0100, emna hkiri wrote: > Dear friends > i need your help please > i have a problem of the cleaning phase of the arabic text > every time moses returns the message sentences number 1562783 is too > short!!! > (in fact it is the last sentence in the text) so i delete it and again > and > again he tell me that this new last sentence is too short !!!! > and i do delete the last sentences and i have always the same problem > > Can someone please throw some light on this. > > Thanks & Regards > _______________________________________________ > Moses-support mailing list > Moses-support@mit.edu > http://mailman.mit.edu/mailman/listinfo/moses-support -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support