Hi,

If this happens in scripts/training/clean-corpus-n.perl then you should
check whether a parallel corpus with the same number of lines on source
and target side is passed to that script. Maybe there's an issue with
your training data or something went wrong in a previous step of the
preprocessing pipeline if the line numbers differ.

Cheers,
Matthias


On Fri, 2014-11-28 at 21:51 +0100, emna hkiri wrote:
> Dear friends
> i need your help please
> i have a problem of the cleaning phase of the arabic text
> every time moses returns the message sentences number 1562783 is too
> short!!!
> (in fact it is the last sentence in the text) so i delete it and again
> and
> again he tell me that this new last sentence is too short !!!!
> and i do delete the last sentences and i have always the same problem
> 
> Can someone please throw some light on this.
> 
> Thanks & Regards
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to