El jue, 18-09-2008 a las 02:30 +0800, Nirav escribió: > Hi, > > I would like to know that how to align the two files one is having > Unicode characters ( Indian regional language) and one is having ascii > text ( English), > also is there any changes needed to train and evaluate the model.
It should Just Work™ -- afaik all the tools work with Unicode text, although depending on the regional language in question you might benefit from pre-tokenisation. Fran _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support