Cyrine

This is not a problem. It's the design. The tokenizer.pl script escapes characters that Moses reserves for its own use. When you use the detokenizer.pl script unescapes these characters after translations.



On 02/21/2014 08:20 PM, [email protected] wrote:
reserves for
Hello all,

I have a problem with the tokenizer.pl <http://tokenizer.pl> script. i get as a result a text ith some special punctuation , like this for example :

EU &apos;s Luxembourg-based statistical office reported

The input file is a .txt file

Is there any solution for this problem

Thank you in advance


Bests
--
/Cyrine/


_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to