hi i am working on baseline SMT with moses for Kannada-english MT. in the tokenization step the input unicode fonts of kannada words will get added with their unicode references because of glyph substitution.
due to this i am not able to get good translation. if i give the tokenized sentences as input to decoder i am getting correct translation. how to solve this problem? i am using ubuntu12.04 and moses. regards, ShivaKumar KM Asst.Professor, Amrita VishwaVidyaPeetham Mysore Campus Bogadi 2nd stage Mysore 9611913393 _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support