Hi, following the baseline instructions I've tokenized and recased the text before training. And consequently I get similar output when translating.
Are there any scripts available to get back a normal text from the output? Especially the html-encoding for some characters e.g. the french é, è and ê makes reading uncomfortable. A production system would have to produce readable output anyway. What's the standard work flow? Yours, Per Tunedal _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support