I've read the NAACL 2013 paper on Dyer Fast Align (http://www.ark.cs.cmu.edu/cdyer/fast_valign.pdf) and it seems pretty straight forward.
There's a comment on statme.org (http://www.statmt.org/moses/?n=FactoredTraining.EMS#ntoc13), it's faster and maybe better, "especially for language pairs without much large-scale reordering." Other than the risk associated with the reordering, has anyone uncovered any other potential draw-backs of using Fast Align? For example, although BerkeleyAligner is nice, its multi-threading is buggy and tends to randomly fault when using a large thread pool. _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support