Dear all, I am working on a translation task with a very large parallel corpus. Because training on such a large corpus is computationally expensive, I would like to filter it with respect to the test set (of course, the evaluation must still be fair after filtering).
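To make the idea concrete, the kind of filtering I have in mind could be sketched roughly as follows. This is only an illustration under my own assumptions, not an existing tool: it keeps a training sentence pair when its source side shares at least one n-gram with the source side of the test set. The test references are never read, which is what I mean by keeping the evaluation fair. The names `ngrams` and `filter_corpus`, the whitespace tokenization, and the defaults `n=2` and `min_shared=1` are all hypothetical choices.

```python
from typing import Iterable, List, Set, Tuple

Pair = Tuple[str, str]  # (source sentence, target sentence)


def ngrams(tokens: List[str], n: int) -> Set[Tuple[str, ...]]:
    """Return the set of n-grams (as tuples) found in a token list."""
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def filter_corpus(pairs: Iterable[Pair],
                  test_sources: Iterable[str],
                  n: int = 2,
                  min_shared: int = 1) -> List[Pair]:
    """Keep pairs whose source side shares >= min_shared n-grams
    with the source side of the test set (assumed criterion)."""
    # Only the test *source* sentences are used; references are never seen.
    test_ngrams: Set[Tuple[str, ...]] = set()
    for sentence in test_sources:
        test_ngrams |= ngrams(sentence.lower().split(), n)

    kept: List[Pair] = []
    for src, tgt in pairs:
        shared = ngrams(src.lower().split(), n) & test_ngrams
        if len(shared) >= min_shared:
            kept.append((src, tgt))
    return kept


# Toy data, invented purely for illustration.
pairs = [
    ("the cat sat on the mat", "le chat est assis sur le tapis"),
    ("stock prices fell sharply", "les cours ont fortement chuté"),
    ("a cat sat there", "un chat était assis là"),
]
test_sources = ["the cat sat quietly"]

kept = filter_corpus(pairs, test_sources)
for src, tgt in kept:
    print(src, "|||", tgt)
```

On this toy data the second pair is dropped because its source side shares no bigram with the test sentence. On a real corpus the sentences would be read line by line from the two aligned files rather than held in a list.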
I am looking for a method or a tool for filtering the sentence pairs of a parallel corpus. Note that I do not need to filter the phrase table; I know that the filter_ moses tool already reduces the phrase table size.

Cheers,
--
S. Farzi, Ph.D. Student
Natural Language Processing Lab,
School of Electrical and Computer Eng., Tehran University
Tel: +9821-6111-9719
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support