Hi Ryan My two cents:
First of all, a way of detecting Google-translated text might be to take the original text and run it through Google Translate (if they have access to an API, I mean). Then see if an alleged machine translated-text is similar to this. Here is another pointer to literature for you: http://www.aclweb.org/anthology/P14-2048 <http://www.aclweb.org/anthology/P14-2048> Regards Mathias > On 6 Apr 2018, at 04:14, Ryan Coughlin <rya...@sis.edu> wrote: > > Hi all, > > Hope you all had a happy Easter and are having a good spring (or autumn). > > One of the admins from Open Subtitles asked me if I knew of a way to detect > if a translation was machine or machine or human translation. The website > seems to have a lot of submissions that are simply Google translated, and > they're looking for a script to flag these submissions. Do you have any ideas > how they might go about this? > > I was thinking of running the SL against a few of the popular SMT programs, > and seeing if the BLEU scores were too similar, but the method seems too > random and computationally heavy. I'd appreciate any thoughts. > > thank you, > Ryan > _______________________________________________ > Moses-support mailing list > Moses-support@mit.edu > http://mailman.mit.edu/mailman/listinfo/moses-support
_______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support