Re: [Moses-support] detecting if a translation is machine or human translation

2018-04-06 Thread Kenneth Heafield
Google watermarked their translation output:

https://research.google.com/pubs/archive/37162.pdf

Would be good to check if they're still doing this with neural systems.

On 04/06/2018 09:14 AM, Mathias Müller wrote:
> Hi Ryan
> 
> My two cents:
> 
> First of all, a way of detecting Google-translated text might be to take
> the original text and run it through Google Translate (if they have
> access to an API, I mean). Then see if an alleged machine
> translated-text is similar to this.
> 
> Here is another pointer to literature for you:
> 
> http://www.aclweb.org/anthology/P14-2048
> 
> Regards
> Mathias
> 
>> On 6 Apr 2018, at 04:14, Ryan Coughlin > > wrote:
>>
>> Hi all,
>>
>>   Hope you all had a happy Easter and are having a good spring (or
>> autumn).
>>
>>   One of the admins from Open Subtitles asked me if I knew of a way to
>> detect if a translation was machine or machine or human translation.
>> The website seems to have a lot of submissions that are simply Google
>> translated, and they're looking for a script to flag these
>> submissions. Do you have any ideas how they might go about this?
>>
>>   I was thinking of running the SL against a few of the popular SMT
>> programs, and seeing if the BLEU scores were too similar, but the
>> method seems too random and computationally heavy. I'd appreciate any
>> thoughts.
>>
>> thank you,
>> Ryan
>> ___
>> Moses-support mailing list
>> Moses-support@mit.edu 
>> http://mailman.mit.edu/mailman/listinfo/moses-support
> 
> 
> 
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
> 
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] detecting if a translation is machine or human translation

2018-04-06 Thread Mathias Müller
Hi Ryan

My two cents:

First of all, a way of detecting Google-translated text might be to take the 
original text and run it through Google Translate (if they have access to an 
API, I mean). Then see if an alleged machine translated-text is similar to this.

Here is another pointer to literature for you:

http://www.aclweb.org/anthology/P14-2048 


Regards
Mathias

> On 6 Apr 2018, at 04:14, Ryan Coughlin  wrote:
> 
> Hi all,
> 
>   Hope you all had a happy Easter and are having a good spring (or autumn).
> 
>   One of the admins from Open Subtitles asked me if I knew of a way to detect 
> if a translation was machine or machine or human translation. The website 
> seems to have a lot of submissions that are simply Google translated, and 
> they're looking for a script to flag these submissions. Do you have any ideas 
> how they might go about this?
> 
>   I was thinking of running the SL against a few of the popular SMT programs, 
> and seeing if the BLEU scores were too similar, but the method seems too 
> random and computationally heavy. I'd appreciate any thoughts.
> 
> thank you,
> Ryan
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support

___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] detecting if a translation is machine or human translation

2018-04-05 Thread Philipp Koehn
Hi,

there has been some work on this.

See Section 2.4 in
http://www.aclweb.org/anthology/W16-2347
for some references.

The main ideas are to look for properties of machine translation - i.e.,
that is very literal and may have certain systematic errors.

This is a hard problem, though.

-phi

On Thu, Apr 5, 2018 at 10:14 PM, Ryan Coughlin  wrote:

> Hi all,
>
>   Hope you all had a happy Easter and are having a good spring (or autumn).
>
>   One of the admins from Open Subtitles asked me if I knew of a way to
> detect if a translation was machine or machine or human translation. The
> website seems to have a lot of submissions that are simply Google
> translated, and they're looking for a script to flag these submissions. Do
> you have any ideas how they might go about this?
>
>   I was thinking of running the SL against a few of the popular SMT
> programs, and seeing if the BLEU scores were too similar, but the method
> seems too random and computationally heavy. I'd appreciate any thoughts.
>
> thank you,
> Ryan
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] detecting if a translation is machine or human translation

2018-04-05 Thread Ryan Coughlin
Hi all,

  Hope you all had a happy Easter and are having a good spring (or autumn).

  One of the admins from Open Subtitles asked me if I knew of a way to
detect if a translation was machine or machine or human translation. The
website seems to have a lot of submissions that are simply Google
translated, and they're looking for a script to flag these submissions. Do
you have any ideas how they might go about this?

  I was thinking of running the SL against a few of the popular SMT
programs, and seeing if the BLEU scores were too similar, but the method
seems too random and computationally heavy. I'd appreciate any thoughts.

thank you,
Ryan
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support