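Hi Joe,
You could also look at the entropy of each word's translation
distribution: a word whose probability mass is spread over many
translations is more ambiguous than one with a single dominant
translation. I'll leave Matt to post the one-liner for that one.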
cheers - Barry
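For anyone wanting to try the entropy idea, here is a minimal sketch in Python rather than a one-liner. It is not from the thread. It assumes the standard Moses phrase-table layout, "source ||| target ||| scores ||| ...", and that the third score is the direct translation probability p(target|source); check your own table before relying on that. The function names and the prob_index parameter are made up for illustration. Probabilities are renormalised per source word because a pruned table may not sum to 1.

```python
import gzip
import math
from collections import defaultdict


def source_entropies(lines, prob_index=2, single_words_only=True):
    """Map each source phrase to the entropy (in bits) of p(target | source).

    prob_index=2 assumes the third score is p(target | source); this is an
    assumption about the table layout, not something the thread states.
    """
    probs = defaultdict(list)
    for line in lines:
        fields = [field.strip() for field in line.split("|||")]
        if len(fields) < 3:
            continue  # skip lines that do not look like phrase-table entries
        source, scores = fields[0], fields[2].split()
        if single_words_only and len(source.split()) != 1:
            continue  # the question was about words, so drop longer phrases
        if len(scores) <= prob_index:
            continue
        probs[source].append(float(scores[prob_index]))

    entropies = {}
    for source, values in probs.items():
        total = sum(values)
        if total <= 0:
            continue
        # Renormalise, then sum -p * log2(p) over the non-zero entries.
        entropies[source] = -sum(
            (p / total) * math.log2(p / total) for p in values if p > 0
        )
    return entropies


def top_ambiguous(path, n=5):
    """Return the n source words with the highest translation entropy."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        entropies = source_entropies(handle)
    return sorted(entropies.items(), key=lambda item: item[1], reverse=True)[:n]
```

Calling top_ambiguous("model/phrase-table.gz") would then list the five highest-entropy source words, assuming the table lives at the same path as in Matt's command.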
On 13/05/16 15:10, Matt Post wrote:
gzip -cd model/phrase-table.gz | cut -d\| -f1 | sort | uniq -c | sort -nr | head -n5
(according to one definition of "ambiguous": the number of translation
options listed for each source phrase)
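Note that the pipeline above ranks every source phrase, not only single words. A rough Python sketch of the same count restricted to one-word source entries follows. It is an illustration, not something posted in the thread; the function name is hypothetical and it assumes the usual "source ||| target ||| ..." field separator.

```python
from collections import Counter


def translation_counts(lines, single_words_only=True):
    """Count phrase-table entries (translation options) per source phrase."""
    counts = Counter()
    for line in lines:
        fields = line.split("|||")
        if len(fields) < 2:
            continue  # skip lines without a source and a target field
        source = fields[0].strip()
        if single_words_only and len(source.split()) != 1:
            continue  # keep only one-word source entries
        counts[source] += 1
    return counts
```

Feeding it the lines of the decompressed phrase table and calling .most_common(5) on the result gives the five words with the most translation options.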
On May 11, 2016, at 2:53 AM, Joe Jean <j...@joejean.net> wrote:
Hello,
How would you go about finding the top 5 most ambiguous words in a
translation system just by looking at the phrase table and the
lexical translation tables? Thanks.
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.