Re: [Moses-support] Number of Unique Hypotheses in the N-best List

2015-02-25 Thread Erinç Dikici
Hello again, On Tue, Feb 24, 2015 at 10:18 PM, Rico Sennrich wrote: did you actually cut away the scores? It's possible that you have duplicates with different scores, so they will show up as different lines with 'sort | uniq', but will be merged if you do 'cut -d'|' -f4 | sort | uniq' as

Re: [Moses-support] Number of Unique Hypotheses in the N-best List

2015-02-24 Thread Hieu Hoang
On 24/02/15 19:08, Erinç Dikici wrote: (Apparently the Gmane web interface turned my reply into garbled text, sorry for the double posting) Thanks again for your quick answers. Yes, 32 and 2 are the counts after sort | uniq | wc -l. The total number of hypotheses returned for both cases was

Re: [Moses-support] Number of Unique Hypotheses in the N-best List

2015-02-24 Thread Matthias Huck
somewhere between 2.1 and 3.0, the keyword 'distinct' was Oops, that was me. And it wasn't intended. I'm using this for my own setups and apparently copied it to master when I added some other stuff. Hope I didn't mess up other people's experiments. It's been in master since 7 August 2014

Re: [Moses-support] Number of Unique Hypotheses in the N-best List

2015-02-24 Thread Erinç Dikici
Matthias Huck mhuck@... writes: Hi Erinç, On Tue, 2015-02-24 at 16:24 +, Matthias Huck wrote: I'd assume that your 32 entries of the n-best list weren't actually unique, though, but a number of duplicates of the (two) very same outputs, as distinct should simply avoid duplicate