Hi,

I just ran a count of different sized n-grams in the source side of my phrase 
table and this is what I got.


unigrams     85,233

bigrams       991,701

trigrams   2,697,341

4-grams    3,876,180

5-grams    4,209,094

6-grams    3,702,813

7-grams    2,560,251

8-grams                   0


So, up until the 5-grams the results are what I expected the number is 
increasing. But then it drops for the 6-grams and drops again for the 7-grams.


Does anybody know why?


James

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to