Re: [Moses-support] "source phrase does not match" when training hierarchical model

2011-12-26 Thread Bill_Lang(Gmail)
Hi Hieu Hoang, Thanks for your quick, professional answers. After double-checking, I found this error in my training.out: "Error: unequal numbers of non-terminals. Make sure the text does not contain words in square brackets (like [xxx])." When I analyzed in depth why my 100,000 corpus
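
The error message quoted above points at tokens in square brackets in the training text. One way to locate such lines before training is sketched below. This is only an illustration and not part of Moses; the function name `find_bracketed_lines` and the regular expression are assumptions made for this example.

```python
import re

# Assumed pattern: a token wrapped in square brackets with no inner
# whitespace or nested brackets, e.g. "[xxx]".
BRACKETED = re.compile(r"\[[^\[\]\s]*\]")


def find_bracketed_lines(lines):
    """Return (line_number, text) pairs for lines that contain a [..] token."""
    hits = []
    for number, line in enumerate(lines, start=1):
        if BRACKETED.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits


if __name__ == "__main__":
    sample = ["a plain sentence\n", "see [xxx] here\n", "another one\n"]
    for number, text in find_bracketed_lines(sample):
        print(number, text)
```

In practice the same function could be fed an open file handle for each side of the corpus, since it only iterates over lines.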

Re: [Moses-support] "source phrase does not match" when training hierarchical model

2011-12-26 Thread Hieu Hoang
Hi Lang, this could arise for a number of reasons. To solve it: 1. Make sure the phrase-table.half files were sorted, and make sure that you set LC_ALL=C when you sorted the extract files AND the phrase-table.half files. 2. Double-check that you don't have control characters in your corpus, and
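
Both checks suggested above can be roughly approximated with a short script. The sketch below is not part of Moses, and the helper names `lines_with_control_chars` and `is_byte_sorted` are made up for this example. It assumes that sorting under LC_ALL=C amounts to comparing lines byte by byte, and that "control characters" means ASCII control codes other than tab and newline.

```python
import re

# Assumption: ASCII control codes except tab (0x09) and newline (0x0a).
CONTROL_CHARS = re.compile(r"[\x00-\x08\x0b-\x1f\x7f]")


def lines_with_control_chars(lines):
    """Return the 1-based numbers of lines that contain a control character."""
    return [
        number
        for number, line in enumerate(lines, start=1)
        if CONTROL_CHARS.search(line.rstrip("\n"))
    ]


def is_byte_sorted(lines):
    """Return True if the lines are in non-decreasing byte order.

    The trailing newline is stripped before comparing so that it does not
    take part in the ordering.
    """
    keys = [line.rstrip("\n").encode("utf-8") for line in lines]
    return all(a <= b for a, b in zip(keys, keys[1:]))


if __name__ == "__main__":
    print(lines_with_control_chars(["ok\n", "bad\x01here\n", "tab\tfine\n"]))
    print(is_byte_sorted(["A\n", "B\n", "a\n"]))
```

A file that passes `is_byte_sorted` is consistent with byte-order sorting, while a file sorted under a different locale would typically fail it because uppercase and lowercase letters get interleaved.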

[Moses-support] "source phrase does not match" when training hierarchical model

2011-12-25 Thread Bill_Lang(Gmail)
Hi Moses friends, I have run the Moses hierarchical training scripts several times. When I used a small corpus (100,000 sentence pairs), everything was OK, including the final BLEU scores. But when I used a bigger corpus (about 300,000 sentence pairs), I got the same errors each time, as follows: ---