Hi Hieu Hoang,
Thanks for your so professional quick answers. After double check
again, I found such error in my training.out, "Error: unequal numbers of
non-terminals. Make sure the text does not contain words in square brackets
(like [xxx])."
When I deep analysis why my 100,000 corpus
hi lang
this could arise for a number of reasons. TO solve it:
1. make sure the phrase-table.half were sorted, and make sure that you set
LC_ALL=C
when you sorted the extract files AND the phrase-table.half files.
2. double check that you don't have control characters in your corpus, and
Hi moses friends,
I am running moses hierarchical training scripts several times. When
I used small corpus (100,000 sentence pairs), everything was ok with the
final bleus. But when I used one bigger curpus (about 300,000 sentence
pairs), there were the same errors as follows:
---