Hi Nadeem
If your sgm file has 800 sentences, and your tc file only has 1
sentences then clearly something has gone wrong in between. Check the
outputs of the intervening stages to see where the sentences are lost,
running the steps manually if necessary. It depends on your setup, but
Hi Barry;
I checked all the input steps files generated by EMS but cannot find any thing
wrong there anywhere not in sgm file names nor the formatting of those ref and
src files.
Cannot getting it as its just not working on my 800 above testset otherwise
whole experiment done successfully when
Hi Nadeem
It looks like something went wrong earlier in the EVALUATION section,
possibly in the input-from-sgm step. I would check all the steps in this
section for errors.
It is also not clear to me that the truecaser will work with Hindi as it
is designed for languages written in the latin