Re: [Moses-support] EMS-Decoder-Problem

2013-12-17 Thread Barry Haddow
Hi Nadeem

If your sgm file has 800 sentences, and your tc file only has 1 
sentences then clearly something has gone wrong in between. Check the 
outputs of the intervening stages to see where the sentences are lost, 
running the steps manually if necessary. It depends on your setup, but 
probably there will be input-from-sgm, tokenisation and truecasing. Are 
you using the standard Moses tools for Hindi?

The formatting of the sgm file is quite delicate, so check it carefully. 
It's easy to make a mistake that is hard to spot and the nist-bleu 
script does not give helpful error messages,

cheers - Barry

On 17/12/13 01:05, nadeem khan wrote:
> Hi Barry;
>
> I checked all the input steps files generated by EMS but cannot find 
> any thing wrong there anywhere not in sgm file names nor the 
> formatting of those ref and src files.
> Cannot getting it as its just not working on my 800 above testset 
> otherwise whole experiment done successfully when using the fr-eng toy 
> data set.
>
>
> Regards
> Nadeem
>
>
> On Tuesday, December 10, 2013 9:52 PM, Barry Haddow 
>  wrote:
> Hi Nadeem
>
> It looks like something went wrong earlier in the EVALUATION section,
> possibly in the input-from-sgm step. I would check all the steps in this
> section for errors.
>
> It is also not clear to me that the truecaser will work with Hindi as it
> is designed for languages written in the latin script,
>
> cheers - Barry
>
> On 07/12/13 18:51, nadeem khan wrote:
> >
> >
> >
> > Hello Sir;
> >
> >I am using EMS now and getting into a problem with my data of hindi
> > language.
> > I ran EMS on config.toy just fine there was not a single error but
> > when it comes to my own data and experiment I am getting stuck with
> > BLEU and BLEU-c Crashed.
> > When I invistaged the problem there is only 1 single Input Segment in
> > test.input.tc.1. why and how the EMS taking only 1 segment from my
> > input test-src.sgm file? and when I investigated further there is a
> > fatal error under EVALUATION_test_nist-bleu-c.1.STDERR of no id in
> > srcset. why I am getting that as I am giving it the complete sgm
> > frame for wrapping out the output.
> >
> > I am sending you my those testdata sgm file as well as the input and
> > output generated by EMS for my dataset.
> > Please have a look at it and Reply with your kind comments to resolve
> > these  issues
> > Waiting for your kind response
> >
> > THANK YOU
> > Regards
> > nadeem
> >
> >
> >
> >
> > ___
> > Moses-support mailing list
> > Moses-support@mit.edu 
> > http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
>
> -- 
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
>
>
>
>


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] EMS-Decoder-Problem

2013-12-16 Thread nadeem khan
Hi Barry;

I checked all the input steps files generated by EMS but cannot find any thing 
wrong there anywhere not in sgm file names nor the formatting of those ref and 
src files.
Cannot getting it as its just not working on my 800 above testset otherwise 
whole experiment done successfully when using the fr-eng toy data set.


Regards
Nadeem



On Tuesday, December 10, 2013 9:52 PM, Barry Haddow 
 wrote:
 
Hi Nadeem

It looks like something went wrong earlier in the EVALUATION section, 
possibly in the input-from-sgm step. I would check all the steps in this 
section for errors.

It is also not clear to me that the truecaser will work with Hindi as it 
is designed for languages written in the latin script,

cheers - Barry


On 07/12/13 18:51, nadeem khan wrote:
>
>
>
> Hello Sir;
>
>    I am using EMS now and getting into a problem with my data of hindi 
> language.
> I ran EMS on config.toy just fine there was not a single error but 
> when it comes to my own data and experiment I am getting stuck with 
> BLEU and BLEU-c Crashed.
> When I invistaged the problem there is only 1 single Input Segment in 
> test.input.tc.1. why and how the EMS taking only 1 segment from my 
> input test-src.sgm file? and when I investigated further there is a 
> fatal error under EVALUATION_test_nist-bleu-c.1.STDERR of no id in 
> srcset. why I am getting that as I am giving it the complete sgm 
> frame for wrapping out the output.
>
> I am sending you my those testdata sgm file as well as the input and 
> output generated by EMS for my dataset.
> Please have a look at it and Reply with your kind comments to resolve 
> these  issues
> Waiting for your kind response
>
> THANK YOU
> Regards
> nadeem
>
>
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] EMS-Decoder-Problem

2013-12-10 Thread Barry Haddow
Hi Nadeem

It looks like something went wrong earlier in the EVALUATION section, 
possibly in the input-from-sgm step. I would check all the steps in this 
section for errors.

It is also not clear to me that the truecaser will work with Hindi as it 
is designed for languages written in the latin script,

cheers - Barry

On 07/12/13 18:51, nadeem khan wrote:
>
>
>
> Hello Sir;
>
>I am using EMS now and getting into a problem with my data of hindi 
> language.
> I ran EMS on config.toy just fine there was not a single error but 
> when it comes to my own data and experiment I am getting stuck with 
> BLEU and BLEU-c Crashed.
> When I invistaged the problem there is only 1 single Input Segment in 
> test.input.tc.1. why and how the EMS taking only 1 segment from my 
> input test-src.sgm file? and when I investigated further there is a 
> fatal error under EVALUATION_test_nist-bleu-c.1.STDERR of no id in 
> srcset. why I am getting that as I am giving it the complete sgm 
> frame for wrapping out the output.
>
> I am sending you my those testdata sgm file as well as the input and 
> output generated by EMS for my dataset.
> Please have a look at it and Reply with your kind comments to resolve 
> these  issues
> Waiting for your kind response
>
> THANK YOU
> Regards
> nadeem
>
>
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support


-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support