[Moses-support] EMS, mgiza, and SGE

Lane Schwartz Wed, 19 Sep 2012 12:19:52 -0700

I'm trying to get up to speed with EMS. I have a small dataset (IWSLT
2008) that I use to train, tune, and test.


I am able to reliably run EMS on my data on a single machine.

My config file specifies jobs=10 and qsub-settings="-l
hostname=*machinesA*|*machinesB*|*machinesC*" where the hostname
patterns match machine names in my grid.
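
For reference, the two cluster settings described above would look
roughly like this in an EMS config file. This is a sketch, not the
actual file: placement in the [GENERAL] section is assumed from the
example configs shipped with Moses, and the hostname patterns are the
same placeholders used above.

```
[GENERAL]

# assumed placement: cluster settings in the [GENERAL] section

# number of jobs to be submitted in parallel
jobs = 10

# arguments passed to qsub when scheduling a job
# (hostname patterns are placeholders, not real machine names)
qsub-settings = "-l hostname=*machinesA*|*machinesB*|*machinesC*"
```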

When I run experiment.perl with the -cluster flag, the experiment
runs, but it consistently dies while running run-giza and
run-giza-inverse. Strangely, when I look in the steps directory and
the training directory, it appears that mgiza has run successfully in
both directions. I don't see any error messages. Does anyone have any
idea what might be going on here?

The exact same config file runs successfully when I launch
experiment.perl without the -cluster flag. With the -cluster flag,
every step succeeds until EMS reaches the giza steps; those appear to
complete, and then EMS dies.

Thanks,
Lane Schwartz