I'm trying to get up to speed with EMS. I have a small dataset (IWSLT 2008) that I am using to train, tune, and test.
I am able to reliably run EMS on my data on a single machine. My config file specifies jobs=10 and qsub-settings="-l hostname=*machinesA*|*machinesB*|*machinesC*", where the hostname patterns match machine names in my grid.

When I run experiment.perl with the -cluster flag, everything runs successfully until it reaches the run-giza and run-giza-inverse steps, where the experiment consistently dies. Strangely, when I look in the steps directory and the training directory, it appears that mgiza has run successfully in both directions, and I don't see any error messages.

I am using the exact same config file in both cases, and it runs successfully when I launch experiment.perl without the -cluster flag. Does anyone have any idea what might be going on here?

Thanks,
Lane Schwartz
