Lewis John McGibbney created JOSHUA-304:
-------------------------------------------

             Summary: word-align.conf alignment template file not compatible with Berkeley aligner
                 Key: JOSHUA-304
                 URL: https://issues.apache.org/jira/browse/JOSHUA-304
             Project: Joshua
          Issue Type: Bug
          Components: templates, berkeley, alignment
    Affects Versions: 6.0.5
            Reporter: Lewis John McGibbney
            Priority: Blocker
             Fix For: 6.1


It took me quite some time to debug why pipelines were failing when using the 
Berkeley aligner.
It turns out that the word-align.conf template provided at
https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf
is not compatible with the Berkeley aligner.
In particular, the following lines are incompatible:
https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf#L12-L15

Evidence of this is provided below. Each run was made after editing one of the offending values in word-align.conf:
{code}
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1, HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Invalid enum: 'JOINT JOINT'; valid choices: FORWARD|REVERSE|BOTH_INDEP|JOINT
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Exception in thread "main" java.lang.NumberFormatException: For input string: "5 5"
        at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
        at java.lang.Integer.parseInt(Integer.java:580)
        at java.lang.Integer.parseInt(Integer.java:615)
        at edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:143)
        at edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:240)
        at edu.berkeley.nlp.fig.basic.OptInfo.set(OptionsParser.java:294)
        at edu.berkeley.nlp.fig.basic.OptionsParser.readOptionsFile(OptionsParser.java:555)
        at edu.berkeley.nlp.fig.basic.OptionsParser.doParse(OptionsParser.java:604)
        at edu.berkeley.nlp.fig.exec.Execution.init(Execution.java:293)
        at edu.berkeley.nlp.wordAlignment.Main.main(Main.java:149)
lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
Cannot create directory: alignments/0
{code}
{code}
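
The failures above share one pattern: the aligner rejects any option whose value holds more than one token ('MODEL1 HMM', 'JOINT JOINT', "5 5"). As a rough aid for spotting such lines before launching a pipeline, here is a minimal Python sketch. It is not part of Joshua or the Berkeley aligner. It assumes each option line has the form "key<whitespace>value" with optional "#" comments, and the function name find_multi_valued and the option names in the usage note are hypothetical, chosen only for illustration.

```python
import re


def find_multi_valued(conf_text):
    """Return (line_number, key, value) for options whose value has several tokens.

    Assumes one "key<whitespace>value" pair per line and "#" comments;
    this layout is an assumption, not a documented format.
    """
    problems = []
    for number, line in enumerate(conf_text.splitlines(), start=1):
        # Drop anything after a "#" and surrounding whitespace.
        stripped = line.split("#", 1)[0].strip()
        if not stripped:
            continue
        # Split into the option name and the rest of the line.
        parts = stripped.split(None, 1)
        if len(parts) < 2:
            continue
        key, value = parts
        # Values separated by spaces or commas count as multiple tokens.
        tokens = [t for t in re.split(r"[,\s]+", value) if t]
        if len(tokens) > 1:
            problems.append((number, key, value))
    return problems
```

Calling find_multi_valued on text such as "someModels MODEL1 HMM" followed by "someIters 5 5" (hypothetical option names) would report both lines, while a single-token value such as "alignments/0" passes. The check only flags the pattern seen in the errors above; it does not know which options the aligner actually accepts.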



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
