[jira] [Closed] (JOSHUA-302) Remove concurrent package and replace with builtins
[ https://issues.apache.org/jira/browse/JOSHUA-302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Post closed JOSHUA-302.
----------------------------
    Resolution: Fixed

This was fixed in an earlier commit. The inclusion of the specified libraries was likely done inadvertently through clumsy suggestions by Eclipse.

> Remove concurrent package and replace with builtins
> ---------------------------------------------------
>
> Key: JOSHUA-302
> URL: https://issues.apache.org/jira/browse/JOSHUA-302
> Project: Joshua
> Issue Type: Improvement
> Reporter: Max Thomas
> Priority: Minor
>
> According to this site:
> http://gee.cs.oswego.edu/dl/classes/EDU/oswego/cs/dl/util/concurrent/intro.html
> this package is essentially replaced by SDK builtins from JDK5.0+.
> Is there a reason this is still a dependency and cannot be replaced with
> default library code (which, according to the site, "includes improved, more
> efficient, standardized versions of the main components in this package")?
> It seems to only be used in 2 places, one of which is about 3000 lines of
> copy/pasted code.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
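For readers following the JOSHUA-302 discussion, here is a minimal sketch of what "replace with builtins" can look like in practice. It is not code from Joshua: the class name `BuiltinConcurrencyExample` and the counting task are hypothetical, and the mapping in the comments (for example `PooledExecutor` to an `ExecutorService`) is only a rough correspondence.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical example; none of these names come from the Joshua codebase.
public class BuiltinConcurrencyExample {

  /** Runs `tasks` trivial jobs on a fixed pool and returns how many completed. */
  public static int runTasks(int tasks, int threads) throws InterruptedException {
    // Roughly the role PooledExecutor played in the old util.concurrent package.
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    // Roughly the role of the old Latch / CountDown classes.
    CountDownLatch done = new CountDownLatch(tasks);
    // Roughly the role of the old SynchronizedInt class.
    AtomicInteger counter = new AtomicInteger();

    for (int i = 0; i < tasks; i++) {
      pool.execute(() -> {
        counter.incrementAndGet();
        done.countDown();
      });
    }

    done.await();                                  // wait for every task to finish
    pool.shutdown();                               // stop accepting new work
    pool.awaitTermination(10, TimeUnit.SECONDS);   // let worker threads exit
    return counter.get();
  }

  public static void main(String[] args) throws InterruptedException {
    System.out.println(runTasks(100, 4));
  }
}
```

Everything used here ships with the JDK since 5.0 (lambdas need Java 8), so no third-party concurrency jar is required.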
[jira] [Updated] (JOSHUA-295) Revamp dependency organization in Joshua
[ https://issues.apache.org/jira/browse/JOSHUA-295?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Post updated JOSHUA-295:
-----------------------------
    Fix Version/s: 6.2

> Revamp dependency organization in Joshua
> ----------------------------------------
>
> Key: JOSHUA-295
> URL: https://issues.apache.org/jira/browse/JOSHUA-295
> Project: Joshua
> Issue Type: Improvement
> Affects Versions: 6.2
> Reporter: Kellen Sunderland
> Fix For: 6.2
>
> We would like to separate dependencies in Joshua by creating a multi-module
> Maven project. This will allow us to decouple our codebase and make it more
> modular. This means consumers of Joshua who are only interested in a core
> library do not have to pull in dependencies for things like HTTP servers or
> database clients.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Updated] (JOSHUA-306) Translations.java consumes potentially infinite resources
[ https://issues.apache.org/jira/browse/JOSHUA-306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Post updated JOSHUA-306:
-----------------------------
    Fix Version/s: 6.2

> Translations.java consumes potentially infinite resources
> ---------------------------------------------------------
>
> Key: JOSHUA-306
> URL: https://issues.apache.org/jira/browse/JOSHUA-306
> Project: Joshua
> Issue Type: Bug
> Reporter: Matt Post
> Fix For: 6.2
>
> Translations is used to asynchronously provide a caller with access to
> translations as they are produced. However, it is implemented as an
> underlying synchronized list that grows and grows. In the presence of an
> infinite stream (STDIN?), this will eventually consume all resources. This
> might not be much to worry about, but maybe we should have old translations
> be expunged as they are consumed.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
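One way to read the "expunge as they are consumed" suggestion in JOSHUA-306 is a bounded queue in place of a growing list. The sketch below is an assumption about how that could look, not the actual `Translations` class: `DrainingTranslations`, `record`, `finish`, `next`, and `demo` are hypothetical names, a single consumer is assumed, and results are delivered in the order they were recorded.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical sketch; not the Translations class from Joshua.
public class DrainingTranslations {
  // Unique sentinel object, compared by identity, that marks the end of the stream.
  private static final String END = new String("END");

  private final BlockingQueue<String> queue;
  private boolean finished = false; // only touched by the single consumer

  public DrainingTranslations(int capacity) {
    // A bounded queue: producers block once `capacity` items are waiting.
    this.queue = new LinkedBlockingQueue<>(capacity);
  }

  /** Called by the producer for each finished translation. */
  public void record(String translation) throws InterruptedException {
    queue.put(translation);
  }

  /** Called by the producer once the input stream is exhausted. */
  public void finish() throws InterruptedException {
    queue.put(END);
  }

  /** Returns the next translation, or null when the stream has ended. */
  public String next() throws InterruptedException {
    if (finished) {
      return null;
    }
    String t = queue.take(); // taking an item removes it, so memory is released
    if (t == END) {
      finished = true;
      return null;
    }
    return t;
  }

  /** Small driver: one producer thread, the caller acts as the consumer. */
  public static List<String> demo(int n, int capacity) throws InterruptedException {
    DrainingTranslations translations = new DrainingTranslations(capacity);
    Thread producer = new Thread(() -> {
      try {
        for (int i = 0; i < n; i++) {
          translations.record("t" + i);
        }
        translations.finish();
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
      }
    });
    producer.start();

    List<String> seen = new ArrayList<>();
    for (String t = translations.next(); t != null; t = translations.next()) {
      seen.add(t);
    }
    producer.join();
    return seen;
  }
}
```

With this shape, memory use is capped by the queue capacity rather than by the length of the input stream; the trade-off is that a slow consumer now applies back-pressure to the producer.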
[jira] [Updated] (JOSHUA-291) Improve code quality via static analysis
[ https://issues.apache.org/jira/browse/JOSHUA-291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Post updated JOSHUA-291:
-----------------------------
    Fix Version/s: 6.2

> Improve code quality via static analysis
> ----------------------------------------
>
> Key: JOSHUA-291
> URL: https://issues.apache.org/jira/browse/JOSHUA-291
> Project: Joshua
> Issue Type: Improvement
> Components: core
> Reporter: Tommaso Teofili
> Assignee: Tommaso Teofili
> Fix For: 6.2
>
> We can improve code quality / readability leveraging code analysis from tools
> like FindBugs and others integrated in IDEs.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Commented] (JOSHUA-291) Improve code quality via static analysis
[ https://issues.apache.org/jira/browse/JOSHUA-291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15447853#comment-15447853 ]

Matt Post commented on JOSHUA-291:
----------------------------------

[~maxthomas], do you want to take a look at the 7 branch? This was merged into master but may have gotten dropped when I merged a number of changes on master into 7.

> Improve code quality via static analysis
> ----------------------------------------
>
> Key: JOSHUA-291
> URL: https://issues.apache.org/jira/browse/JOSHUA-291
> Project: Joshua
> Issue Type: Improvement
> Components: core
> Reporter: Tommaso Teofili
> Assignee: Tommaso Teofili
> Fix For: 6.2
>
> We can improve code quality / readability leveraging code analysis from tools
> like FindBugs and others integrated in IDEs.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Resolved] (JOSHUA-296) Refactor threading code
[ https://issues.apache.org/jira/browse/JOSHUA-296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kellen Sunderland resolved JOSHUA-296.
--------------------------------------
    Resolution: Fixed

Fixed in this PR https://github.com/apache/incubator-joshua/pull/45

> Refactor threading code
> -----------------------
>
> Key: JOSHUA-296
> URL: https://issues.apache.org/jira/browse/JOSHUA-296
> Project: Joshua
> Issue Type: Improvement
> Reporter: Matt Post
> Assignee: Kellen Sunderland
> Priority: Minor
> Fix For: 6.1
>
> The thread-handling code is a bit more complicated than it needs to be. We'd
> like to simplify this using Executors while maintaining the current
> stream-based processing features:
> - Input stream: decoding starts and is multithreaded even before the whole
> input has been received (e.g., so that STDIN works)
> - Multithreading: translations are automatically assigned across threads in a
> thread pool
> - Output stream: decoding returns right away and callers can block while
> waiting for translations to assemble

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
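The three requirements listed in JOSHUA-296 can be sketched with `java.util.concurrent` as follows. This is an illustrative assumption, not the code merged in the pull request: `StreamingDecoderSketch`, `decodeAsync`, `collect`, and the upper-casing `decode` stand-in are all hypothetical.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.Callable;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical sketch; not the implementation from the Joshua pull request.
public class StreamingDecoderSketch {
  // Marker future, compared by identity, that signals the end of the input.
  private static final Future<String> END = CompletableFuture.completedFuture(null);

  /** Stand-in for real decoding work (assumption: decoding is a pure function). */
  static String decode(String sentence) {
    return sentence.toUpperCase();
  }

  /**
   * Returns immediately. A feeder thread reads lines as they arrive and submits
   * each one to a thread pool; futures are queued in input order.
   */
  public static BlockingQueue<Future<String>> decodeAsync(BufferedReader in, int threads) {
    ExecutorService pool = Executors.newFixedThreadPool(threads);
    BlockingQueue<Future<String>> results = new LinkedBlockingQueue<>();

    Thread feeder = new Thread(() -> {
      try {
        String line;
        while ((line = in.readLine()) != null) {   // input stream: no need to read it all first
          final String sentence = line;
          Callable<String> task = () -> decode(sentence);
          results.add(pool.submit(task));          // multithreading: the pool assigns the work
        }
      } catch (IOException e) {
        throw new UncheckedIOException(e);
      } finally {
        results.add(END);                          // always unblock the consumer
        pool.shutdown();
      }
    });
    feeder.start();
    return results;
  }

  /** Output stream: blocks on each future in turn, so results keep input order. */
  public static List<String> collect(BlockingQueue<Future<String>> results)
      throws InterruptedException, ExecutionException {
    List<String> out = new ArrayList<>();
    for (Future<String> f = results.take(); f != END; f = results.take()) {
      out.add(f.get());
    }
    return out;
  }
}
```

Queuing the futures, rather than the finished strings, is what preserves input order even when later sentences finish decoding first. The results queue here is unbounded, which is the same growth concern raised in JOSHUA-306.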
[jira] [Commented] (JOSHUA-307) Java-based tokenization and normalization
[ https://issues.apache.org/jira/browse/JOSHUA-307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15447105#comment-15447105 ]

Kellen Sunderland commented on JOSHUA-307:
------------------------------------------

+1. This would be great, and could go into the CLI module.

> Java-based tokenization and normalization
> -----------------------------------------
>
> Key: JOSHUA-307
> URL: https://issues.apache.org/jira/browse/JOSHUA-307
> Project: Joshua
> Issue Type: Improvement
> Reporter: Matt Post
> Priority: Minor
> Fix For: 6.2
>
> Currently, Joshua expects data to be lowercased, normalized, and tokenized
> consistent with the way the training data was prepared before being passed
> in. This requires calling Perl scripts on the input data. It would be nice if
> these Perl scripts (located under $JOSHUA/scripts/preparation) were rewritten
> in Java (under org.apache.joshua.util) so that Joshua could do this
> normalization itself. This would be particularly useful for the language
> packs.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Created] (JOSHUA-307) Java-based tokenization and normalization
Matt Post created JOSHUA-307:
--------------------------------

Summary: Java-based tokenization and normalization
Key: JOSHUA-307
URL: https://issues.apache.org/jira/browse/JOSHUA-307
Project: Joshua
Issue Type: Improvement
Reporter: Matt Post
Priority: Minor
Fix For: 6.2

Currently, Joshua expects data to be lowercased, normalized, and tokenized consistent with the way the training data was prepared before being passed in. This requires calling Perl scripts on the input data. It would be nice if these Perl scripts (located under $JOSHUA/scripts/preparation) were rewritten in Java (under org.apache.joshua.util) so that Joshua could do this normalization itself. This would be particularly useful for the language packs.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
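To give a feel for the kind of preprocessing JOSHUA-307 asks for, here is a deliberately small sketch. It does not reproduce the behavior of the Perl scripts under $JOSHUA/scripts/preparation; the class name `SimpleNormalizer` and the specific rules (NFKC normalization, lowercasing, splitting off ASCII punctuation) are assumptions chosen only for illustration.

```java
import java.text.Normalizer;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Locale;

// Hypothetical sketch; the rules here are NOT those of Joshua's Perl scripts.
public class SimpleNormalizer {

  /** Unicode-normalizes, lowercases, and collapses whitespace. */
  public static String normalize(String text) {
    String nfkc = Normalizer.normalize(text, Normalizer.Form.NFKC);
    return nfkc.toLowerCase(Locale.ROOT).replaceAll("\\s+", " ").trim();
  }

  /** Normalizes, then splits each ASCII punctuation mark off as its own token. */
  public static List<String> tokenize(String text) {
    String spaced = normalize(text).replaceAll("(\\p{Punct})", " $1 ").trim();
    if (spaced.isEmpty()) {
      return new ArrayList<>();
    }
    return new ArrayList<>(Arrays.asList(spaced.split("\\s+")));
  }

  public static void main(String[] args) {
    System.out.println(tokenize("Hello, World!"));
  }
}
```

A real port would need to match the training-time preparation exactly (for instance, this sketch splits "don't" into three tokens), since the issue stresses consistency with how the training data was prepared.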
[jira] [Resolved] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matt Post resolved JOSHUA-304.
------------------------------
    Resolution: Fixed

> word-align.conf alignment template file not compatible with berkeley aligner
> ----------------------------------------------------------------------------
>
> Key: JOSHUA-304
> URL: https://issues.apache.org/jira/browse/JOSHUA-304
> Project: Joshua
> Issue Type: Bug
> Components: alignment, berkeley, templates
> Affects Versions: 6.0.5
> Reporter: Lewis John McGibbney
> Priority: Blocker
> Fix For: 6.1
>
> It took me quite some time to debug what was going on and why pipelines
> were failing when using the berkeley aligner.
> It turns out that the word-align.conf template provided at
> https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf
> is not compatible with the berkeley aligner.
> In particular the following lines are incompatible
> https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf#L12-L15
> Evidence of this is provided below
> {code}
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Invalid enum: 'MODEL1, HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Invalid enum: 'JOINT JOINT'; valid choices: FORWARD|REVERSE|BOTH_INDEP|JOINT
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Exception in thread "main" java.lang.NumberFormatException: For input string: "5 5"
>     at java.lang.NumberFormatException.forInputString(NumberFormatException.java:65)
>     at java.lang.Integer.parseInt(Integer.java:580)
>     at java.lang.Integer.parseInt(Integer.java:615)
>     at edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:143)
>     at edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:240)
>     at edu.berkeley.nlp.fig.basic.OptInfo.set(OptionsParser.java:294)
>     at edu.berkeley.nlp.fig.basic.OptionsParser.readOptionsFile(OptionsParser.java:555)
>     at edu.berkeley.nlp.fig.basic.OptionsParser.doParse(OptionsParser.java:604)
>     at edu.berkeley.nlp.fig.exec.Execution.init(Execution.java:293)
>     at edu.berkeley.nlp.wordAlignment.Main.main(Main.java:149)
> lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf
> Cannot create directory: alignments/0
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
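The errors in the JOSHUA-304 log all have the same shape: a single option was given two space-separated values ('MODEL1 HMM', 'JOINT JOINT', "5 5") where the parser expected exactly one. The sketch below reproduces that failure mode with plain JDK calls. It is not the Berkeley aligner's option parser; `OptionValueCheck` and `ModelType` are hypothetical, with the enum constants copied from the "valid choices" list in the log.

```java
// Hypothetical sketch; not the Berkeley aligner's OptionsParser.
public class OptionValueCheck {

  // Constants mirror the "valid choices" printed in the error message above.
  enum ModelType { MODEL1, MODEL2, HMM, SYNTACTIC, NONE }

  /** True if the raw option value parses as a single integer. */
  public static boolean isValidInt(String value) {
    try {
      Integer.parseInt(value);
      return true;
    } catch (NumberFormatException e) {
      return false; // e.g. "5 5" contains a space, so it is rejected
    }
  }

  /** True if the raw option value names exactly one ModelType constant. */
  public static boolean isValidModel(String value) {
    try {
      ModelType.valueOf(value);
      return true;
    } catch (IllegalArgumentException e) {
      return false; // e.g. "MODEL1 HMM" is not the name of any constant
    }
  }

  public static void main(String[] args) {
    System.out.println("int '5 5' valid? " + isValidInt("5 5"));
    System.out.println("model 'MODEL1 HMM' valid? " + isValidModel("MODEL1 HMM"));
  }
}
```

Seen this way, the fix is on the configuration side: each of the affected template lines needs to supply one value per option.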
[jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446876#comment-15446876 ]

Lewis John McGibbney commented on JOSHUA-304:
---------------------------------------------

[~post] np at all. No need for sorry. I just tested after clean download of third party deps that this works a charm. Thanks for looking in to it I really appreciate it. I am +1 for merge into master and resolve this as fixed [~post]
[jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446657#comment-15446657 ]

Matt Post commented on JOSHUA-304:
----------------------------------

Sorry, you have to also install the Berkeley aligner jar. See the last two lines of download-deps.sh. Then it should work.
[jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446643#comment-15446643 ]

Lewis John McGibbney commented on JOSHUA-304:
---------------------------------------------

Hi [~post]
What new steps did you actually add? I've wiped everything that was generated by Joshua. I've rebuilt JOSHUA-304 branch. I'm getting the following
{code}
$JOSHUA/bin/pipeline.pl --type hiero --rundir /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0 --readme "Baseline Hiero run 0 --lm-gen berkeleylm --lm berkeleylm --aligner berkeley JOSHUA-304" --source es --target en --lm-gen berkeleylm --lm berkeleylm --aligner berkeley --corpus $SPANISH/corpus/asr/callhome_train --corpus $SPANISH/corpus/asr/fisher_train --tune $SPANISH/corpus/asr/fisher_dev --test $SPANISH/corpus/asr/callhome_devtest
... snip ...
[test-vocab-es] rebuilding...
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/corpus.es [CHANGED]
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/vocab.es [NOT FOUND]
  cmd=cat /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/corpus.es | /usr/local/incubator-joshua/scripts/training/build-vocab.pl > /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/vocab.es
  took 0 seconds (0s)
[test-vocab-en] rebuilding...
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/corpus.en [CHANGED]
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/vocab.en [NOT FOUND]
  cmd=cat /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/corpus.en | /usr/local/incubator-joshua/scripts/training/build-vocab.pl > /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/test/vocab.en
  took 0 seconds (0s)
[source-numlines] rebuilding...
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/train/corpus.es [CHANGED]
  cmd=cat /usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/train/corpus.es | wc -l
  took 0 seconds (0s)
[source-numlines] retrieved cached result => 151810
[berkeley-aligner-chunk-0] rebuilding...
  dep=alignments/0/word-align.conf [CHANGED]
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/train/splits/corpus.es.0 [NOT FOUND]
  dep=/usr/local/jpl/xdata/joshua_experiments/fisher_callhome_experiment/0/data/train/splits/corpus.en.0 [NOT FOUND]
  dep=alignments/0/training.align [NOT FOUND]
  cmd=java -d64 -Xmx10g -jar /usr/local/incubator-joshua/ext/berkeleyaligner/distribution/berkeleyaligner.jar ++alignments/0/word-align.conf
  JOB FAILED (return code 1)
[aligner-combine] rebuilding...
  dep=alignments/0/training.en-es.align [NOT FOUND]
  dep=alignments/training.align [NOT FOUND]
  cmd=cat alignments/0/training.en-es.align > alignments/training.align
  JOB FAILED (return code 1)
cat: alignments/0/training.en-es.align: No such file or directory
{code}
[jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446555#comment-15446555 ]

Matt Post commented on JOSHUA-304:
----------------------------------

It's easiest if you just wipe everything, but you could remove just alignments/ and data/train/
Re: [jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
Hi Matt do you mean the '.cachepipe'?

On Aug 29, 2016 10:19 AM, "Matt Post (JIRA)" wrote:

> [ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446470#comment-15446470 ]
>
> Matt Post commented on JOSHUA-304:
> ----------------------------------
>
> I emailed days ago but don't see that it posted here. You need to wipe out
> your old run and re-run. I can see that the new versions of the commands
> were not run.
[jira] [Commented] (JOSHUA-304) word-align.conf alignment template file not compatible with berkeley aligner
[ https://issues.apache.org/jira/browse/JOSHUA-304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446470#comment-15446470 ] Matt Post commented on JOSHUA-304: -- I emailed days ago but don't see that it posted here. You need to wipe out your old run and re-run. I can see that the new versions of the commands were not run. > word-align.conf alignment template file not compatible with berkeley aligner > > > Key: JOSHUA-304 > URL: https://issues.apache.org/jira/browse/JOSHUA-304 > Project: Joshua > Issue Type: Bug > Components: alignment, berkeley, templates >Affects Versions: 6.0.5 >Reporter: Lewis John McGibbney >Priority: Blocker > Fix For: 6.1 > > > It took me quite some time to debug what was going on and why pipelines > were failing when using the berkeley aligner. > It turns out that the word-align.conf template provided at > https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf > is not compatible with the berkeley aligner. 
> In particular the following lines are non compatible > https://github.com/apache/incubator-joshua/blob/master/scripts/training/templates/alignment/word-align.conf#L12-L15 > Evidence of this is provided below > {code} > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Invalid enum: 'MODEL1, HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Invalid enum: 'MODEL1 HMM'; valid choices: MODEL1|MODEL2|HMM|SYNTACTIC|NONE > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Invalid enum: 'JOINT JOINT'; valid choices: FORWARD|REVERSE|BOTH_INDEP|JOINT > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Exception in thread "main" java.lang.NumberFormatException: For input string: > "5 5" > at > java.lang.NumberFormatException.forInputString(NumberFormatException.java:65) > at java.lang.Integer.parseInt(Integer.java:580) > at 
java.lang.Integer.parseInt(Integer.java:615) > at > edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:143) > at > edu.berkeley.nlp.fig.basic.OptInfo.interpretValue(OptionsParser.java:240) > at edu.berkeley.nlp.fig.basic.OptInfo.set(OptionsParser.java:294) > at > edu.berkeley.nlp.fig.basic.OptionsParser.readOptionsFile(OptionsParser.java:555) > at > edu.berkeley.nlp.fig.basic.OptionsParser.doParse(OptionsParser.java:604) > at edu.berkeley.nlp.fig.exec.Execution.init(Execution.java:293) > at edu.berkeley.nlp.wordAlignment.Main.main(Main.java:149) > lmcgibbn@LMC-032857 /usr/local/incubator-joshua/lib(master) $ java -d64 > -Xmx10g -jar /usr/local/incubator-joshua/lib/berkeleyaligner.jar > ++/usr/local/incubator-joshua/experiments/fisher_callhome_experiment/6/alignments/0/word-align.conf > Cannot create directory: alignments/0 > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
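The NumberFormatException in the trace above is the standard behaviour of Integer.parseInt for any string containing a space, so the option value that reached the parser was presumably two tokens ("5 5") rather than one. A minimal sketch of that behaviour, independent of the Berkeley aligner's own OptionsParser (the class and method names here are hypothetical, for illustration only):

```java
public class OptionValueSketch {

    // Hypothetical helper: returns true if the raw option text parses
    // as a single base-10 integer, false if Integer.parseInt rejects it.
    static boolean isSingleInt(String raw) {
        try {
            Integer.parseInt(raw);
            return true;
        } catch (NumberFormatException e) {
            // Thrown for any non-digit character, including an embedded space.
            return false;
        }
    }

    public static void main(String[] args) {
        System.out.println(isSingleInt("5"));   // prints "true"
        System.out.println(isSingleInt("5 5")); // prints "false"
    }
}
```

The same reasoning would apply to the enum errors ('MODEL1 HMM', 'JOINT JOINT'): each value appears to hold two tokens where the parser expects exactly one.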
[GitHub] incubator-joshua pull request #46: This update merges from master with threa...
Github user asfgit closed the pull request at: https://github.com/apache/incubator-joshua/pull/46 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enabled but not working, please contact infrastructure at infrastruct...@apache.org or file a JIRA ticket with INFRA. ---
[jira] [Commented] (JOSHUA-285) Not all RuntimeExceptions are caught
[ https://issues.apache.org/jira/browse/JOSHUA-285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15446080#comment-15446080 ] Kellen Sunderland commented on JOSHUA-285: -- This is fixed in PR https://github.com/apache/incubator-joshua/pull/45 . Any uncaught exception will now be propagated from the threadpool thread that it occurs on, back to the main thread that is iterating over translation results. The main thread can have control over how to handle these failures, but they will likely be fatal. In the case of the CLI tool for example we can just crash with a stack trace. There's also a test specifically causing a runtime exception on a worker thread and ensuring that it propagates to the main response thread. > Not all RuntimeExceptions are caught > > > Key: JOSHUA-285 > URL: https://issues.apache.org/jira/browse/JOSHUA-285 > Project: Joshua > Issue Type: Bug >Reporter: Matt Post >Assignee: Kellen Sunderland > Fix For: 6.1 > > > In many instances Joshua threads will throw a RuntimeException that is not > caught, causing the decoder to hang indefinitely. These should be caught and, > if serious enough, cause the decoder to die. An example of an error that is > caught is running out of memory. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
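The propagation described in the JOSHUA-285 comment above can be sketched with the JDK's own executor classes. This is a minimal illustration under stated assumptions, not the code from PR #45: PropagationSketch, translate and translateAll are hypothetical names, and the "translation" is a stand-in that simply upper-cases its input. The point it shows is that a RuntimeException thrown on a pool thread is captured by the Future and resurfaces on the thread that calls get().

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class PropagationSketch {

    // Hypothetical stand-in for decoding one sentence; fails on empty input.
    static String translate(String sentence) {
        if (sentence.isEmpty()) {
            throw new IllegalStateException("cannot translate empty input");
        }
        return sentence.toUpperCase();
    }

    // Submits each sentence to a fixed pool, then collects results in order
    // on the calling thread. A failure on a worker is rethrown to the caller.
    static List<String> translateAll(List<String> sentences, int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<String>> pending = new ArrayList<>();
            for (String s : sentences) {
                Callable<String> task = () -> translate(s);
                pending.add(pool.submit(task));
            }
            List<String> results = new ArrayList<>();
            for (Future<String> f : pending) {
                try {
                    results.add(f.get());
                } catch (ExecutionException e) {
                    // getCause() is the exception originally thrown on the worker.
                    throw new RuntimeException("worker thread failed", e.getCause());
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new RuntimeException("interrupted while waiting", e);
                }
            }
            return results;
        } finally {
            pool.shutdownNow();
        }
    }

    public static void main(String[] args) {
        System.out.println(translateAll(Arrays.asList("hello", "world"), 2));
        try {
            translateAll(Arrays.asList("ok", ""), 2);
        } catch (RuntimeException e) {
            System.out.println("caught on main thread: " + e.getCause().getMessage());
        }
    }
}
```

In this sketch the caller decides what to do with the rethrown exception; a command-line tool could let it escape and exit with a stack trace, as the comment suggests.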
[GitHub] incubator-joshua pull request #46: The update merges from master and applies...
GitHub user KellenSunderland opened a pull request: https://github.com/apache/incubator-joshua/pull/46 The update merges from master and applies some refactoring. This PR has the merge from master with the Threading change. On top of that merge I've got one commit that reduces the number of constructors for JoshuaDecoder and gets rid of a bunch of now unused arguments. Because I've removed these arguments, this is a breaking change. I'm assuming this is ok for the 7 branch? I didn't include the breaking change in the pull on the master branch. You can merge this pull request into a Git repository by running: $ git pull https://github.com/KellenSunderland/incubator-joshua 7 Alternatively you can review and apply these changes as the patch at: https://github.com/apache/incubator-joshua/pull/46.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #46 commit 5ed36b09484ec18fe00a8cb20af9d0e1ba1ca4e7 Author: Lewis John McGibbney Date: 2016-08-24T02:36:09Z Update examples README formatting and links. 
commit 0744ebf56906dbe70292737cd50a39652407869d Author: Lewis John McGibbney Date: 2016-08-24T03:01:06Z Update examples README pipeline invocation parameters commit 762d588efc820cc7e6b98fce454b9254d8d15518 Author: Lewis John McGibbney Date: 2016-08-28T01:17:36Z Add Brew install badge to README commit d1c9c074544d72ee5335bdd83fe415b45098ab08 Author: Kellen Sunderland Date: 2016-08-29T08:08:10Z JOSHUA-285 JOSHUA-296 Refactored threading in order to properly propagate failures and remove custom code commit 0bb293295e3670c7449815941566578facd247e9 Author: Kellen Sunderland Date: 2016-08-29T11:53:53Z Renamed DecoderThread to DecoderTask commit 6d8f684836ddc25e40bb32f91d24d3f6e5eb745b Author: Kellen Sunderland Date: 2016-08-29T11:58:52Z Renamed Translations class to TranslationResponseStream commit 2a458c04f644861fccc3db521579e72ef335c656 Author: Kellen Sunderland Date: 2016-08-29T13:46:35Z Merge remote-tracking branch 'origin' into 7 commit 0e87046c9aa933b365638a88dce694b938ab84ef Author: Kellen Sunderland Date: 2016-08-29T13:55:50Z Merge branch 'master' of https://github.com/KellenSunderland/incubator-joshua into 7 commit f90cf3e4d947b32a27ef8f3d5f51d06677fd8d65 Author: Kellen Sunderland Date: 2016-08-25T09:50:43Z Addressed todos from previous commits. Refactored entry points in Joshua to remove redundancy, this is a breaking change
[GitHub] incubator-joshua pull request #45: Refactored the threading code in the Josh...
GitHub user KellenSunderland opened a pull request: https://github.com/apache/incubator-joshua/pull/45 Refactored the threading code in the JoshuaDecoder class. These commits do some refactoring and reworking of the threading code in JoshuaDecoder. The principal motivation for this refactor is to reduce our reliance on custom code and use libraries instead. It should also improve readability and reduce the potential for bugs, as we've removed a large chunk of fairly complex code. Note: I've seen some test errors caused by static Vocab issues with my new test. I will look into fixing this (and will fix it right away if required for this PR). You can merge this pull request into a Git repository by running: $ git pull https://github.com/KellenSunderland/incubator-joshua master Alternatively you can review and apply these changes as the patch at: https://github.com/apache/incubator-joshua/pull/45.patch To close this pull request, make a commit to your master/trunk branch with (at least) the following in the commit message: This closes #45 commit d1c9c074544d72ee5335bdd83fe415b45098ab08 Author: Kellen Sunderland Date: 2016-08-29T08:08:10Z JOSHUA-285 JOSHUA-296 Refactored threading in order to properly propagate failures and remove custom code commit 0bb293295e3670c7449815941566578facd247e9 Author: Kellen Sunderland Date: 2016-08-29T11:53:53Z Renamed DecoderThread to DecoderTask commit 6d8f684836ddc25e40bb32f91d24d3f6e5eb745b Author: Kellen Sunderland Date: 2016-08-29T11:58:52Z Renamed Translations class to TranslationResponseStream