Hi Jean-Luc Try using double quotes, ie
'das ist ein <n translation="yoyo">kleines</n> haus’ cheers - Barry On 04/02/13 12:46, Meunier, Jean-Luc wrote: > > Hi, > > I fail to use the xml-input flag. More precisely, the translation I > provide in the XML markup is ignored (and the markup is discarded). > > Translating'das ist ein <n translation='yoyo'>kleines</n> haus’ , I > expected to obtain‘this is a yoyo house’ with the option -xml-input > exclusive (I also tried using the historical ‘english’ XML attribute) > > Can someone tell me what I do wrong or explain what is going on? > > I tried with the sample_model discussed in the user guide p 21 > (http://www.statmt.org/moses/download/sample-models.tgz ) and a model > of mine as well. > > I’m using the Cygwin pre-compiled version of Moses 1.0 downloaded on > Jan 29^th . BTW is there a way to have the decoder showing its version? > > Thank you! > > JL > > echo 'das ist ein <n translation='yoyo'>kleines</n> haus' | > /c/moses10/bin/moses -f phrase-model/moses.ini -xml-input exclusive > > Defined parameters (per moses.ini or switch): > > config: phrase-model/moses.ini > > input-factors: 0 > > lmodel-file: 8 0 3 lm/europarl.srilm.gz > > mapping: T 0 > > n-best-list: nbest.txt 100 > > ttable-file: 0 0 0 1 phrase-model/phrase-table > > ttable-limit: 10 > > weight-d: 1 > > weight-l: 1 > > weight-t: 1 > > weight-w: 0 > > xml-input: exclusive > > /c/moses10/bin > > ScoreProducer: Distortion start: 0 end: 1 > > ScoreProducer: WordPenalty start: 1 end: 2 > > ScoreProducer: !UnknownWordPenalty start: 2 end: 3 > > Loading lexical distortion models...have 0 models > > Start loading LanguageModel lm/europarl.srilm.gz : [0.000] seconds > > ScoreProducer: LM start: 3 end: 4 > > Loading the LM will be faster if you build a binary file. > > Reading lm/europarl.srilm.gz > > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 > > > > **The ARPA file is missing <unk>. Substituting log10 probability > -100.000. > > ************************************************************************************************** > > > > Finished loading LanguageModels : [1.061] seconds > > Start loading PhraseTable phrase-model/phrase-table : [1.061] seconds > > filePath: phrase-model/phrase-table > > ScoreProducer: PhraseModel start: 4 end: 5 > > Finished loading phrase tables : [1.061] seconds > > Start loading phrase table from phrase-model/phrase-table : [1.061] > seconds > > Reading phrase-model/phrase-table > > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 > > > > **************************************************************************************************** > > > > Finished loading phrase tables : [1.063] seconds > > IO from STDOUT/STDIN > > Created input-output object : [1.063] seconds > > Translating line 0 in thread id 0x80047030 > > Translating: das ist ein kleines haus > > Line 0: Collecting options took 0.000 seconds > > Line 0: Search took 0.002 seconds > > this is a small house > > BEST TRANSLATION: this is a small house [11111] [total=-28.923] > core=(0.000,-5.000,0.000,-27.091,-1.833) > > Line 0: Translation took 0.007 seconds total > > user 1.045 > > sys 0.031 > > VmRSS: 34560 kB > > echo 'das ist ein <n english='yoyo'>kleines</n> haus' | > /c/moses10/bin/moses -f phrase-model/moses.ini -xml-input exclusive > > Defined parameters (per moses.ini or switch): > > config: phrase-model/moses.ini > > input-factors: 0 > > lmodel-file: 8 0 3 lm/europarl.srilm.gz > > mapping: T 0 > > n-best-list: nbest.txt 100 > > ttable-file: 0 0 0 1 phrase-model/phrase-table > > ttable-limit: 10 > > weight-d: 1 > > weight-l: 1 > > weight-t: 1 > > weight-w: 0 > > xml-input: exclusive > > /c/moses10/bin > > ScoreProducer: Distortion start: 0 end: 1 > > ScoreProducer: WordPenalty start: 1 end: 2 > > ScoreProducer: !UnknownWordPenalty start: 2 end: 3 > > Loading lexical distortion models...have 0 models > > Start loading LanguageModel lm/europarl.srilm.gz : [0.000] seconds > > ScoreProducer: LM start: 3 end: 4 > > Loading the LM will be faster if you build a binary file. > > Reading lm/europarl.srilm.gz > > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 > > > > **The ARPA file is missing <unk>. Substituting log10 probability > -100.000. > > ************************************************************************************************** > > > > Finished loading LanguageModels : [1.050] seconds > > Start loading PhraseTable phrase-model/phrase-table : [1.050] seconds > > filePath: phrase-model/phrase-table > > ScoreProducer: PhraseModel start: 4 end: 5 > > Finished loading phrase tables : [1.050] seconds > > Start loading phrase table from phrase-model/phrase-table : [1.051] > seconds > > Reading phrase-model/phrase-table > > ----5---10---15---20---25---30---35---40---45---50---55---60---65---70---75---80---85---90---95--100 > > > > **************************************************************************************************** > > > > Finished loading phrase tables : [1.052] seconds > > IO from STDOUT/STDIN > > Created input-output object : [1.052] seconds > > Translating line 0 in thread id 0x80047030 > > Translating: das ist ein kleines haus > > Line 0: Collecting options took 0.000 seconds > > Line 0: Search took 0.002 seconds > > this is a small house > > BEST TRANSLATION: this is a small house [11111] [total=-28.923] > core=(0.000,-5.000,0.000,-27.091,-1.833) > > Line 0: Translation took 0.008 seconds total > > user 1.060 > > sys 0.015 > > VmRSS: 34560 kB > > exclusive Only the XML-specified translation is used for the input > phrase. Any phrases > > from the phrase table that overlap with that span are ignored. > > *Jean-Luc Meunier****│**Senior Research Engineer **│****Xerox Research > Centre Europe**│**6 chemin de Maupertuis 38240 MEYLAN **│**+33 (0)4 76 > 61 50 18* > > > > _______________________________________________ > Moses-support mailing list > [email protected] > http://mailman.mit.edu/mailman/listinfo/moses-support -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
