Hi Nick,

for

/home/lingua/smt/irstlm/bin/compile-lm complete_fr.truecased_unique_tok_clean.lm --eval /home/lingua/Patricia/Corpora/Corpora_Eval/devtest/nc-test2007.truecased.tok.fr

I get the following output:

inpfile: complete_fr.truecased_unique_tok_clean.lm
loading up to the LM level 1000 (if any)
dub: 10000000
Language Model Type of complete_fr.truecased_unique_tok_clean.lm is 1
\data\
loadtxt_ram()
1-grams: reading 1091677 entries
done level1
2-grams: reading 13524189 entries
..done level2
3-grams: reading 23061450 entries
....done level3
done
OOV code is 218081
OOV code is 218081
Start Eval
OOV code: 218081
%% Nw=58714 PP=201.88 PPwp=5.70 Nbo=19233 Noov=105 OOV=0.18%
lmtable class statistics
levels 3
lev 1 entries 1091677 used mem 15.62Mb
lev 2 entries 13524189 used mem 193.47Mb
lev 3 entries 23061450 used mem 153.95Mb
total allocated mem 363.03Mb
total number of get and binary search calls
level 1 get: 3042 bsearch: 0
level 2 get: 58713 bsearch: 23178875
level 3 get: 58712 bsearch: 55672
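As a quick sanity check on the summary line in the output above, here is a minimal Python sketch that parses the KEY=VALUE fields and recomputes the OOV rate. It assumes that the reported OOV percentage is simply Noov divided by Nw, which the numbers here are consistent with; the function name parse_eval_summary is made up for illustration and is not part of IRSTLM.

```python
import re

# Summary line copied verbatim from the compile-lm --eval output above.
SUMMARY = "%% Nw=58714 PP=201.88 PPwp=5.70 Nbo=19233 Noov=105 OOV=0.18%"


def parse_eval_summary(line):
    """Return the KEY=VALUE fields of the summary line as a dict of floats.

    Hypothetical helper for illustration only; not an IRSTLM API.
    """
    fields = {}
    for key, value in re.findall(r"(\w+)=([0-9.]+)", line):
        fields[key] = float(value)
    return fields


stats = parse_eval_summary(SUMMARY)

# Assumption: the OOV percentage is Noov / Nw, expressed in percent.
oov_rate = 100.0 * stats["Noov"] / stats["Nw"]
print(f"OOV rate recomputed from Noov/Nw: {oov_rate:.2f}%")
```

The recomputed rate rounds to the same 0.18% that compile-lm reports, so the text (ARPA) model loads and evaluates consistently; the failure is specific to loading the binarized file.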
For

/home/lingua/smt/irstlm/bin/compile-lm complete_fr.truecased_unique_tok_clean.blm --eval /home/lingua/Patricia/Corpora/Corpora_Eval/devtest/nc-test2007.truecased.tok.fr

I get the same error as in the phrase training:

inpfile: complete_fr.truecased_unique_tok_clean.blm
loading up to the LM level 1000 (if any)
dub: 10000000
Language Model Type of complete_fr.truecased_unique_tok_clean.blm is 1
blmt
loadbin()
lmtable::loadbin_dict()
dictionary::loadtxt wrong entry was found (0) in position 1

Best,
Patricia

> From: nicr...@fbk.eu
> To: patriciahelm...@hotmail.com; moses-support@mit.edu
> Subject: RE: [Moses-support] IRSTLM - Error: dictionary::loadtxt wrong entry was found (0) in position 1
> Date: Tue, 3 Jul 2012 13:29:26 +0000
>
> Hi Patricia,
>
> Could you try computing the perplexity of your binarized LM with compile-lm?
>
> First on the ARPA format (SRILM):
> /home/lingua/smt/irstlm/bin/compile-lm complete_fr.truecased_unique_tok_clean.lm --eval <text-to-eval>
>
> and then on the binarized version (before your symbolic link):
> /home/lingua/smt/irstlm/bin/compile-lm complete_fr.truecased_unique_tok_clean.blm --eval <text-to-eval>
>
> It might be easier to debug by first looking at the direct output from IRSTLM.
>
> Thanks,
> Nick
>
>
> ________________________________
> From: moses-support-boun...@mit.edu [moses-support-boun...@mit.edu] on behalf of Patricia Helmich [patriciahelm...@hotmail.com]
> Sent: Tuesday, July 03, 2012 3:07 PM
> To: moses-support@mit.edu
> Subject: [Moses-support] IRSTLM - Error: dictionary::loadtxt wrong entry was found (0) in position 1
>
> Hi,
> I am using Moses in combination with SRILM and IRSTLM for several language pairs.
> After building LMs with SRILM and training the phrase model, I try to translate a sentence, for example:
>
> echo "this is a small house" | /home/lingua/smt/moses/bin/moses -f model/moses.ini
>
> This works well for each language pair.
>
> Then I produce an IRSTLM binary LM for each language pair, for example:
>
> /home/lingua/smt/irstlm/bin/compile-lm complete_fr.truecased_unique_tok_clean.lm complete_fr.truecased_unique_tok_clean.blm
> ln -s complete_fr.truecased_unique_tok_clean.blm complete_fr.truecased_unique_tok_clean.blm.mm
>
> and I produce binary phrase tables and binary reordering tables:
>
> gzip -cd fr-en/f_en.e_fr/model/phrase-table.gz | LC_ALL=C sort | /home/lingua/smt/moses/bin/processPhraseTable -ttable 0 0 - -nscores 5 -out fr-en/f_en.e_fr/model/phrase-table
> gzip -cd fr-en/f_en.e_fr/model/reordering-table.wbe-msd-bidirectional-fe.gz | LC_ALL=C sort | /home/lingua/smt/moses/bin/processLexicalTable -out fr-en/f_en.e_fr/model/reordering-table
>
> Then I create a copy of moses.ini (->moses-bin.ini) and set moses-bin.ini to use the binary files.
>
>
> Now I try to translate a sentence with:
>
> echo "this is a small house" | TMP=/tmp /home/lingua/smt/moses/bin/moses -v 2 -f model/moses-bin.ini
>
>
> This works well for each language pair, except for the language pair f: en, e: fr.
>
> The output is:
>
> Defined parameters (per moses.ini or switch):
> config: model/moses-bin.ini
> distortion-file: 0-0 wbe-msd-bidirectional-fe-allff 6 /home/lingua/Patricia/Corpora/Corpora_Biling/fr-en/f_en.e_fr/model/reordering-table
> distortion-limit: 6
> input-factors: 0
> lmodel-file: 1 0 3 /home/lingua/Patricia/Corpora/Corpora_Monoling_Complete/fr/complete_fr.truecased_unique_tok_clean.blm.mm
> mapping: 0 T 0
> ttable-file: 1 0 0 5 /home/lingua/Patricia/Corpora/Corpora_Biling/fr-en/f_en.e_fr/model/phrase-table
> ttable-limit: 20
> verbose: 2
> weight-d: 0.3 0.3 0.3 0.3 0.3 0.3 0.3
> weight-l: 0.5000
> weight-t: 0.20 0.20 0.20 0.20 0.20
> weight-w: -1
> input type is: text input
> Loading lexical distortion models...have 1 models
> Creating lexical reordering...
> weights: 0.300 0.300 0.300 0.300 0.300 0.300
> binary file loaded, default OFF_T: -1
> Start loading LanguageModel /home/lingua/Patricia/Corpora/Corpora_Monoling_Complete/fr/complete_fr.truecased_unique_tok_clean.blm.mm : [0.000] seconds
> In LanguageModelIRST::Load: nGramOrder = 3
> Language Model Type of /home/lingua/Patricia/Corpora/Corpora_Monoling_Complete/fr/complete_fr.truecased_unique_tok_clean.blm.mm is 1
> blmt
> loadbin()
> lmtable::loadbin_dict()
> dictionary::loadtxt wrong entry was found (0) in position 1
>
> I don't understand the reason for this error. Could you help me with this problem?
>
> Thank you,
> Patricia
>
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support