[Moses-support] removing identical pairs in phrase table

2013-11-05 Thread Andrew Shin
Hi,
I'm using Moses in dialogue setting, where both source and target language are 
in English.This results in parroting back of the input, so I need to remove 
phrase-pairs where one phrase is a substring of the other. That'll require 
editing the phrase table after GIZA++ but before Moses.Is there a way to 
automate this? If not, how can I access the phrase table generated by GIZA++ 
and run Moses on that after editing?
I appreciate your time in advance.___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] unsolved) Moses eating up too much memory

2013-10-29 Thread Andrew Shin
I apologize since my previous question on the same topic didn't specify 
enough.I'm following the steps in http://www.statmt.org/moses/?n=Moses.Baseline 
exactly the same way,and when I try to run Moses with the following 
command:~/mosesdecoder/bin/moses -f ~/working/mert-work/moses.ini (often with 
-output-hypo-score option)it starts eating up RAM until there's no space and 
starts taking hard disk space as swap memory,while it says its reading from 
phrase table.It went up to 17GB of swap memory and pretty much stopped.I 
binarised the phrase table and reordering table, and the following are the 
sizes of the binarised files-rw-r--r--   1 ipadmin  staff   856K Oct 28 10:45 
phrase-table.binphr.idx-rw-r--r--   1 ipadmin  staff   140M Oct 28 10:45 
phrase-table.binphr.srctree.wa-rw-r--r--   1 ipadmin  staff   1.6M Oct 28 10:45 
phrase-table.binphr.srcvoc-rw-r--r--   1 ipadmin  staff   1.2G Oct 28 10:45 
phrase-table.binphr.tgtdata.wa-rw-r--r--   1 ipadmin  staff   1.5M Oct 28 10:45 
phrase-table.binphr.tgtvoc-rw-r--r--   1 ipadmin  staff   856K Oct 28 11:00 
reordering-table.binlexr.idx-rw-r--r--   1 ipadmin  staff   1.3G Oct 28 11:00 
reordering-table.binlexr.srctree-rw-r--r--   1 ipadmin  staff   916M Oct 28 
11:00 reordering-table.binlexr.tgtdata-rw-r--r--   1 ipadmin  staff   1.6M Oct 
28 11:00 reordering-table.binlexr.voc0-rw-r--r--   1 ipadmin  staff   1.5M Oct 
28 11:00 reordering-table.binlexr.voc1I also attached moses.ini which I tried 
to run.I'm very sure that this is not the way it should be, but can't figure 
out what is wrong since I followed the tutorial above..(and it's worked before) 
Please let me know if there is any additional information I should provide.I 
sincerely thank you very much for your time.
  

moses.ini
Description: Binary data
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] multi-threaded tuning not working

2013-10-28 Thread Andrew Shin
Hi,
I've done tuning without the multi-core option,and it took about 20 hours for 
650,000 lines of input files, and 5000 lines of tuning reference.I tried tuning 
with the multi-core option as--decoder-flags=-threads 4but it doesn't 
initiate anything.I have macbook pro with 2.4 GHz i5 dual core.Is there any 
known issue with this option?
Thank you for your help in advance.
  ___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] Running moses fills up hard disk space..

2013-10-27 Thread Andrew Shin
Hello,

 I'm running Moses after binarising it,and while it's reading from phrase 
table, it rapidly fills up hard disk space until there is no more space..I had 
21GB and it wasn't enough..Once I quit terminal, the space is freed again..I 
successfully ran Moses before, so I don't understand why this happens..Is this 
natural? If so, how much reserve space do I need?and If not, what could have 
gone wrong?
Thank you very much for your help in advance.   
  ___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] getting WER metrics

2013-10-27 Thread Andrew Shin
Hello,sorry to ask another question..
I've done getting BLEU score in the past following the baseline tutorial,but is 
there a way to also get WER given a reference text?
Thank you very much for your help.___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] a few supposedly simple questions..

2013-10-25 Thread Andrew Shin
Dear support team,
 thank you for your previous reply which worked out for me.I have a few 
questions which I think should be simple but couldn't find relevant information 
on the website.
1) When you run Moses and type in a sentence, is there any way you could have 
the translation withthe corresponding probability?
2) Also when you run and type in a sentence, is there a way to have not just 
one translation,but N-best candidates? (preferably with corresponding 
probabilities, which was the first question..)
3) I've done getting BLEU score using moses, but is there a way to also get 
word error rate to a reference?
4) After cleaning process, moses shows the number of lines in input and output 
text files,but I noticed that number of lines in output file decreased about 
5%,resulting in non-matching number of linesfor input and output.Looking at the 
translation results, it seems like it worked fine somehow, but it gets me 
concerned.Why is it, and does it affect the line-match of input-output and the 
training process?

I truly appreciate your help in advance.
best,Andrew
  ___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] error on training

2013-09-15 Thread Andrew Shin
Hello, 

I'm a novice to Moses, so I'm following the Baseline System page 
(http://www.statmt.org/moses/?n=Moses.Baseline) and got stuck at Training the 
Translation System part.When I enter the command starting with nohup nice  as 
in the instruction,I only get a file named training.out and no new 
directory.The training.out file says:Using SCRIPTS_ROOTDIR: 
/Users/ipadmin/Desktop/mosesdecoder/scriptsUsing single-thread GIZAERROR: use 
--f to specify foreign language at 
/Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 368.

So I've then tried the new way in the Training page 
(http://www.statmt.org/moses/?n=FactoredTraining.HomePage) with modification on 
euro and the language:train-model.perl -root-dir . --corpus corpus/euro --f de 
--e enand I get errors as following:Using SCRIPTS_ROOTDIR: 
/Users/ipadmin/Desktop/mosesdecoder/scriptsUse of uninitialized value 
$_EXTERNAL_BINDIR in concatenation (.) or string at 
/Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
236.Use of uninitialized value $_EXTERNAL_BINDIR in concatenation (.) or string 
at /Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
237.Use of uninitialized value $_EXTERNAL_BINDIR in concatenation (.) or string 
at /Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
244.Use of uninitialized value $_EXTERNAL_BINDIR in concatenation (.) or string 
at /Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
245.Use of uninitialized value $_EXTERNAL_BINDIR in concatenation (.) or string 
at /Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
247.Using single-thread GIZAUse of uninitialized value $_EXTERNAL_BINDIR in 
concatenation (.) or string at 
/Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 
335.ERROR: Cannot find mkcls, GIZA++/mgiza,  snt2cooc.out/snt2cooc in .You 
MUST specify the parameter -external-bin-dir at 
/Users/ipadmin/Desktop/mosesdecoder/scripts/training/train-model.perl line 335.

I'm working on MacOSX 10.7 have successfully installed Moses and its 
prerequisites, and Giza++.
I know this might be a stupid-user problem, but I will greatly appreciate it if 
you could give me a help.Thank you for your time.


Sincerely,Andrew  ___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] question

2013-09-15 Thread Andrew Shin



Hello,

I'm currently following the baseline system page 
(http://www.statmt.org/moses/?n=Moses.Baseline) and currently at the Training 
the Translation system part.I executed the command as in the tutorial, but the 
process finished instantly instead of supposed 1~2 hours,and instead of 
working/train/model folder with moses.ini file in it,I have 
working/train/corpus folder with files named en.vcb.classes, fr.vcb.classes, 
en.vcb.classes.cats, fr.vcb.classes.cats, en.vcb, fr.vcb, en-fr-int-train.snt, 
fr-en-int-train.snt.(Btw, I had to repeat the command a few times to get these 
files. On the first run, there were only two files, and then next time two 
more, and so on.)
So it seems like something is being done, but I'm not getting moses.ini 
file..I've attached my training.out in case it helps..
This might be a tedious question to look through, but please help me out..Thank 
you very much.

sincerely,Andrew



  

training.out
Description: Binary data
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support