You are right, that seems to be a mistake. "-threads" should not be
specified twice. Does anyone here speak EMS?
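
A quick way to spot this kind of duplication in a long EMS command line is to tokenize it and list every flag that appears more than once. The sketch below is only an illustration: `repeated_flags` is a hypothetical helper, not part of Moses or EMS, and the command string is a shortened form of the binarizer call from the log with placeholder paths. It assumes a flag is any token that starts with "-" followed by a letter. In the log below, the later value ("-threads 1") appears to be the one that took effect, since the tool reports "Running with 1 threads".

```python
import shlex
from collections import defaultdict


def _is_flag(token):
    # Assumption: a flag is "-" followed by a letter (so "-1" counts as a value).
    return len(token) > 1 and token[0] == "-" and token[1].isalpha()


def repeated_flags(command):
    """Return {flag: [values...]} for every flag that occurs more than once."""
    tokens = shlex.split(command)
    seen = defaultdict(list)
    for i, tok in enumerate(tokens):
        if not _is_flag(tok):
            continue
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        # Record the following token as the value unless it is itself a flag.
        seen[tok].append(nxt if nxt is not None and not _is_flag(nxt) else None)
    return {flag: vals for flag, vals in seen.items() if len(vals) > 1}


# Shortened binarizer call; "pt.sorted" and "pt" are placeholder paths.
cmd = "processPhraseTableMin -threads 8 -in pt.sorted -out pt -nscores 4 -threads 1"
print(repeated_flags(cmd))  # {'-threads': ['8', '1']}
```
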
On 28.02.2016 at 09:51, Vincent Nguyen wrote:
Marcin (or others, since it relates to EMS),
Look at my TRAINING_binarize-config.7.STDERR file below.
I have highlighted the important parts in bold.
I am running the binarizer in EMS with -threads 8,
but in the detailed execution, in the command line that starts with gzip,
I also see a "-threads 1" parameter.
Is this normal?
Thanks,
Vincent
Executing:
/netshr/mosesdecoder/scripts/training/filter-model-given-input.pl
/netshr/working-fr-en/model/moses.bin.ini.7.tables
/netshr/working-fr-en/model/moses.ini.7 /dev/null -nofilter
-Binarizer "/netshr/mosesdecoder/bin/processPhraseTableMin *-threads 8*"
Executing: mkdir -p /netshr/working-fr-en/model/moses.bin.ini.7.tables
Stripping XML...
Executing:
/netshr/mosesdecoder/scripts/training/../generic/strip-xml.perl <
/dev/null > /netshr/working-fr-en/model/moses.bin.ini.7.tables/input.49617
pt:PhraseDictionaryMemory name=TranslationModel0 num-features=4
path=/netshr/working-fr-en/model/phrase-table.7 input-factor=0
output-factor=0
Considering factor 0
ro:LexicalReordering name=LexicalReordering0 num-features=6
type=wbe-msd-bidirectional-fe-allff input-factor=0 output-factor=0
path=/netshr/working-fr-en/model/reordering-table.7.wbe-msd-bidirectional-fe.gz
Considering factor 0
Filtering files...
filtering /netshr/working-fr-en/model/phrase-table.7 ->
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1...
Executing: ln -s /netshr/working-fr-en/model/phrase-table.7.gz
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz
binarizing...
*Executing: gzip* -cd
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz
| LC_ALL=C sort -T /netshr/working-fr-en/model/moses.bin.ini.7.tables
>
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz.sorted
&& /netshr/mosesdecoder/bin/processPhraseTableMin *-threads 8* -in
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz.sorted
-out
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1 -nscores
4 *-threads 1* && rm
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz.sorted
Used options:
Text phrase table will be read from:
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.gz.sorted
Output phrase table will be written to:
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.minphr
Step size for source landmark phrases: 2^10=1024
Source phrase fingerprint size: 16 bits / P(fp)=1.52588e-05
Selected target phrase encoding: Huffman + PREnc
Maxiumum allowed rank for PREnc: 100
Number of score components in phrase table: 4
Single Huffman code set for score components: no
Using score quantization: no
Explicitly included alignment information: yes
*Running with 1 threads*
Pass 1/3: Creating hash function for rank assignment
..................................................[5000000]
..................................................[10000000]
..................................................[15000000]
..................................................[20000000]
..................................................[25000000]
...........................................
Pass 2/3: Creating source phrase index + Encoding target phrases
..................................................[5000000]
..................................................[10000000]
..................................................[15000000]
..................................................[20000000]
..................................................[25000000]
...........................................
Intermezzo: Calculating Huffman code sets
Creating Huffman codes for 94477 target phrase symbols
Creating Huffman codes for 211655 scores
Creating Huffman codes for 9108470 scores
Creating Huffman codes for 167461 scores
Creating Huffman codes for 7856800 scores
Creating Huffman codes for 26 alignment points
Pass 3/3: Compressing target phrases
..................................................[5000000]
..................................................[10000000]
..................................................[15000000]
..................................................[20000000]
..................................................[25000000]
...........................................
Saving to
/netshr/working-fr-en/model/moses.bin.ini.7.tables/phrase-table.0-0.1.1.minphr
Done
filtering
/netshr/working-fr-en/model/reordering-table.7.wbe-msd-bidirectional-fe.gz
->
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe...
Executing: ln -s
/netshr/working-fr-en/model/reordering-table.7.wbe-msd-bidirectional-fe.gz
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz
binarizing...
*Executing: gzip* -cd
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz
| LC_ALL=C sort -T /netshr/working-fr-en/model/moses.bin.ini.7.tables
>
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz.sorted
&& /netshr/mosesdecoder/bin/processLexicalTableMin *-threads 8* -in
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz.sorted
-out
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe
*-threads 1* && rm
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz.sorted
Used options:
Text reordering table will be read from:
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.gz.sorted
Output reordering table will be written to:
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.minlexr
Step size for source landmark phrases: 2^10=1024
Phrase fingerprint size: 16 bits / P(fp)=1.52588e-05
Single Huffman code set for score components: no
Using score quantization: no
*Running with 1 threads*
Pass 1/2: Creating phrase index + Counting scores
..................................................[5000000]
..................................................[10000000]
..................................................[15000000]
..................................................[20000000]
..................................................[25000000]
..................................................[30000000]
..................................................[35000000]
..................................................[40000000]
..................................................[45000000]
..................................................[50000000]
..................................................[55000000]
..................................................[60000000]
..................................................[65000000]
.............................................
Intermezzo: Calculating Huffman code sets
Creating Huffman codes for 38763 scores
Creating Huffman codes for 22686 scores
Creating Huffman codes for 38050 scores
Creating Huffman codes for 38284 scores
Creating Huffman codes for 21654 scores
Creating Huffman codes for 37554 scores
Pass 2/2: Compressing scores
..................................................[5000000]
..................................................[10000000]
..................................................[15000000]
..................................................[20000000]
..................................................[25000000]
..................................................[30000000]
..................................................[35000000]
..................................................[40000000]
..................................................[45000000]
..................................................[50000000]
..................................................[55000000]
..................................................[60000000]
..................................................[65000000]
.............................................
Saving to
/netshr/working-fr-en/model/moses.bin.ini.7.tables/reordering-table.7.wbe-msd-bidirectional-fe.minlexr
Done
Executing: rm -f /netshr/working-fr-en/model/moses.bin.ini.7; ln -s
/netshr/working-fr-en/model/moses.bin.ini.7.tables/moses.ini
/netshr/working-fr-en/model/moses.bin.ini.7
On 17/02/2016 23:16, Marcin Junczys-Dowmunt wrote:
I just checked, and it's really weirdly slow now. Apparently using more than
4 threads is a bad idea, but 4 threads seem to be about 2 times faster
than just one. I remember that it used to work better. Maybe it is because I
don't have tcmalloc linked?
On 17.02.2016 23:07, Marcin Junczys-Dowmunt wrote:
It is, just not very well done. It generally does not make sense to have
more than 8-10 threads. That should however be somewhat faster than only
a single thread.
On 17.02.2016 22:44, Vincent Nguyen wrote:
I have the feeling it's not.
_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support