Re: [Moses-support] bleu-annotation / analysis.perl

2016-03-05 Thread Vincent Nguyen

Thanks Phil.
I figured out for the lowercase thing thanks.

For the short n-grams this is not exactly what I meant.

Any 1-gram sentence will give either 1 when exact match or 0.8409 when 
different match

Any 2-gram sentence will give 1 when exact match or 0.7598 when 1 word match


My point is that this is a twist of the BLEU+ algorithm.
it is supposed to avoid to get a 0 score when there is no match under or 
equal 4-gram (because of the geometric mean)

but it twists the scores of short segments giving a much too high score.

By the way :
The reason I am looking into this is that I am using sentence-level bleu 
to filter some noisy corpora.
For instance, out of 6 million sentences I will keep only the sentences 
with a Bleu score > XX to avoid keeping misaligned segments.




Le 04/03/2016 21:59, Philipp Koehn a écrit :

Hi,

this BLEU calculation happens in the function bleu_annotation in lines 
224ff in scripts/ems/support/analysis.perl


You could convert the system translation $system and the reference 
translations to $REFERENCE[$i] to lowercase (lc) if you prefer that.


The code suggests that n-gram precision for sentences of length < n is 
treated as 100% - which may be not what you want, but it is a 
degenerate case, so treating it is a bit undefined.


-phi

On Sat, Feb 27, 2016 at 6:20 AM, > wrote:


Ok obviously this is a modified bleu+ algorithm, similar to what
sentence-bleu does.
However I believe this is still not right for unigram sentences.





De : "Vincent Nguyen"
Date : 26 févr. 2016 22:21:59
A : moses-support@mit.edu 

Sujet : Re: [Moses-support] bleu-annotation / analysis.perl



Am I correct saying that when sentences length is less or equal to 4
tokens then the BLEU score should be 1 for exact matches and 0
when not
exact match ?
(by definition of http://www1.cs.columbia.edu/nlp/sgd/bleu.pdf)


Le 26/02/2016  10:02, Vincent Nguyen a écrit :
> Hi,
>
> I would like to understand better the analysis.perl script that
> generates the bleu-annotation file.
>
> Is there an easy way to get the uncased bleu score of each line
instead
> of the cased calculation ?
> Am I right that this script recompute its own Bleu score without
calling
> the Nist-Bleu nor Multi-Bleu external scripts ?
>
>
> Also I find it strange sometimes when there is only one or two
words :
>
> Translation / reference / score
> Contents / Content / 0.8409
> Ireland / Irish / 0.8409
> Issuer / Italie / 0.8409
> PT / US / 0.8409
> .
> and so on, two words, unrelated will always generate similar
0.8409 scores.
>
> for 2-grams
> Very strong / Very high / 0.7598
> Public sector / Public Sector / 0.7598
> However : / But : / 0.7598
>
> so, for 2-grams, when one word only is good it will generate a
score of
> 0.7598
>
>
> Thanks,
>
> Vincent
>
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu 
> http://mailman.mit.edu/mailman/listinfo/moses-support
___
Moses-support mailing list
Moses-support@mit.edu 
http://mailman.mit.edu/mailman/listinfo/moses-support

___
Moses-support mailing list
Moses-support@mit.edu 
http://mailman.mit.edu/mailman/listinfo/moses-support




___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] Large nohup.out file

2016-03-05 Thread Ayah ElMaghraby
When training my smt, the nohup.out file grows drastically reaching 20G because 
it is full with these errors:

Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "slipping" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "in" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "illegally" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "with" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "PLO" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "chief" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "Yasser" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
U

Re: [Moses-support] Survey on Optimization Methods for MT

2016-03-05 Thread Lane Schwartz
Excellent! Thanks, Graham.

On Sat, Mar 5, 2016 at 9:36 AM, Graham Neubig  wrote:

> Hi Moses Folks,
>
> This is not related to a support request, but Taro Watanabe and I wrote a
> Computational Linguistics survey on optimization (parameter tuning) methods
> for machine translation:
> http://www.phontron.com/paper/neubig16cl.pdf
>
> If you're interested in learning in detail about the optimization methods
> used in Moses like MERT, Mira, PRO, etc., please take a look!
>
> Graham
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>


-- 
When a place gets crowded enough to require ID's, social collapse is not
far away.  It is time to go elsewhere.  The best thing about space travel
is that it made it possible to go elsewhere.
-- R.A. Heinlein, "Time Enough For Love"
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] Large nohup.out file

2016-03-05 Thread Lane Schwartz
What command(s) are you running?

On Sat, Mar 5, 2016 at 10:06 AM, Ayah ElMaghraby 
wrote:

> When training my smt, the nohup.out file grows drastically reaching 20G
> because it is full with these errors:
>
>
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "slipping" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "in" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "illegally" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "with" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "PLO" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>
> Argument "chief" isn't numeric in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in numeric ge (>=) at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 108,  line 5077.
>
> Use of uninitialized value $ei in hash element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 114,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 117,  line 5077.
>
> Use of uninitialized value $ei in array element at
> /home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line
> 119,  line 5077.
>

[Moses-support] Survey on Optimization Methods for MT

2016-03-05 Thread Graham Neubig
Hi Moses Folks,

This is not related to a support request, but Taro Watanabe and I wrote a
Computational Linguistics survey on optimization (parameter tuning) methods
for machine translation:
http://www.phontron.com/paper/neubig16cl.pdf

If you're interested in learning in detail about the optimization methods
used in Moses like MERT, Mira, PRO, etc., please take a look!

Graham
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] Large nohup.out file

2016-03-05 Thread Ayah ElMaghraby
This command :

nohup nice ~/mosesdecoder/scripts/training/train-model.perl --target-syntax 
-root-dir train -corpus ~/Documents/trainData/mosesTree -f en -e ar 
-hierarchical -glue-grammar -max-phrase-length 5 --score-options="--GoodTuring" 
-lm 0:3:/home/ayah/Documents/lm/truecase.en-ar.blm.ar:8 -external-bin-dir 
~/mosesdecoder/tools/mgizapp -mgiza -mgiza-cpus 2 -snt2cooc snt2cooc.pl 
-parallel -sort-batch-size 253 -sort-compress gzip --giza-option 
m1=5,m2=2,mh=0,m3=3,m4=3 > training.out

Regards,
Ayah

From: Lane Schwartz
Sent: Saturday, March 5, 2016 6:29 PM
To: Ayah ElMaghraby
Cc: moses-support@mit.edu
Subject: Re: [Moses-support] Large nohup.out file

What command(s) are you running?

On Sat, Mar 5, 2016 at 10:06 AM, Ayah ElMaghraby  
wrote:
When training my smt, the nohup.out file grows drastically reaching 20G because 
it is full with these errors:
 
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "slipping" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "in" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "illegally" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "with" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "PLO" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in hash element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 114, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 117, 
 line 5077.
Use of uninitialized value $ei in array element at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 119, 
 line 5077.
Argument "chief" isn't numeric in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTranslationModel.pm line 108, 
 line 5077.
Use of uninitialized value $ei in numeric ge (>=) at 
/home/ayah/mosesdecoder/scripts/training/LexicalTr

[Moses-support] EAMT - Call for Best Thesis Award 2016

2016-03-05 Thread Lucia Specia
2016 EAMT Best Thesis Award

The European Association for Machine Translation (EAMT, http://www.eamt.org)
is an organization that serves the growing community of people interested
in MT and translation tools, including users, developers, and researchers
of this increasingly viable technology.

The EAMT invites entries for its fifth EAMT Best Thesis Award for a PhD or
equivalent thesis on a topic related to machine translation.
Eligibility

Researchers who

   -

   have completed a PhD (or equivalent) thesis on a relevant topic in a
   European, Northern African or Middle Eastern institution within calendar
   year 2015 and
   -

   have not previously won another international award for that thesis, are
   invited to submit their theses to the EAMT for consideration.

Panel

The submissions will be judged by a panel of experts who will be
specifically appointed as part of the EAMT 2016 programme committee and
which will be ratified by the Executive Board of the EAMT.
Selection criteria

Each thesis will be judged according to how challenging the problem was, to
how relevant the results are for machine translation as a field, and to the
strength of their impact in terms of scientific publications.
Scope

The scope of the thesis need not be confined to a technical area, and
applications are also invited from students who carried out their research
into commercial and management aspects of machine translation.

Possible areas of research include:

   -

   development of machine translation or advanced computer-assisted
   translation : software and resources
   -

   machine translation for less-resourced languages
   -

   the use of these systems in professional environments (freelance
   translators, translation agencies, localisation, etc.)
   -

   the increasing impact of machine translation on non-professional
   Internet users and its impact in communications, social networking, etc.
   -

   spoken language translation
   -

   the integration of machine translation and translation memory systems
   -

   the integration of machine translation software in larger IT applications
   -

   the evaluation of machine translation systems in real tasks such as
   those above
   -

   the cross-fertilisation between machine translation and other language
   technologies

Prize

The winner will be announced at the same time as accepted papers for the
19th Annual Conference of the European Association for Machine Translation
EAMT 2016 (Riga, Latvia, May 30-June 1 2016), and will receive a prize of
€500, together with an inscribed certificate. The recipient of the award
will be required to briefly present their research at EAMT 2015. In order
to facilitate this, the EAMT will waive the winner's registration costs,
and will make available a travel bursary of €200 to enable the recipient of
the award to attend the said conference. The prize includes complmentary
membership in the EAMT for 2016 and 2017.
Submission

Candidates will submit using EasyChair:
https://easychair.org/conferences/?conf=eamt2016, a single PDF file
containing, in this order:

   -

   a 2-page summary of your thesis in English, containing:
   -

  your full contact details,


   -

  the name and contact details of your supervisor(s),
  -

   a copy of your CV in English (at most one page, plus a complete list of
   publications directly related to the thesis)
   -

   an electronic copy of your thesis
   -

   optionally, an appendix with any other relevant information on the thesis


By submitting their work, authors

   -

   agree that, in case they are granted the award, any subsequently
   published version of the thesis should carry the citation "Winner of the
   2015 European Association for Machine Translation Best Thesis Award" and
   -

   acknowledge the right of the EAMT to publicize the granting of the award.

Closing date

The closing date for submissions will be the same as the deadline for
research papers at EAMT 2016: March 25th, 2016

-- 
Lucia
www.dcs.shef.ac.uk/~lucia/
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


Re: [Moses-support] In-domain dictionary - where to include it

2016-03-05 Thread Joze Kadivec
Hello,

 

Can you please help me with the following error?

 

When I run the Tuning process with the following command ((nohup nice 
~/mosesdecoder/scripts/training/mert-moses.pl ~/corpus/news-test2008.true.fr 
~/corpus/news-test2008.true.en ~/mosesdecoder/bin/moses train/model/moses.ini 
--mertdir ~/mosesdecoder/bin/ --decoder-flags="-threads 4" &> mert.out)) the 
process stops immediately after it finishes calculating the best translation 
with the following exit codes:

 

Failed to get order of scores from nbestlist 'run1.best100.out': No such file 
or directory at /home/kjoze/mosesdecoder/scripts/training/mert-moses.pl line 
1474.

coring the nbestlist.

exec: /home/kjoze/working/mert-work/extractor.sh

Executing: /home/kjoze/working/mert-work/extractor.sh > extract.out 2> 
extract.err

Exit code: 1

ERROR: Failed to run '/home/kjoze/working/mert-work/extractor.sh'. at 
/home/kjoze/mosesdecoder/scripts/training/mert-moses.pl line 1744.

 

The contents of mert-moses.pl file at line 1474 is:

 

sub get_order_of_scores_from_nbestlist {

  # read the first line and interpret the ||| label: num num num label2: num 
||| column in nbestlist

  # return the score labels in order

  my $fname_or_source = shift;

  # print STDERR "Peeking at the beginning of nbestlist to get order of scores: 
$fname_or_source\n";

1474à  open my $fh, $fname_or_source or die "Failed to get order of scores from 
nbestlist '$fname_or_source': $!";

  my $line = <$fh>;

  close $fh;

  die "Line empty in nbestlist '$fname_or_source'" if !defined $line;

  my ($sent, $hypo, $scores, $total) = split /\|\|\|/, $line;

  $scores =~ s/^\s*|\s*$//g;

  die "No scores in line: $line" if $scores eq "";

 

  my @order = ();

  my $label = undef;

  my $sparse = 0; # we ignore sparse features here

  foreach my $tok (split /\s+/, $scores) {

 

Please advise. Does changing the news-test2008.true.fr file contents has any 
influence?

 

Thank you.

Kind regards,

Jože

 

 

From: phko...@gmail.com [mailto:phko...@gmail.com] On Behalf Of Philipp Koehn
Sent: Friday, March 04, 2016 11:03 PM
To: Joze Kadivec
Cc: moses-support@mit.edu
Subject: Re: [Moses-support] In-domain dictionary - where to include it

 

Hi,

 

the most straight-forward way is to add it to the phrase table.

 

You could add it multiple times, if you want it to be given more weight than 
the regular parallel corpus.

 

-phi

 

On Tue, Feb 23, 2016 at 3:10 PM, Joze Kadivec  wrote:

Hello,

I just recently installed Moses and would like to use it as part of my
masters thesis on Domain adaptation for less-resourced Slovenian language. I
was given some domain specific dictionaries for which I need your advice.
Where should I include it? Should I simply paste their contents to the end
of aligned corpora files used to create translation model? Is there another
way to use the dictionaries more efficiently?

Thank you.

Kind regards,
Jože Kadivec


___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

 

___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support