Re: [Moses-support] Major bug found in Moses

Matthias Huck Fri, 19 Jun 2015 12:50:21 -0700

Hi James,

Well, it's pretty straightforward: The decoder's job is to find the
hypothesis with the maximum model score. That's why everybody builds
models which assign high model score to high-quality translations.
Unfortunately, you missed this last point in your own work.


Cheers,
Matthias


On Fri, 2015-06-19 at 14:15 +0000, Read, James C wrote:
> I'm gonna try once more. This is what he said:
> 
> "the decoder's job is NOT to find the high quality translation"
> 
> The  next time I have a panel of potential investors in front of me
> I'm gonna pass that line by them and see how it goes down. I stress
> the words HIGH QUALITY TRANSLATION.
> 
> Please promise me that the next time you put in a bid for funding you
> will guarantee your prospective funders that under no circumstances
> will you attempt to design a system which searches for HIGH QUALITY
> TRANSLATION.
> 
> James
> 
> ________________________________________
> From: Matthias Huck <mh...@inf.ed.ac.uk>
> Sent: Friday, June 19, 2015 5:08 PM
> To: Read, James C
> Cc: Hieu Hoang; moses-support@mit.edu; Arnold, Doug
> Subject: Re: [Moses-support] Major bug found in Moses
> 
> Hi James,
> 
> Yes, he just said that.
> 
> The decoder's job is to find the hypothesis with the maximum model
> score. That's one reason why your work is flawed. You did not care at
> all whether your model score correlates with BLEU or not.
> 
> Cheers,
> Matthias
> 
> 
> On Fri, 2015-06-19 at 13:24 +0000, Read, James C wrote:
> > I quote:
> >
> >
> > "the decoder's job is NOT to find the high quality translation"
> >
> >
> >
> > Did you REALLY just say that?
> >
> >
> > James
> >
> >
> >
> >
> > ______________________________________________________________________
> > From: Hieu Hoang <hieuho...@gmail.com>
> > Sent: Wednesday, June 17, 2015 9:00 PM
> > To: Read, James C
> > Cc: Kenneth Heafield; moses-support@mit.edu; Arnold, Doug
> > Subject: Re: [Moses-support] Major bug found in Moses
> >
> > the decoder's job is NOT to find the high quality translation (as
> > measured by bleu). It's job is to find translations with high model
> > score.
> >
> >
> > you need the tuning to make sure high quality translation correlates
> > with high model score. If you don't tune, it's pot luck what quality
> > you get.
> >
> >
> > You should tune with the features you use
> >
> >
> > Hieu Hoang
> > Researcher
> >
> > New York University, Abu Dhabi
> >
> > http://www.hoang.co.uk/hieu
> >
> >
> > On 17 June 2015 at 21:52, Read, James C <jcr...@essex.ac.uk> wrote:
> >         The analogy doesn't seem to be helping me understand just how
> >         exactly it is a desirable quality of a TM to
> >
> >         a) completely break down if no LM is used (thank you for
> >         showing that such is not always the case)
> >         b) be dependent on a tuning step to help it find the higher
> >         scoring translations
> >
> >         What you seem to be essentially saying is that the TM cannot
> >         find the higher scoring translations because I didn't pretune
> >         the system to do so. And I am supposed to accept that such is
> >         a desirable quality of a system whose very job is to find the
> >         higher scoring translations.
> >
> >         Further, I am still unclear which features you prequire a
> >         system to be tuned on. At the very least it seems that I have
> >         discovered the selection process that tuning seems to be
> >         making up for in some unspecified and altogether opaque way.
> >
> >         James
> >
> >
> >         ________________________________________
> >         From: Hieu Hoang <hieuho...@gmail.com>
> >         Sent: Wednesday, June 17, 2015 8:34 PM
> >         To: Read, James C; Kenneth Heafield; moses-support@mit.edu
> >         Cc: Arnold, Doug
> >         Subject: Re: [Moses-support] Major bug found in Moses
> >
> >         4 BLEU is nothing to sniff at :) I was answering Ken's tangent
> >         aspersion
> >         that LM are needed for tuning.
> >
> >         I have some sympathy for you. You're looking at ways to
> >         improve
> >         translation by reducing the search space. I've bashed my head
> >         against
> >         this wall for a while as well without much success.
> >
> >         However, as everyone is telling you, you haven't understood
> >         the role of
> >         tuning. Without tuning, you're pointing your lab rat to some
> >         random part
> >         of the search space, instead of away from the furry animal
> >         with whiskers
> >         and towards the yellow cheesy thing
> >
> >         On 17/06/2015 20:45, Read, James C wrote:
> >         > Doesn't look like the LM is contributing all that much then
> >         does it?
> >         >
> >         > James
> >         >
> >         > ________________________________________
> >         > From: moses-support-boun...@mit.edu
> >         <moses-support-boun...@mit.edu> on behalf of Hieu Hoang
> >         <hieuho...@gmail.com>
> >         > Sent: Wednesday, June 17, 2015 7:35 PM
> >         > To: Kenneth Heafield; moses-support@mit.edu
> >         > Subject: Re: [Moses-support] Major bug found in Moses
> >         >
> >         > On 17/06/2015 20:13, Kenneth Heafield wrote:
> >         >> I'll bite.
> >         >>
> >         >> The moses.ini files ship with bogus feature weights.  One
> >         is required to
> >         >> tune the system to discover good weights for their system.
> >         You did not
> >         >> tune.  The results of an untuned system are meaningless.
> >         >>
> >         >> So for example if the feature weights are all zeros, then
> >         the scores are
> >         >> all zero.  The system will arbitrarily pick some awful
> >         translation from
> >         >> a large space of translations.
> >         >>
> >         >> The filter looks at one feature p(target | source).  So now
> >         you've
> >         >> constrained the awful untuned model to a slightly better
> >         region of the
> >         >> search space.
> >         >>
> >         >> In other words, all you've done is a poor approximation to
> >         manually
> >         >> setting the weight to 1.0 on p(target | source) and the
> >         rest to 0.
> >         >>
> >         >> The problem isn't that you are running without a language
> >         model (though
> >         >> we generally do not care what happens without one).  The
> >         problem is that
> >         >> you did not tune the feature weights.
> >         >>
> >         >> Moreover, as Marcin is pointing out, I wouldn't necessarily
> >         expect
> >         >> tuning to work without an LM.
> >         > Tuning does work without a LM. The results aren't half bad.
> >         fr-en
> >         > europarl (pb):
> >         >     with LM: 22.84
> >         >     retuned without LM: 18.33
> >         >> On 06/17/15 11:56, Read, James C wrote:
> >         >>> Actually the approximation I expect to be:
> >         >>>
> >         >>> p(e|f)=p(f|e)
> >         >>>
> >         >>> Why would you expect this to give poor results if the TM
> >         is well trained? Surely the results of my filtering
> >         experiments provve otherwise.
> >         >>>
> >         >>> James
> >         >>>
> >         >>> ________________________________________
> >         >>> From: moses-support-boun...@mit.edu
> >         <moses-support-boun...@mit.edu> on behalf of Rico Sennrich
> >         <rico.sennr...@gmx.ch>
> >         >>> Sent: Wednesday, June 17, 2015 5:32 PM
> >         >>> To: moses-support@mit.edu
> >         >>> Subject: Re: [Moses-support] Major bug found in Moses
> >         >>>
> >         >>> Read, James C <jcread@...> writes:
> >         >>>
> >         >>>> I have been unable to find a logical explanation for this
> >         behaviour other
> >         >>> than to conclude that there must be some kind of bug in
> >         Moses which causes a
> >         >>> TM only run of Moses to perform poorly in finding the most
> >         likely
> >         >>> translations according to the TM when
> >         >>>>    there are less likely phrase pairs included in the
> >         race.
> >         >>> I may have overlooked something, but you seem to have
> >         removed the language
> >         >>> model from your config, and used default weights. your
> >         default model will
> >         >>> thus (roughly) implement the following model:
> >         >>>
> >         >>> p(e|f) = p(e|f)*p(f|e)
> >         >>>
> >         >>> which is obviously wrong, and will give you poor results.
> >         This is not a bug
> >         >>> in the code, but a poor choice of models and weights.
> >         Standard steps in SMT
> >         >>> (like tuning the model weights on a development set, and
> >         including a
> >         >>> language model) will give you the desired results.
> >         >>>
> >         >>> _______________________________________________
> >         >>> Moses-support mailing list
> >         >>> Moses-support@mit.edu
> >         >>> http://mailman.mit.edu/mailman/listinfo/moses-support
> >         >>>
> >         >>> _______________________________________________
> >         >>> Moses-support mailing list
> >         >>> Moses-support@mit.edu
> >         >>> http://mailman.mit.edu/mailman/listinfo/moses-support
> >         >> _______________________________________________
> >         >> Moses-support mailing list
> >         >> Moses-support@mit.edu
> >         >> http://mailman.mit.edu/mailman/listinfo/moses-support
> >         >>
> >         > --
> >         > Hieu Hoang
> >         > Researcher
> >         > New York University, Abu Dhabi
> >         > http://www.hoang.co.uk/hieu
> >         >
> >         > _______________________________________________
> >         > Moses-support mailing list
> >         > Moses-support@mit.edu
> >         > http://mailman.mit.edu/mailman/listinfo/moses-support
> >         > .
> >         >
> >
> >         --
> >         Hieu Hoang
> >         Researcher
> >         New York University, Abu Dhabi
> >         http://www.hoang.co.uk/hieu
> >
> >
> >
> >
> > _______________________________________________
> > Moses-support mailing list
> > Moses-support@mit.edu
> > http://mailman.mit.edu/mailman/listinfo/moses-support
> 
> 
> 
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.
> 



-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] Major bug found in Moses

Reply via email to