Hi,
I have one more question:
In the lex.e2f file there is a translation Gitarre-guitar:
Gitarre guitar 0.400
Gitarre using 0.284
Gitarre ; 0.017
Why has it not become part of the phrase table?
Thanks again!
Vera
-Original Message-
From: Vera
Hi Vera,
It's odd that the lexical translation model contains such an entry if
the pair is always unaligned. Maybe you used a different word alignment
when you extracted the lexicon model?
You should inspect your word alignment manually to check whether its
quality is reasonable.
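
One way to do that check for a single word is sketched below. It assumes
the alignment file uses the usual "i-j" format (source index, then target
index, one sentence per line); the function name and the toy sentences are
made up for illustration and are not taken from your data.

```python
from collections import Counter

def aligned_targets(src_sents, tgt_sents, alignments, word):
    """Count the target words that `word` is linked to.

    A source occurrence with no alignment link is counted
    under the key '<unaligned>'.
    """
    counts = Counter()
    for src, tgt, align in zip(src_sents, tgt_sents, alignments):
        src_toks, tgt_toks = src.split(), tgt.split()
        # Each link is "sourceIndex-targetIndex" (assumed format).
        links = [tuple(map(int, pair.split("-"))) for pair in align.split()]
        for i, tok in enumerate(src_toks):
            if tok != word:
                continue
            targets = [tgt_toks[j] for (s, j) in links if s == i]
            if targets:
                counts.update(targets)
            else:
                counts["<unaligned>"] += 1
    return counts

# Toy data, purely illustrative.
src = ["die Gitarre", "Gitarre spielen"]
tgt = ["the guitar", "playing guitar"]
ali = ["0-0 1-1", "1-0"]
counts = aligned_targets(src, tgt, ali, "Gitarre")
print(counts)
```

If the word is mostly reported as unaligned here but still has a high
probability in the lexicon file, the two were probably built from
different alignments.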
Hi Vera
I think the situation you describe could happen even without unaligned
words. Suppose that you have a 2 word sentence on each side, and the
alignment points are (0,0), (0,1) and (1,0) - I think this is possible
with the usual symmetrisation algorithm. Then you would extract the
phrase
Yes, that's right. That's a situation as illustrated in Fig. 1b of
http://ufal.mff.cuni.cz/pbml/95/art-stein-vilar-ney-jane.pdf
and a single word heuristic as proposed in that paper can be a remedy.
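
To make the example with the alignment points (0,0), (0,1) and (1,0)
concrete, here is a minimal sketch of the consistency check used in phrase
extraction. It is a simplified version (it does not extend phrases over
unaligned target words) and is not the Moses extractor itself; the function
name and the max_len default are assumptions for illustration.

```python
def extract_phrases(src_len, links, max_len=7):
    """Return consistent phrase pairs as ((s1, s2), (t1, t2)) index spans.

    `links` is a collection of (source_index, target_index) alignment points.
    """
    pairs = []
    for s1 in range(src_len):
        for s2 in range(s1, min(src_len, s1 + max_len)):
            # Target positions linked to any word in the source span.
            tgt_idx = [t for (s, t) in links if s1 <= s <= s2]
            if not tgt_idx:
                continue  # span has no alignment points at all
            t1, t2 = min(tgt_idx), max(tgt_idx)
            if t2 - t1 + 1 > max_len:
                continue
            # Consistent only if no target word in [t1, t2] is linked
            # to a source word outside [s1, s2].
            if all(s1 <= s <= s2 for (s, t) in links if t1 <= t <= t2):
                pairs.append(((s1, s2), (t1, t2)))
    return pairs

links = {(0, 0), (0, 1), (1, 0)}
pairs = extract_phrases(2, links)
print(pairs)
```

Under these assumptions only the pair covering both words on each side
passes the check. Neither single source word yields a phrase pair, although
every one of the three links still contributes a count to the lexical
translation table.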
On Thu, 2014-11-27 at 16:51 +, Barry Haddow wrote:
Hi Vera
I think the situation you