Hi Raphael

Hieu
Sent while bumping into things

On 8 Jun 2013, at 05:05 PM, Raphael Payen <[email protected]> wrote:

I wrote an external script for this some time ago. It replaced each
number with a tag before translation, and after translation it used the
word alignment information to replace the tags with their stored values. But
it only worked with phrase-based translation, not hierarchical. And it
didn't work well enough: I don't remember exactly, but I think it crashed
when the translation didn't have the same number of tags as the source,
and there may have been other problems.
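
For reference, the approach described above can be sketched roughly as
follows. This is not the original script, only a minimal illustration.
It assumes the input is already tokenized, that the word alignment is
available as space-separated "source-target" index pairs such as
"0-0 1-2", and that "@num@" is the tag; the tag name, the number
pattern and all function names are made up for the example. Instead of
crashing when the tag counts differ, it falls back to the leftmost
unused number, or leaves the tag in place if none remain.

```python
import re

# Hypothetical choices for this sketch: the tag and what counts as a number.
PLACEHOLDER = "@num@"
NUM_RE = re.compile(r"^\d+(?:[.,]\d+)*$")


def mask_numbers(tokens):
    """Replace numeric tokens with the tag; store values by source position."""
    masked, stored = [], {}
    for i, tok in enumerate(tokens):
        if NUM_RE.match(tok):
            stored[i] = tok
            masked.append(PLACEHOLDER)
        else:
            masked.append(tok)
    return masked, stored


def parse_alignment(text):
    """Parse 'src-tgt' pairs into a map: target index -> source indices."""
    tgt_to_src = {}
    for pair in text.split():
        src, tgt = pair.split("-")
        tgt_to_src.setdefault(int(tgt), []).append(int(src))
    return tgt_to_src


def restore_numbers(target_tokens, stored, alignment_text):
    """Put stored numbers back into the translation using the alignment."""
    tgt_to_src = parse_alignment(alignment_text)
    unused = dict(stored)
    out = []
    for j, tok in enumerate(target_tokens):
        if tok != PLACEHOLDER:
            out.append(tok)
            continue
        # Prefer a stored number whose source position aligns to this tag.
        src = next((s for s in tgt_to_src.get(j, []) if s in unused), None)
        if src is None and unused:
            # Fallback when the alignment is missing: leftmost unused number.
            src = min(unused)
        # If nothing is left, keep the tag rather than crash.
        out.append(unused.pop(src) if src is not None else tok)
    return out


if __name__ == "__main__":
    source = "I paid 30 euros for 2 books".split()
    masked, stored = mask_numbers(source)
    translation = "j' ai payé @num@ euros pour @num@ livres".split()
    alignment = "0-0 1-1 1-2 2-3 3-4 4-5 5-6 6-7"
    print(" ".join(restore_numbers(translation, stored, alignment)))
```

The fallback is a design choice for the sketch only; a count mismatch
could equally be logged or reported to the caller.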


I think this is why some of it should be implemented in the decoder. It
would be more efficient and more reliable.

If you still have the script that identifies numbers and replaces them with
placeholders, please add it to Moses. We can then start implementing this
functionality.

And anyway, if you plan to implement this "in the decoder", I suppose an
external script is not what you want.

--
Raphaël Payen

On Thu, Jun 6, 2013 at 9:59 PM, Hieu Hoang <[email protected]> wrote:

> As you say, the numbers in the input sentence would be unknown. However,
> the reason to use placeholders is to make them known to the LM and the
> phrase-table, so that both can assign more accurate scores to them.
>
> Therefore you need to replace numbers in your input sentence with
> placeholders.
>
> You can then use the word alignment from the decoder to put the number
> back.
>
> This is something I want to implement better in the decoder. So if you, or
> anyone, is willing to help me and contribute some time & code, I can help out.
>
>
>
> On 6 June 2013 15:25, Arezki Sadoune <[email protected]> wrote:
>
>> Dear Hieu Hoang,
>>
>> Thank you for the answer,
>>
>> Yes, I'm replacing the numbers with a placeholder in the training data and
>> in the LM as well. I thought this might address the number issue without
>> losing too much translation quality.
>>
>> Regarding the input sentence, I'm not interfering with the process, on the
>> assumption that the number will stay the same because it is unknown. Another
>> option would be to write a script which, as you said, puts back the original
>> number for 100% accuracy.
>>
>> Do you think I could use both at the same time?
>>
>> Regards
>>
>>
>> A.S
>>
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
>
> --
> Hieu Hoang
> Research Associate
> University of Edinburgh
> http://www.hoang.co.uk/hieu
>
>
>
>