Re: [Moses-support] Data collection

2016-04-19 Thread Philipp Koehn
Hi,

the common training pipeline limits sentences to at most 80 words.
This is due to limitations in GIZA++.

There can be any mix of sentence lengths - long sentences, short
sentences, single words.

There is a good chance for the system to translate "I eat an apple"
correctly, if it a training sentence pair with "I eat an apple on Friday
and
an orange on Saturday."

-phi

On Tue, Apr 19, 2016 at 6:15 AM, Sanjanashree Palanivel <
sanjanash...@gmail.com> wrote:

> Hi,
>
>How the data should be collected for training Moses.
>
>I wish to know how much longer and shorter the sentence can be for
> training moses.
>
> What will happens, if the simple sentences like "I eat an apple" are given
> for training with longer sentences.
>
> and what if i give a word as a sentence in data.
>
>
>
> --
> Thanks and regards,
>
> Sanjanasri J.P
>
> ___
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support


[Moses-support] Data collection

2016-04-19 Thread Sanjanashree Palanivel
Hi,

   How the data should be collected for training Moses.

   I wish to know how much longer and shorter the sentence can be for
training moses.

What will happens, if the simple sentences like "I eat an apple" are given
for training with longer sentences.

and what if i give a word as a sentence in data.



-- 
Thanks and regards,

Sanjanasri J.P
___
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support