If you treat entire paragraphs as segments, then you'll presumably end up with very long segments. This will make it difficult to get good alignments, and so the resulting models may be of poor quality. Also note that there will be nothing to prevent the extracted phrases from spanning sentence boundaries, which is probably not what you want.
- John Burger MITRE On Jul 22, 2013, at 07:23 , Heidi Heweidy wrote: > Good evening, > If I just press enter so that paragraphs are parallel to each other manually, > does it count to be a a sentence aligned data set? > Cheers, > Heidi. > > _______________________________________________ > Moses-support mailing list > Moses-support@mit.edu > http://mailman.mit.edu/mailman/listinfo/moses-support _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support