I didn't realize that a compound-splitting script was distributed
with Moses, but I suspect it is quite a bit different. The one in
cdec is based on CRFs and uses a range of features to model
segmentations, and it also produces segmentation lattices. It was
trained on segmentations that seemed (to my intuition) "sensible"
for MT.
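
For anyone unfamiliar with the term: a segmentation lattice encodes
several alternative splits of one word compactly, so the decoder can
choose among them instead of committing to a single split up front.
The Python below is only a toy sketch of that data structure. It
assumes a hand-written vocabulary and plain substring matching, with
no linking elements and no scores. It is not how the CRF model in
cdec works, and the function names and the vocabulary are made up for
illustration.

```python
from typing import Iterator, List, Set, Tuple

Edge = Tuple[int, int, str]  # (start offset, end offset, surface piece)


def segmentation_lattice(word: str, vocab: Set[str], min_len: int = 3) -> List[Edge]:
    """Return lattice edges for `word`.

    Every path from offset 0 to len(word) is one candidate segmentation.
    `vocab` is a toy word list (an assumption, not a cdec resource).
    """
    n = len(word)
    # Candidate edges: every substring found in the toy vocabulary,
    # plus the unsplit word itself so that "no split" is always a path.
    edges = {(i, j, word[i:j])
             for i in range(n)
             for j in range(i + min_len, n + 1)
             if word[i:j] in vocab}
    edges.add((0, n, word))

    # Offsets reachable from the start of the word.
    forward = {0}
    for i in range(n):
        if i in forward:
            forward.update(e for (s, e, _) in edges if s == i)

    # Offsets from which the end of the word can be reached.
    backward = {n}
    for j in range(n, 0, -1):
        if j in backward:
            backward.update(s for (s, e, _) in edges if e == j)

    # Keep only edges that lie on at least one complete path.
    return sorted(e for e in edges if e[0] in forward and e[1] in backward)


def paths(edges: List[Edge], n: int, start: int = 0) -> Iterator[List[str]]:
    """Enumerate every segmentation encoded by the lattice."""
    if start == n:
        yield []
        return
    for (s, e, piece) in edges:
        if s == start:
            for rest in paths(edges, n, e):
                yield [piece] + rest


if __name__ == "__main__":
    toy_vocab = {"ton", "band", "tonband", "aufnahme"}
    lattice = segmentation_lattice("tonbandaufnahme", toy_vocab)
    for segmentation in paths(lattice, len("tonbandaufnahme")):
        print(" + ".join(segmentation))
```

With this toy vocabulary the lattice for "tonbandaufnahme" has five
edges and three paths: ton + band + aufnahme, tonband + aufnahme, and
the unsplit word. A real splitter would also attach a score to each
edge, which this sketch leaves out.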

On Fri, Mar 11, 2011 at 3:05 AM, Joerg Tiedemann
<jorg.tiedem...@lingfil.uu.se> wrote:
> On Thu, Mar 10, 2011 at 8:09 PM, Chris Dyer <cd...@cs.cmu.edu> wrote:
>> There's a German compound splitting tool that's tuned for MT that's
>> released as part of cdec (https://github.com/redpony/cdec). You'll
>> have to build the decoder, but then you should be able to run the
>> script in
>>
>> cdec/compound-split/compound-split.pl
>
> Does this use the same idea as the compound-splitter script
> distributed with Moses?
> Are there any known performance differences?
>
> Jörg
>
>
>>
>> -Chris
>>
>> On Thu, Mar 10, 2011 at 1:50 PM, Tom Hoar
>> <tah...@precisiontranslationtools.com> wrote:
>>> I know the German language requires special corpus preparation. Can
>>> someone point me in the right direction regarding compound words,
>>> stemming, etc.?
>>>
>>> Thanks,
>>> Tom
>>>
>>> _______________________________________________
>>> Moses-support mailing list
>>> Moses-support@mit.edu
>>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>>
>>>
>
>
>
> --
> **********************************************************************************
> Jörg Tiedemann                      jorg.tiedem...@lingfil.uu.se
> Dep. of Linguistics and Philology   http://stp.lingfil.uu.se/~joerg/
> Uppsala University                  tel: +46 (0)18 - 471 1412
> Box 635, SE-751 26 Uppsala/SWEDEN   fax: +46 (0)18 - 471 1094
>
>
