We’ve had reasonable luck with the Stanford Chinese segmenter. I think the ctb
model did better than the pku one for our use case.

> Message: 2
> Date: Fri, 20 Mar 2015 13:19:02 +0100
> From: Marcin Junczys-Dowmunt <junc...@amu.edu.pl>
> Subject: [Moses-support] Chinese segmentation/tokenization
> To: Moses Support <moses-support@mit.edu>
> Message-ID: <e4d171cb90994cb853a9965facaeb...@amu.edu.pl>
> Content-Type: text/plain; charset="us-ascii"
> 
> Hi, 
> 
> questions appear from time to time on the list concerning Chinese
> segmentation/tokenization. I saw Barry mention Lingpipe and other tools.
> Is there a favourite tool you guys prefer to use over others? 
> 
> Thanks, 
> 
> Marcin 


_______________________________________________
Moses-support mailing list
Moses-support@mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
