We’ve had reasonable luck with the Stanford Chinese segmenter - I think the ctb model did better than the pku one for our use case
> Message: 2 > Date: Fri, 20 Mar 2015 13:19:02 +0100 > From: Marcin Junczys-Dowmunt <junc...@amu.edu.pl> > Subject: [Moses-support] Chinese segmentation/tokenization > To: Moses Support <moses-support@mit.edu> > Message-ID: <e4d171cb90994cb853a9965facaeb...@amu.edu.pl> > Content-Type: text/plain; charset="us-ascii" > > > > Hi, > > questions appear from time to time on the list concerning Chinese > segmentation/tokenization. I saw Barry mention Lingpipe and other tools. > Is there a favourite tool you guys prefer to use over others? > > Thanks, > > Marcin _______________________________________________ Moses-support mailing list Moses-support@mit.edu http://mailman.mit.edu/mailman/listinfo/moses-support