The following module was proposed for inclusion in the Module List:
modid: Lingua::ZH::MMSEG
DSLIP: adpOl
description: Mandarin Chinese text segmentation
userid: DRYMAN (陳仁乾)
chapterid: 9 (Language_Interfaces)
communities:
github
similar:
Lingua::ZH::TaBE, Lingua::ZH::WordSegment
rationale:
I found that Lingua::ZH::Segment is registered. So change my
namespace to Lingua::ZH::MMSEG
A problem in computational analysis of Chinese text is that there
are no word boundaries in conventionally printed text. Since the
word is such a fundamental linguistic unit, it is necessary to
identify words in Chinese text so that higher-level analyses can be
performed. This module provide phrase segmentation using Maximum
Matching Algorithm. It was found that the system successfully
identified 98.41% of words in a sample consisting of 1013 words.
enteredby: DRYMAN (陳仁乾)
enteredon: Tue Dec 27 12:40:14 2011 GMT
The resulting entry would be:
Lingua::ZH::
::MMSEG adpOl Mandarin Chinese text segmentation DRYMAN
Thanks for registering,
--
The PAUSE
PS: The following links are only valid for module list maintainers:
Registration form with editing capabilities:
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=08900000_7e269787df8188bc&SUBMIT_pause99_add_mod_preview=1
Immediate (one click) registration:
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=08900000_7e269787df8188bc&SUBMIT_pause99_add_mod_insertit=1
Peek at the current permissions:
https://pause.perl.org/pause/authenquery?pause99_peek_perms_by=me&pause99_peek_perms_query=Lingua%3A%3AZH%3A%3AMMSEG