[R] text vector clustering

srinivasa raghavan Thu, 22 Jan 2009 03:15:37 -0800

Hi,

I am a new user of R using R 2.8.1 in windows 2003.  I have a  csv file with
single column which contain the 30,000 students names. There were typo
errors while entering this student names. The actual list of names is <
1000. However we dont have that list for keyword search.


 I am interested in grouping/cluster these names   as those which are
similar  letter to letter.  Are there any text clustering algorithm in R
which can group names of similar type in to segments of exactly matching ,
90% matching, 80% matching,....etc.

thanks in advance,

regards,
srinivas
statistical analyst.

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

[R] text vector clustering

Reply via email to