Hi, I've put up a slightly cleaner version of the script on JIRA at https://issues.apache.org/jira/browse/MAHOUT-390
Best regards, Sisir Koppaka On Mon, May 3, 2010 at 11:28 PM, Grant Ingersoll <gsing...@apache.org>wrote: > Sisir, > > Thanks for the script. I think it would be great to open a JIRA issue for > this and we can check in the shell script under the examples. > > I think LDA also has similar tools to download Reuters, we should try to > reuse if possible. > > On May 2, 2010, at 3:42 PM, Sisir Koppaka wrote: > > > For GSOC students, > > In case anyone was going through the code and finding some difficulty in > > running stuff, I have updated the kMeans page on the > > wiki<https://cwiki.apache.org/confluence/display/MAHOUT/k-Means> with > > a short quickstart shell script that will run it for you. You can tweak > the > > settings and reuse it. Reading the code after running it will hopefully > help > > out in understanding the codebase well. > > > > If any of you have any tips to share, or have made notes of > > quirks-to-be-aware-of, do post them here for everyone's benefit. > > >