Hi Guillaume,
Check out the manning forum, I have posted the code snippets there.

The thread is here
http://www.manning-sandbox.com/thread.jspa?threadID=36895&tstart=0

Robin

On Thu, Mar 4, 2010 at 6:12 PM, tog <[email protected]> wrote:

> Dear all,
>
> Ok I tried 0.2 and it seems to work. Due to some comment on the list
> regarding KMean in 0.2 I decided to move to trunk since 0.3 seems
> close enough.
> Nevertheless I had an issue with the creation of a writer (I am still
> in the example of Ch 7 of Mahout in Action). Here is what I got:
>
>        at
> org.apache.hadoop.io.serializer.SerializationFactory.getSerializer(SerializationFactory.java:73)
>        at
> org.apache.hadoop.io.SequenceFile$Writer.init(SequenceFile.java:910)
>        at
> org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:843)
>        at
> org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:831)
>        at
> org.apache.hadoop.io.SequenceFile$Writer.<init>(SequenceFile.java:823)
>
> when calling:
>
> writer = new SequenceFile.Writer(fs, conf, path, LongWritable.class,
> DenseVector.class)
>
> Anyone having the same issue ?
>
> Regards
> Guillaume
>
>
> On Tue, Feb 23, 2010 at 7:14 PM, Alleon Guillaume
> <[email protected]> wrote:
> > Hi all,
> >
> > I am a complete newbie in action ... despite I have gone throuh the book
> of the same collection;)
> > I would like to classify a number of items - each of them being
> characterize by a number of vectors. I thought it will be a good idea first
> to classify the vectors. Unfortunately my items number keep on growing so
> what I have done so far is a small piece of code constructing the mahout
> dense vectors on the fly setting the name as my item name. As far as I
> understand those vectors are kept in memory ...
> > What are the next steps for me ?
> > Storing those vectors on disk I assume :)
> > Then creating some canopies and then using kmean to create my clusters.
> > Can you guide me trough some steps ?
> >
> > Then I have more questions ?
> > Can mahout determine an "optimal" number of clusters ?
> > Once a set of clusters exist and new items are added, is it possible to
> update the existing clusters ? Is it possible to add clusters at alower cost
> than recreting it ?
> >
> > T hanks for your help and time
> > Regards
> > Guillaume
> >
>
>
>
> --
> PGP KeyID: 1024D/69B00854  subkeys.pgp.net
>
> http://cheztog.blogspot.com
>

Reply via email to