Re: performance study

2012-07-27 Thread Sean Owen
Are you basically asking how much faster a parallel algorithm is than non-parallel? If you're measuring wall-clock time, the answer depends on how many workers/threads you use to parallelize. The point is the time generally goes down as more workers are added, so there's not one answer. If

Naive Bayesian - How to convert graphml input data into key, value pairs

2012-07-27 Thread k6.amruta
Hi, I have some graphs (in graphml format) for all the known categories whose nodes edges have values and attributes associated with them. I want to do graph mining on this data so that whenever a new unknown graph comes in I can map it to the correct known category. After reading some stuff

Re: performance study

2012-07-27 Thread Dmitriy Lyubimov
IMO it doesn't make much sense to compare non-parallel and a parallel algorithm (assuming they are running approximately same flops-sized computation). Which is probably why there's not so many (i don't know any). However, there are studies comparing parallel approaches (e.g. certain mahout vs.

Exception in thread main java.lang.NoClassDefFoundError: classpath

2012-07-27 Thread k6.amruta
I am trying to run Wikipedia Bayes Example from https://cwiki.apache.org/confluence/...+Bayes+Example When I ran the following command : $MAHOUT_HOME/bin/mahout wikipediaXMLSplitter -d $MAHOUT_HOME/examples/temp/enwiki-latest-pages-articles10.xml -o wikipedia/chunks -c 64 I am getting this