Re: Problems running examples

2011-09-02 Thread Dan Brickley
s this relate to your problems? Dan > Jeff > > -----Original Message----- > From: danbri2...@danbri.org [mailto:danbri2...@danbri.org] On Behalf Of Dan > Brickley > Sent: Wednesday, August 31, 2011 12:16 PM > To: Jeff Eastman > Cc: user@mahout.apache.org > Subject: Re:

RE: Problems running examples

2011-08-31 Thread Jeff Eastman
, August 31, 2011 12:16 PM To: Jeff Eastman Cc: user@mahout.apache.org Subject: Re: Problems running examples On 10 June 2011 18:34, Jeff Eastman wrote: > I'm still trying to figure out why reuters-0.5 does not work on either of my > clusters. The scripts themselves have no diff and the

Re: Problems running examples

2011-08-31 Thread Dan Brickley
On 10 June 2011 18:34, Jeff Eastman wrote: > I'm still trying to figure out why reuters-0.5 does not work on either of my > clusters. The scripts themselves have no diff and the environment variables > are set as in trunk except for MAHOUT_HOME. The synthetic control and 20 > newsgroups example

Re: Problems running examples

2011-06-11 Thread Alex Ott
Yes, this was a bug in the command-line examples in MiA; it was already fixed for the production version of the book. On Thu, Jun 9, 2011 at 6:23 PM, Mark wrote: > Hector > > Try using the reuters-vectors/tfidf-vectors folder as input, not the top > level reuters-vectors. > > On 6/9/11 8:45 AM, Hector Ye

Re: Problems running examples

2011-06-10 Thread Drew Farris
unk, I don't see this and it works just fine. > > -----Original Message----- > From: Drew Farris [mailto:d...@apache.org] > Sent: Thursday, June 09, 2011 5:36 PM > To: user@mahout.apache.org > Subject: Re: Problems running examples > > Jeff, No impugning perceived and thanks for runni

RE: Problems running examples

2011-06-10 Thread Jeff Eastman
d it works just fine. -----Original Message----- From: Drew Farris [mailto:d...@apache.org] Sent: Thursday, June 09, 2011 5:36 PM To: user@mahout.apache.org Subject: Re: Problems running examples Jeff, No impugning perceived and thanks for running the variety of tests. So it appears that trunk is fine

Re: Problems running examples

2011-06-10 Thread Sean Owen
Sorry all, I take this back. I am at best confused. There seems to be some superficial problem here but the underlying implementation ought to be just fine. Use it with confidence. On Thu, Jun 9, 2011 at 8:28 AM, Sean Owen wrote: > (Assuming you are on HEAD,) I think KMeans is busted -- this has

Re: Problems running examples

2011-06-09 Thread Drew Farris
fails in both >> KMeans and LDA when run on a cluster. The MIA examples are not always >> correct. Most of this has been reported before. >> >> -----Original Message----- >> From: Sean Owen [mailto:sro...@gmail.com] >> Sent: Thursday, June 09, 2011 12:29 AM >

RE: Problems running examples

2011-06-09 Thread Jeff Eastman
To: user@mahout.apache.org Subject: RE: Problems running examples Hi Drew, Running trunk locally, latest update, just now, build-reuters.sh works (kmeans and lda). Running trunk on my CDH3 cluster, just now: - build-cluster-syntheticcontrol.sh works (with kmeans and others) - build-reuters.sh

RE: Problems running examples

2011-06-09 Thread Jeff Eastman
-- From: Drew Farris [mailto:d...@apache.org] Sent: Thursday, June 09, 2011 11:36 AM To: user@mahout.apache.org Subject: Re: Problems running examples Jeff, Could you tell me about what's failing in KMeans and LDA when running on a cluster? I had this working just prior to 0.5 in https://issu

Re: Problems running examples

2011-06-09 Thread Drew Farris
n [mailto:sro...@gmail.com] > Sent: Thursday, June 09, 2011 12:29 AM > To: user@mahout.apache.org > Subject: Re: Problems running examples > > (Assuming you are on HEAD,) I think KMeans is busted -- this has come up > before. I don't know if it is being maintained. Anyone who

RE: Problems running examples

2011-06-09 Thread Jeff Eastman
: Sean Owen [mailto:sro...@gmail.com] Sent: Thursday, June 09, 2011 12:29 AM To: user@mahout.apache.org Subject: Re: Problems running examples (Assuming you are on HEAD,) I think KMeans is busted -- this has come up before. I don't know if it is being maintained. Anyone who's willing

Re: Problems running examples

2011-06-09 Thread Hector Yee
Oh that fixes it! +Ted for the Mahout in Action example. I'll fix it in the wiki if I can find it. On Thu, Jun 9, 2011 at 9:23 AM, Mark wrote: > Hector > > Try using the reuters-vectors/tfidf-vectors folder as input, not the top > level reuters-vectors. > > > On 6/9/11 8:45 AM, Hector Yee wrot

Re: Problems running examples

2011-06-09 Thread Mark
Hector, try using the reuters-vectors/tfidf-vectors folder as input, not the top-level reuters-vectors. On 6/9/11 8:45 AM, Hector Yee wrote: I was following the book examples and k-means, Dirichlet and LDA all have this casting problem. It may be a Mac issue, not sure. I suspect it may be
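
Mark's suggestion can be sketched as a small shell helper that picks the tfidf-vectors subfolder when it is present. This is only an illustration under stated assumptions: the function name pick_vector_input is hypothetical (it is not part of Mahout or the book's scripts), and the check looks at the local filesystem only, so vectors stored on HDFS would need a different test.

```shell
#!/bin/sh
# Hypothetical helper (not part of Mahout): given the vector output
# directory, print its tfidf-vectors subfolder when that exists on the
# local filesystem, otherwise print the directory unchanged.
pick_vector_input() {
  base="${1%/}"   # drop one trailing slash, if any
  if [ -d "$base/tfidf-vectors" ]; then
    printf '%s\n' "$base/tfidf-vectors"
  else
    printf '%s\n' "$base"
  fi
}

# Example: resolve the input path before handing it to a clustering job.
INPUT=$(pick_vector_input "reuters-vectors")
echo "clustering input: $INPUT"
```

The point is simply that the clustering jobs should be given the folder that holds the vectors, not its parent.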

Re: Problems running examples

2011-06-09 Thread Sean Owen
Yes, I just mean I think we have heard about this casting issue for a while and not sure it was fixed up in 0.5. I imagine it is only a shallow problem, but I also wasn't sure if anyone was looking after k-means since I am not sure I recall anyone looking into this. Let me file a JIRA to track thi

Re: Problems running examples

2011-06-09 Thread Hector Yee
I was following the book examples and k-means, Dirichlet and LDA all have this casting problem. It may be a Mac issue, not sure. I suspect it may be seq2sparse messing up the inputs, maybe the wrong version. It outputs the regular part-r-* but the LDA driver expects a file called data. Sent from

Re: Problems running examples

2011-06-09 Thread Drew Farris
Sean, I'd be surprised to find out that k-means was busted. It was working just prior to release 0.5 when I was working on https://issues.apache.org/jira/browse/MAHOUT-694 which may be related to Mark's problems, but then again I haven't been tracking the other patches that were applied around that

Re: Problems running examples

2011-06-09 Thread Mark
Forgot to mention... great book :) On 6/9/11 7:30 AM, Mark wrote: KMeans is busted? What do you mean by this? The algorithm simply won't work or just the reuters example? Thanks On 6/9/11 12:28 AM, Sean Owen wrote: (Assuming you are on HEAD,) I think KMeans is busted -- this has come up befo

Re: Problems running examples

2011-06-09 Thread Mark
KMeans is busted? What do you mean by this? The algorithm simply won't work or just the reuters example? Thanks On 6/9/11 12:28 AM, Sean Owen wrote: (Assuming you are on HEAD,) I think KMeans is busted -- this has come up before. I don't know if it is being maintained. Anyone who's willing to

Re: Problems running examples

2011-06-09 Thread Sean Owen
(Assuming you are on HEAD,) I think KMeans is busted -- this has come up before. I don't know if it is being maintained. Anyone who's willing to step up and fix it is also welcome to overhaul it IMHO. On Thu, Jun 9, 2011 at 12:03 AM, Hector Yee wrote: > I got a slightly different error on the n

Re: Problems running examples

2011-06-08 Thread Hector Yee
ro...@gmail.com] > Sent: Sunday, June 05, 2011 11:56 AM > To: user@mahout.apache.org > Subject: Re: Problems running examples > > This all sounds a lot like things that were fixed a little while ago. Are > you on version 0.5, or better yet, SVN HEAD? > > The rest, I don't

RE: Problems running examples

2011-06-05 Thread Jeff Eastman
[mailto:sro...@gmail.com] Sent: Sunday, June 05, 2011 11:56 AM To: user@mahout.apache.org Subject: Re: Problems running examples This all sounds a lot like things that were fixed a little while ago. Are you on version 0.5, or better yet, SVN HEAD? The rest, I don't know, would have to defer t

Re: Problems running examples

2011-06-05 Thread Mark
Any idea on how I can generate the sequence files locally? On 6/5/11 12:23 PM, Mark wrote: I was on an older trunk version of 0.5 but then I realized there was an official release the other day so I retried on that with the same results. Running the same on 0.4 works as expected. On 6/5/11 1

Re: Problems running examples

2011-06-05 Thread Mark
I was on an older trunk version of 0.5 but then I realized there was an official release the other day so I retried on that with the same results. Running the same on 0.4 works as expected. On 6/5/11 11:56 AM, Sean Owen wrote: This all sounds a lot like things that were fixed a little while a

Re: Problems running examples

2011-06-05 Thread Sean Owen
This all sounds a lot like things that were fixed a little while ago. Are you on version 0.5, or better yet, SVN HEAD? The rest, I don't know, would have to defer to the author of that bit. On Sun, Jun 5, 2011 at 7:07 PM, Mark wrote: > Hi all. I'm trying to run the examples/bin/build-reuters.s

Re: Problems running examples

2011-06-05 Thread Mark
FYI:

$hadoop version
Hadoop 0.20.2
Subversion https://svn.apache.org/repos/asf/hadoop/common/branches/branch-0.20 -r 911707
Compiled by chrisdo on Fri Feb 19 08:07:34 UTC 2010

On 6/5/11 11:07 AM, Mark wrote: Hi all. I'm trying to run the examples/bin/build-reuters.sh but I continue to run i

Re: Problems running examples

2011-06-05 Thread Mark
It seems like seqdirectory expects the input to be on HDFS and not local? Running the command below writes an empty output directory on HDFS:

MAHOUT_LOCAL=true $MAHOUT seqdirectory \
  -i mahout-work/reuters-out \
  -o mahout-work/reuters-out-seqdir \
  -c UTF-8 -chunk 5

I
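
One way to see where the seqdirectory output actually landed is to compare the local directory with the same path on HDFS. The sketch below is a rough check, not a fix: count_local_files is a hypothetical helper, the output path is copied from the command above, and the HDFS listing runs only if a hadoop client happens to be on the PATH.

```shell
#!/bin/sh
# Hypothetical helper: count regular files under a local directory;
# prints 0 when the directory does not exist.
count_local_files() {
  dir="$1"
  if [ -d "$dir" ]; then
    find "$dir" -type f | wc -l | tr -d ' '
  else
    echo 0
  fi
}

OUT="mahout-work/reuters-out-seqdir"   # output path from the command above
echo "local files under $OUT: $(count_local_files "$OUT")"

# Only query HDFS if the hadoop client is installed.
if command -v hadoop >/dev/null 2>&1; then
  hadoop fs -ls "$OUT" || echo "nothing found at $OUT on HDFS"
fi
```

If the local count is zero while the HDFS listing shows the directory, the job wrote to HDFS despite MAHOUT_LOCAL being set, which matches the behaviour described in the message.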