[ 
https://issues.apache.org/jira/browse/MAHOUT-1593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14076149#comment-14076149
 ] 

Suneel Marthi edited comment on MAHOUT-1593 at 7/28/14 11:41 AM:
-----------------------------------------------------------------

[~jaehoon13.ko] Mahout 0.9 doesn't support Hadoop 2.x, hence the issues you are 
seeing. Nevertheless, these need to be fixed in the present trunk for Hadoop 2.x, 
and thanks for submitting a patch.

Since we are now using Git for source control, could you create a Git pull 
request instead of submitting a patch? 
Also, I am assuming that you are generating the patch against the present trunk 
and not from the 0.9 codebase, correct?


was (Author: smarthi):
[~jaehoon13.ko] Mahout 0.9 doesn't support Hadoop 2.x, hence the issues you are 
seeing. Nevertheless, these need to be fixed in the present trunk for Hadoop 2.x.

Since we are now using Git for source control, could you create a Git pull 
request instead of submitting a patch? 
Also, I am assuming that you are generating the patch against the present trunk 
and not from the 0.9 codebase, correct?

> cluster-reuters.sh does not work complaining java.lang.IllegalStateException
> ----------------------------------------------------------------------------
>
>                 Key: MAHOUT-1593
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1593
>             Project: Mahout
>          Issue Type: Bug
>          Components: Examples
>    Affects Versions: 0.9
>         Environment: Hadoop version: 2.4.0.2.1.1.0-385
> Git hash: 2b65475c3ab682ebd47cffdc6b502698799cd2c8 (trunk)
>            Reporter: jaehoon ko
>            Priority: Minor
>              Labels: patch
>             Fix For: 1.0
>
>         Attachments: MAHOUT-1593.patch
>
>
> When I choose "kmeans clustering" in cluster-reuters.sh, clusterdump 
> fails with a java.lang.IllegalStateException, as follows:
> {code:borderStyle=solid}
> Exception in thread "main" java.lang.IllegalStateException: /tmp/mahout-work-user/reuters-kmeans/clusters-*-final
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterable.iterator(SequenceFileDirValueIterable.java:78)
>         at org.apache.mahout.clustering.evaluation.ClusterEvaluator.loadClusters(ClusterEvaluator.java:93)
>         at org.apache.mahout.clustering.evaluation.ClusterEvaluator.<init>(ClusterEvaluator.java:81)
>         at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:208)
>         at org.apache.mahout.utils.clustering.ClusterDumper.run(ClusterDumper.java:157)
>         at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:101)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:72)
>         at org.apache.hadoop.util.ProgramDriver.run(ProgramDriver.java:145)
>         at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:153)
>         at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:195)
>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>         at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>         at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>         at java.lang.reflect.Method.invoke(Method.java:606)
>         at org.apache.hadoop.util.RunJar.main(RunJar.java:212)
> Caused by: java.io.FileNotFoundException: File /tmp/mahout-work-user/reuters-kmeans/clusters-*-final does not exist.
>         at org.apache.hadoop.hdfs.DistributedFileSystem.listStatusInternal(DistributedFileSystem.java:654)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.access$600(DistributedFileSystem.java:102)
>         at org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:712)
>         at org.apache.hadoop.hdfs.DistributedFileSystem$14.doCall(DistributedFileSystem.java:708)
>         at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
>         at org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:708)
>         at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1483)
>         at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:1523)
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterator.<init>(SequenceFileDirValueIterator.java:70)
>         at org.apache.mahout.common.iterator.sequencefile.SequenceFileDirValueIterable.iterator(SequenceFileDirValueIterable.java:76)
>         ... 18 more
> {code}
> Other clustering options run well.
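
The FileNotFoundException in the trace names the path with its `*` still in it, which suggests the wildcard reached a directory listing call without being expanded first. The sketch below illustrates that distinction only by analogy: it uses java.nio on the local filesystem, not Hadoop's FileSystem API, and it is not the change in the attached patch. The class name `GlobSketch`, the helper names, and the directory name `clusters-3-final` are hypothetical.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

class GlobSketch {
    // Treats `name` as a literal entry name: true only if an entry in
    // `parent` is called exactly that, wildcard characters included.
    static boolean hasEntryNamed(Path parent, String name) throws IOException {
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(parent)) {
            for (Path entry : stream) {
                if (entry.getFileName().toString().equals(name)) {
                    return true;
                }
            }
        }
        return false;
    }

    // Treats `glob` as a pattern: returns every entry in `parent` whose
    // file name matches it.
    static List<Path> expand(Path parent, String glob) throws IOException {
        List<Path> matches = new ArrayList<>();
        try (DirectoryStream<Path> stream = Files.newDirectoryStream(parent, glob)) {
            for (Path entry : stream) {
                matches.add(entry);
            }
        }
        return matches;
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical layout: one final-iteration cluster directory.
        Path work = Files.createTempDirectory("reuters-kmeans");
        Files.createDirectory(work.resolve("clusters-3-final"));

        // Literal lookup: no entry is actually named "clusters-*-final".
        System.out.println(hasEntryNamed(work, "clusters-*-final"));

        // Glob expansion: the pattern resolves to the real directory.
        System.out.println(expand(work, "clusters-*-final"));
    }
}
```

If the same distinction applies here, whatever consumes the clusterdump input path would need to expand the pattern to a concrete directory before listing it.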



--
This message was sent by Atlassian JIRA
(v6.2#6252)
