My bad. The example should not even have a -c parameter as it uses Canopy to populate the initial clusters and they go into a default directory. I will fix asap.

On 9/24/10 10:05 AM, Jeff Eastman wrote:
This error was likely caused by incorrect -c parameter. The error message was misleading. I committed a better message earlier this week. Synthetic control works reliably with k-Means when the arguments are given correctly. I think this can be closed.

On 9/24/10 8:34 AM, Sean Owen (JIRA) wrote:
[ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sean Owen updated MAHOUT-504:
-----------------------------

          Assignee: Robin Anil
     Fix Version/s: 0.5

Kmeans clustering error
-----------------------

                 Key: MAHOUT-504
                 URL: https://issues.apache.org/jira/browse/MAHOUT-504
             Project: Mahout
          Issue Type: Bug
            Reporter: Zhen Guo
            Assignee: Robin Anil
             Fix For: 0.5


I tried the Kmeans algorithm on the Synthetic Control data. The following error appears. I tried the Canopy algorithm, it is fine. This error is from Mapper. I am using Trunk. 10/09/20 19:40:06 INFO mapred.JobClient: Task Id : attempt_201008261432_1324_m_000000_0, Status : FAILED
java.lang.IllegalStateException: Cluster is empty!
at org.apache.mahout.clustering.kmeans.KMeansClusterMapper.setup(KMeansClusterMapper.java:57)
    at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
    at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
    at org.apache.hadoop.mapred.Child.main(Child.java:170)


Reply via email to