[ https://issues.apache.org/jira/browse/SPARK-20634?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen resolved SPARK-20634. ------------------------------- Resolution: Invalid I can't understand what this is describing; please read http://spark.apache.org/contributing.html . This doesn't specify any particular problem. You would not expect k-means results to be the same each time. It's stochastic. > result of MLlib KMeans cluster is not stabilize > ----------------------------------------------- > > Key: SPARK-20634 > URL: https://issues.apache.org/jira/browse/SPARK-20634 > Project: Spark > Issue Type: Bug > Components: MLlib > Affects Versions: 2.0.2 > Environment: Windows 10 > spark 2.0.2 standalone > spyder 3.1.4 > Anaconda 4.3.0 > python 3.5.2 > Reporter: Simon.J > Priority: Critical > > 1.Get a DataFrame through python with Cx_Oracle lib. > 2.Start a local Spark Session. > 3.Convert the dataset for Kmeansmodel train. > 4.Train the KMeans model and predict the same data.just set K =3 > 5.Get the ClassifierFeature of the KMeans model'predict. > 6.Get the count of every ClassifierFeature. > 7.Loop 4-6 for 20 times. > 8.Compare the result of every time. > 9.Find the KMeans result dose not stabilize. > 10.The same dataset and param for ML package'KMeans, its result is the same. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org