GitHub user ddna1021 opened a pull request: https://github.com/apache/spark/pull/21179
Update StreamingKMeans.scala

## What changes were proposed in this pull request?

I think the "n_t+t" in the following code may be wrong. It should be "n_t+1", which is the number of points assigned to the cluster after it finishes mini-batch number t+1.

     * <blockquote>
     * $$
     * \begin{align}
     * c_t+1 &= [(c_t * n_t * a) + (x_t * m_t)] / [n_t + m_t] \\
     * n_t+1 &= n_t * a + m_t
     * \end{align}
     * $$
     * </blockquote>

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/ddna1021/spark master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/21179.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #21179

----
commit e061dce36220809dd65df6a2da9e47c58e0811e8
Author: Fan Donglai <ddna_1022@...>
Date:   2018-04-27T08:25:30Z

    Update StreamingKMeans.scala

----
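
For readers who want to see why the left-hand side should be indexed t+1, here is a minimal sketch of the two update rules quoted in the description, for a one-dimensional center. The object `DecayedClusterUpdate`, the case class `Cluster`, and the method `update` are hypothetical names chosen for illustration and are not part of Spark's API. The formulas are transcribed literally from the quoted comment, including its `n_t + m_t` denominator, so this is not a claim about what the Spark implementation actually computes.

```scala
// Illustrative only: hypothetical names, not Spark's StreamingKMeans API.
object DecayedClusterUpdate {

  /** State of one cluster at step t: center c_t and weight n_t. */
  final case class Cluster(center: Double, weight: Double)

  /**
   * Applies the update rules as written in the quoted comment.
   *
   * @param prev       cluster state at step t (c_t, n_t)
   * @param batchMean  x_t, the mean of the batch points assigned to this cluster
   * @param batchCount m_t, the number of batch points assigned to this cluster
   * @param decay      a, the decay factor applied to the old weight
   * @return cluster state at step t+1 (c_t+1, n_t+1)
   */
  def update(prev: Cluster, batchMean: Double, batchCount: Double, decay: Double): Cluster = {
    val denominator = prev.weight + batchCount
    require(denominator > 0.0, "n_t + m_t must be positive")

    // c_t+1 = [(c_t * n_t * a) + (x_t * m_t)] / [n_t + m_t]
    val newCenter = (prev.center * prev.weight * decay + batchMean * batchCount) / denominator

    // n_t+1 = n_t * a + m_t  (the line whose index the pull request corrects)
    val newWeight = prev.weight * decay + batchCount

    Cluster(newCenter, newWeight)
  }
}
```

Each call consumes the state at step t and returns the state at step t+1, which is why both left-hand sides carry the index t+1. With `decay = 1.0` the rule reduces to an ordinary running mean and a plain running count.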