GitHub user ddna1021 opened a pull request:
https://github.com/apache/spark/pull/21179
Update StreamingKMeans.scala
## What changes were proposed in this pull request?
I think the ân_t+tâ in the following code may be wrong, it shoud be
ân_t+1â that means is the number of points to the cluster after it finish
the no.t+1 min-batch.
*
*$$
*\begin{align}
* c_t+1 &= [(c_t * n_t * a) + (x_t * m_t)] / [n_t + m_t] \\
* n_t+1 &= n_t * a + m_t
*\end{align}
*$$
*
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ddna1021/spark master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21179.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21179
commit e061dce36220809dd65df6a2da9e47c58e0811e8
Author: Fan Donglai
Date: 2018-04-27T08:25:30Z
Update StreamingKMeans.scala
---
-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org