[ https://issues.apache.org/jira/browse/SPARK-957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen resolved SPARK-957.
-----------------------------
Resolution: Not a Problem
> The problem that repeated computation among iterations
> ------------------------------------------------------
>
> Key: SPARK-957
> URL: https://issues.apache.org/jira/browse/SPARK-957
> Project: Spark
> Issue Type: Bug
> Components: Examples
> Affects Versions: 0.7.3
> Reporter: caizhua
>
> For the LDA model, making each document a single RDD record is quite slow,
> so we tried making the RDD a set of blocks, where each block holds a subset
> of the documents. However, when we run the program, we find that a lot of
> computation is repeated across iterations. Basically, when we reach the ith
> iteration, all the jobs that ran in iterations 0 to (i-1) are repeated.
> Likewise, the jobs in the ith iteration will be repeated in the (i+1)th
> iteration. In total, if there are m iterations, the jobs in the ith
> iteration are repeated in every later iteration.
> However, the result is still correct. :)
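
One possible reading of the report, assuming the dataset produced in each iteration was never persisted: with lazy, lineage-based evaluation, every action re-runs all the steps it depends on, so earlier iterations are recomputed while the final result stays correct. The sketch below is a toy model in plain Python, not the Spark API; `LazyDataset`, `transform`, and `run` are hypothetical names chosen for illustration. In Spark itself, `RDD.cache()` or `RDD.persist()` is the usual way to keep an intermediate result.

```python
class LazyDataset:
    """Toy stand-in for a lazily evaluated dataset (hypothetical, not Spark's RDD)."""

    def __init__(self, compute, parent=None):
        self._compute = compute    # function: parent's data -> this dataset's data
        self._parent = parent      # upstream dataset, or None for the source
        self._persist = False
        self._cached = None

    def transform(self, fn):
        # Lazy: only records the dependency, nothing is computed here.
        return LazyDataset(fn, parent=self)

    def cache(self):
        # Ask for the result to be kept after the first evaluation.
        self._persist = True
        return self

    def collect(self):
        # "Action": evaluates this dataset by walking back through its lineage.
        if self._cached is not None:
            return self._cached
        parent_data = self._parent.collect() if self._parent is not None else None
        data = self._compute(parent_data)
        if self._persist:
            self._cached = data
        return data


def run(iterations, use_cache):
    calls = []  # index of every iteration step that actually executes

    def make_step(i):
        def step(data):
            calls.append(i)
            return [x + 1 for x in data]
        return step

    current = LazyDataset(lambda _: [0, 0, 0])
    result = None
    for i in range(1, iterations + 1):
        current = current.transform(make_step(i))
        if use_cache:
            current.cache()
        result = current.collect()  # one action per iteration
    return calls, result


uncached_calls, uncached_result = run(4, use_cache=False)
cached_calls, cached_result = run(4, use_cache=True)
print(len(uncached_calls), len(cached_calls))   # prints "10 4"
print(uncached_result == cached_result)         # prints "True"
```

With 4 iterations, the uncached run executes 1 + 2 + 3 + 4 = 10 steps while the cached run executes 4, and both return the same data, which mirrors the "repeated but still correct" behaviour described in the report.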