Github user MrMathias commented on a diff in the pull request: https://github.com/apache/spark/pull/20759#discussion_r173563217 --- Diff: docs/ml-collaborative-filtering.md --- @@ -19,6 +19,7 @@ by a small set of latent factors that can be used to predict missing entries. algorithm to learn these latent factors. The implementation in `spark.ml` has the following parameters: +* *checkpointInterval* helps with recovery when nodes fail and StackOverflow exceptions caused by long lineage. **Will be silently ignored if *SparkContext.CheckpointDir* is not set.** (defaults to 10). --- End diff -- Checkpointing exists to better deal with node failure and decrease memory consumption from lineage. This wording is taken from the parameter-comment in the ALS implementation itself, so I think it is fitting. This list of parameters is both a sub-set and unordered.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org