GitHub user zhengruifeng closed the pull request at:
https://github.com/apache/spark/pull/18154
---
GitHub user hhbyyh commented on a diff in the pull request:
https://github.com/apache/spark/pull/18154#discussion_r121836024
--- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/CountVectorizer.scala ---
@@ -154,13 +155,19 @@ class CountVectorizer @Since("1.5.0")
GitHub user zhengruifeng opened a pull request:
https://github.com/apache/spark/pull/18154
[SPARK-20932][ML]CountVectorizer support handle persistence
## What changes were proposed in this pull request?
Unpersist the RDDs `input` and `wordCounts` once the computation has finished.
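
For readers unfamiliar with the pattern, here is a minimal sketch of persisting intermediate RDDs and releasing them after use. It is not the code from this pull request: the object `PersistenceSketch`, the method `topTerms`, the `handlePersistence` flag, and the choice of `MEMORY_AND_DISK` are assumptions made for illustration. Only the RDD names `input` and `wordCounts` come from the description above.

```scala
import org.apache.spark.rdd.RDD
import org.apache.spark.storage.StorageLevel

// Hypothetical helper for illustration; not the CountVectorizer implementation.
object PersistenceSketch {

  /** Returns the number of distinct terms and the `n` most frequent terms. */
  def topTerms(input: RDD[Seq[String]], n: Int): (Long, Array[(String, Long)]) = {
    // Assumption: cache `input` only if the caller has not cached it already,
    // so that we never unpersist data the caller still relies on.
    val handlePersistence = input.getStorageLevel == StorageLevel.NONE
    if (handlePersistence) input.persist(StorageLevel.MEMORY_AND_DISK)

    // `wordCounts` feeds two actions below, so caching avoids recomputing it.
    val wordCounts: RDD[(String, Long)] = input
      .flatMap(tokens => tokens.map(term => (term, 1L)))
      .reduceByKey(_ + _)
      .persist(StorageLevel.MEMORY_AND_DISK)

    try {
      val distinctTerms = wordCounts.count()
      val mostFrequent = wordCounts.top(n)(Ordering.by[(String, Long), Long](_._2))
      (distinctTerms, mostFrequent)
    } finally {
      // Release cached blocks even if one of the actions above fails.
      wordCounts.unpersist()
      if (handlePersistence) input.unpersist()
    }
  }
}
```

Wrapping the actions in `try`/`finally` is one way to make sure the cached blocks are freed on failure; whether the actual change does this is not stated in the description.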
## How was