Repository: spark
Updated Branches:
  refs/heads/master 1995c2e64 -> 111a62474


[SPARK-14147][ML][SPARKR] SparkR predict should not output feature column

## What changes were proposed in this pull request?
SparkR does not support type of vector which is the default type of feature 
column in ML. R predict also does not output intermediate feature column. So 
SparkR ```predict``` should not output feature column. In this PR, I only fix 
this issue for ```naiveBayes``` and ```survreg```. ```kmeans``` has the right 
code route already and  ```glm``` will be fixed at SparkRWrapper 
refactor(#12294).

## How was this patch tested?
No new tests.

cc mengxr shivaram

Author: Yanbo Liang <yblia...@gmail.com>

Closes #11958 from yanboliang/spark-14147.


Project: http://git-wip-us.apache.org/repos/asf/spark/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/111a6247
Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/111a6247
Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/111a6247

Branch: refs/heads/master
Commit: 111a62474a2fb7f4e7f19fcfb8efaae37aa40400
Parents: 1995c2e
Author: Yanbo Liang <yblia...@gmail.com>
Authored: Tue Apr 12 11:34:40 2016 -0700
Committer: Xiangrui Meng <m...@databricks.com>
Committed: Tue Apr 12 11:34:40 2016 -0700

----------------------------------------------------------------------
 .../org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala     | 2 +-
 .../src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala | 4 +++-
 2 files changed, 4 insertions(+), 2 deletions(-)
----------------------------------------------------------------------


http://git-wip-us.apache.org/repos/asf/spark/blob/111a6247/mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala
----------------------------------------------------------------------
diff --git 
a/mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala 
b/mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala
index 2ae4115..7835468 100644
--- 
a/mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala
+++ 
b/mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala
@@ -44,7 +44,7 @@ private[r] class AFTSurvivalRegressionWrapper private (
   }
 
   def transform(dataset: Dataset[_]): DataFrame = {
-    pipeline.transform(dataset)
+    pipeline.transform(dataset).drop(aftModel.getFeaturesCol)
   }
 }
 

http://git-wip-us.apache.org/repos/asf/spark/blob/111a6247/mllib/src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala
----------------------------------------------------------------------
diff --git a/mllib/src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala 
b/mllib/src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala
index 2cd709d..b17207e 100644
--- a/mllib/src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala
+++ b/mllib/src/main/scala/org/apache/spark/ml/r/NaiveBayesWrapper.scala
@@ -37,7 +37,9 @@ private[r] class NaiveBayesWrapper private (
   lazy val tables: Array[Double] = naiveBayesModel.theta.toArray.map(math.exp)
 
   def transform(dataset: Dataset[_]): DataFrame = {
-    pipeline.transform(dataset).drop(PREDICTED_LABEL_INDEX_COL)
+    pipeline.transform(dataset)
+      .drop(PREDICTED_LABEL_INDEX_COL)
+      .drop(naiveBayesModel.getFeaturesCol)
   }
 }
 


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

Reply via email to