黄龙龙 created SPARK-23211:
---------------------------

             Summary: SparkR MLlib randomFroest  parameter problem
                 Key: SPARK-23211
                 URL: https://issues.apache.org/jira/browse/SPARK-23211
             Project: Spark
          Issue Type: Bug
          Components: SparkR
    Affects Versions: 2.1.0
         Environment: {code:R}
sdf_list <- randomSplit(train_data, rep(7, 3), 10086) 

model <- spark.randomForest(
  sdf_list[[1]],  
  forward_count ~ .,   
  type          = "regression",   
  path          = paste0("./predict/model/randomForest_", x),   
  overwrite     = TRUE,  
  newData       = sdf_list[[2]])
{code}
train_data is a SparkDataFrame
The notes of parameter newData is "a SparkDataFrame for testing."
The notes of parameter path is "The directory where the model is saved."
These all don't work normaly.

why?

            Reporter: 黄龙龙


spark.randomForest() and randomSplit() problem



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to