Saurabh Agrawal created SPARK-21476:
---------------------------------------

             Summary: RandomForest classification model not using broadcast in 
transform
                 Key: SPARK-21476
                 URL: https://issues.apache.org/jira/browse/SPARK-21476
             Project: Spark
          Issue Type: Bug
          Components: ML
    Affects Versions: 2.2.0
            Reporter: Saurabh Agrawal


I notice significant task deserialization latency while running prediction with 
pipelines using RandomForestClassificationModel. While digging into the source, 
found that the transform method in RandomForestClassificationModel binds to its 
parent ProbabilisticClassificationModel and the only concrete definition that 
RandomForestClassificationModel provides and which is actually used in 
transform is that of predictRaw. Broadcasting is not being used in predictRaw.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to