zhengruifeng created SPARK-22737:
------------------------------------

             Summary: Simplify OneVsRest transform
                 Key: SPARK-22737
                 URL: https://issues.apache.org/jira/browse/SPARK-22737
             Project: Spark
          Issue Type: Improvement
          Components: ML
    Affects Versions: 2.3.0
            Reporter: zhengruifeng

The current implementation of OneVsRest#transform is over-complicated: it sequentially updates an accumulated column, once per class model. By computing the prediction directly in a single UDF, we obtain a speedup of at least 2x. In an extreme case with 20 classes, the speedup is about 14x. The test code and performance comparison details are in the corresponding PR.
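
To illustrate the difference between the two strategies, here is a minimal plain-Python sketch. It is not the Spark code from the PR and does not use DataFrames or UDFs; the names `Scorer`, `predict_accumulated`, and `predict_direct` are hypothetical, and each "scorer" stands in for one fitted binary model. It only shows the shape of the computation: one pass over the data per class versus a single pass that scores all classes and takes the argmax.

```python
from typing import Callable, List, Sequence

# Hypothetical stand-in for a fitted binary classifier: maps a feature
# vector to a raw score for "this class vs. the rest".
Scorer = Callable[[Sequence[float]], float]


def predict_accumulated(rows: List[Sequence[float]], scorers: List[Scorer]) -> List[int]:
    """One pass over the data per class, updating an accumulated score list."""
    accumulated: List[List[float]] = [[] for _ in rows]  # plays the role of the accumulated column
    for scorer in scorers:
        # Each iteration rewrites the accumulator for every row.
        accumulated = [acc + [scorer(row)] for acc, row in zip(accumulated, rows)]
    # After the last class, pick the index of the highest score per row.
    return [max(range(len(acc)), key=acc.__getitem__) for acc in accumulated]


def predict_direct(rows: List[Sequence[float]], scorers: List[Scorer]) -> List[int]:
    """Single pass: score every class for a row and take the argmax at once."""
    def predict_one(row: Sequence[float]) -> int:
        scores = [scorer(row) for scorer in scorers]
        return max(range(len(scores)), key=scores.__getitem__)

    return [predict_one(row) for row in rows]


# Toy data: three classes with simple linear scorers (assumed for illustration).
rows = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
scorers = [
    lambda x: x[0],            # class 0
    lambda x: x[1],            # class 1
    lambda x: -(x[0] + x[1]),  # class 2
]

assert predict_accumulated(rows, scorers) == predict_direct(rows, scorers)
```

Both functions return the same predictions; the second avoids rebuilding an intermediate per-row accumulator once for every class, which is the kind of overhead the proposed change aims to remove.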