[ 
https://issues.apache.org/jira/browse/SPARK-12225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16005904#comment-16005904
 ] 

Liang-Chi Hsieh commented on SPARK-12225:
-----------------------------------------

Without knowing this issue, I've implemented a {{withColumns}} API in Dataset 
in SPARK-20542. It benefits ML usage a lot and gets better performance results. 
For ML pipelines which can chain dozens of stages, if we do withColumn in each 
stage, the total cost grows big fast.

> Support adding or replacing multiple columns at once in DataFrame API
> ---------------------------------------------------------------------
>
>                 Key: SPARK-12225
>                 URL: https://issues.apache.org/jira/browse/SPARK-12225
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 1.5.2
>            Reporter: Sun Rui
>
> Currently, withColumn() method of DataFrame supports adding or replacing only 
> single column. It would be convenient to support adding or replacing multiple 
> columns at once.
> Also withColumnRenamed() supports renaming only single column.It would also 
> be convenient to support renaming multiple columns at once.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to