[ 
https://issues.apache.org/jira/browse/SPARK-8345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14741618#comment-14741618
 ] 

Simeon Simeonov commented on SPARK-8345:
----------------------------------------

This would be very nice, especially as it can leverage existing UDFs.

> Add an SQL node as a feature transformer
> ----------------------------------------
>
>                 Key: SPARK-8345
>                 URL: https://issues.apache.org/jira/browse/SPARK-8345
>             Project: Spark
>          Issue Type: Sub-task
>          Components: ML
>            Reporter: Xiangrui Meng
>            Assignee: Yanbo Liang
>             Fix For: 1.6.0
>
>
> Some simple feature transformations can take leverage on SQL operators. Users 
> do not need to create an ML transformer for each of them. We can have an SQL 
> transformer that executes an SQL command which operates on the input 
> dataframe.
> {code}
> val sql = new SQL()
>   .setStatement("SELECT *, length(text) AS text_length FROM __THIS__")
> {code}
> where "__THIS__" will be replaced by a temp table that represents the 
> DataFrame.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to