[ 
https://issues.apache.org/jira/browse/SPARK-8573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yin Huai resolved SPARK-8573.
-----------------------------
    Resolution: Duplicate

> For PySpark's DataFrame API, we need to throw exceptions when users try to 
> use and/or/not
> -----------------------------------------------------------------------------------------
>
>                 Key: SPARK-8573
>                 URL: https://issues.apache.org/jira/browse/SPARK-8573
>             Project: Spark
>          Issue Type: Sub-task
>          Components: PySpark, SQL
>    Affects Versions: 1.3.0
>            Reporter: Yin Huai
>            Assignee: Davies Liu
>            Priority: Critical
>
> In PySpark's DataFrame API, we have
> {code}
> # `and`, `or`, `not` cannot be overloaded in Python,
> # so use bitwise operators as boolean operators
> __and__ = _bin_op('and')
> __or__ = _bin_op('or')
> __invert__ = _func_op('not')
> __rand__ = _bin_op("and")
> __ror__ = _bin_op("or")
> {code}
> Right now, users can still use operators like {{and}}, which can cause very 
> confusing behavior. We need to throw an error when users try to use them and 
> tell them the right way to write the expression (the bitwise operators 
> {{&}}, {{|}}, and {{~}}).
> For example, 
> {code}
> df = sqlContext.range(1, 10)
> df.id > 5 or df.id < 10
> Out[30]: Column<(id > 5)>
> df.id > 5 and df.id < 10
> Out[31]: Column<(id < 10)>
> {code}
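
The outputs above follow from Python's short-circuit rules: an object that defines no truth-value hook is always truthy, so `a or b` returns `a` and `a and b` returns `b`. Below is a minimal sketch of the requested behavior, assuming a toy `Column` class that is only a stand-in for illustration and not PySpark's actual implementation. The error message wording is likewise an assumption. Raising from `__bool__` (`__nonzero__` on Python 2) is one way to make `and`/`or`/`not` fail loudly while `&`, `|`, and `~` keep working.

```python
class Column(object):
    """Toy stand-in for a DataFrame column expression (hypothetical, not PySpark's class)."""

    def __init__(self, expr):
        self.expr = expr

    def __gt__(self, other):
        return Column("(%s > %s)" % (self.expr, other))

    def __lt__(self, other):
        return Column("(%s < %s)" % (self.expr, other))

    # Bitwise operators stand in for the boolean operators, as in the issue.
    def __and__(self, other):
        return Column("(%s AND %s)" % (self.expr, other.expr))

    def __or__(self, other):
        return Column("(%s OR %s)" % (self.expr, other.expr))

    def __invert__(self):
        return Column("(NOT %s)" % self.expr)

    # `and`, `or`, and `not` all ask for the truth value of their operand,
    # so raising here turns the silent misbehavior into an explicit error.
    def __bool__(self):
        raise ValueError(
            "Cannot convert column into bool: use '&' for 'and', "
            "'|' for 'or', '~' for 'not' when building boolean expressions."
        )

    __nonzero__ = __bool__  # Python 2 name for the same hook

    def __repr__(self):
        return "Column<%s>" % self.expr


col_id = Column("id")

# Supported spelling: parenthesize each comparison, then combine with & or |.
print((col_id > 5) & (col_id < 10))

# Unsupported spelling: now raises instead of silently dropping one operand.
try:
    col_id > 5 and col_id < 10
except ValueError as err:
    print(err)
```

The parentheses around each comparison matter because `&` and `|` bind more tightly than `>` and `<` in Python.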



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
