[jira] [Commented] (SPARK-20356) Spark sql group by returns incorrect results after join + distinct transformations

2017-04-18 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972544#comment-15972544 ] Ed Lee commented on SPARK-20356: really quite dangerous bug > Spark sql group by returns incorrect

[jira] [Created] (SPARK-21754) No Exception/Warn When Join Columns are Differing Types

2017-08-16 Thread Ed Lee (JIRA)
Ed Lee created SPARK-21754: -- Summary: No Exception/Warn When Join Columns are Differing Types Key: SPARK-21754 URL: https://issues.apache.org/jira/browse/SPARK-21754 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-21754) No Exception/Warn When Join Columns are Differing Types

2017-08-16 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-21754: --- Description: No Exception/Warn When Join Columns are Differing Types, which can lead to problematic join

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-07 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Fix Version/s: 2.2.0 Description: Hello encountered a filtering bug using 'isin' in pyspark sql on

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-07 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Environment: Ubuntu Xenial 16.04, Python 3.5 (was: Ubuntu Xenial 16.04) > pyspark.sql, filtering with

[jira] [Updated] (SPARK-20617) pyspark.sql filtering fails when using ~isin when there are nulls in column

2017-05-07 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Summary: pyspark.sql filtering fails when using ~isin when there are nulls in column (was: pyspark.sql,

[jira] [Updated] (SPARK-20617) pyspark.sql filtering fails when using ~isin when there are nulls in column

2017-05-07 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Created] (SPARK-20617) pyspark.sql, isin when columns contain null

2017-05-05 Thread Ed Lee (JIRA)
Ed Lee created SPARK-20617: -- Summary: pyspark.sql, isin when columns contain null Key: SPARK-20617 URL: https://issues.apache.org/jira/browse/SPARK-20617 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-20617) pyspark.sql, isin when columns contain null

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Summary: pyspark.sql, filtering with ~isin missing rows (was: pyspark.sql, ~isin when columns contain

[jira] [Updated] (SPARK-20617) pyspark.sql, ~isin when columns contain null (missing rows)

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Summary: pyspark.sql, ~isin when columns contain null (missing rows) (was: pyspark.sql, isin when columns

[jira] [Updated] (SPARK-20617) pyspark.sql, isin when columns contain null

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, isin when columns contain null

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, filtering with ~isin missing rows

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Updated] (SPARK-20617) pyspark.sql, isin when columns contain null

2017-05-05 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ed Lee updated SPARK-20617: --- Description: Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0, Ubuntu 16.04.

[jira] [Commented] (SPARK-20617) pyspark.sql filtering fails when using ~isin when there are nulls in column

2018-04-10 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16432463#comment-16432463 ] Ed Lee commented on SPARK-20617: Thank you for the clarification.  So conversely: {code:java}

[jira] [Commented] (SPARK-21163) DataFrame.toPandas should respect the data type

2018-04-14 Thread Ed Lee (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438535#comment-16438535 ] Ed Lee commented on SPARK-21163: Had a question: in Spark 2.2.1, if I do a .toPandas on a Spark DataFrame