[
https://issues.apache.org/jira/browse/SPARK-20356?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15972544#comment-15972544
]
Ed Lee commented on SPARK-20356:
really quite dangerous bug
> Spark sql group by returns incorrect
Ed Lee created SPARK-21754:
--
Summary: No Exception/Warn When Join Columns are Differing Types
Key: SPARK-21754
URL: https://issues.apache.org/jira/browse/SPARK-21754
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-21754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-21754:
---
Description:
No Exception/Warn When Join Columns are Differing Types, which can lead to
problematic join
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Fix Version/s: 2.2.0
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Environment: Ubuntu Xenial 16.04, Python 3.5 (was: Ubuntu Xenial 16.04)
> pyspark.sql, filtering with
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Summary: pyspark.sql filtering fails when using ~isin when there are nulls
in column (was: pyspark.sql,
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
Ed Lee created SPARK-20617:
--
Summary: pyspark.sql, isin when columns contain null
Key: SPARK-20617
URL: https://issues.apache.org/jira/browse/SPARK-20617
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Summary: pyspark.sql, filtering with ~isin missing rows (was:
pyspark.sql, ~isin when columns contain
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Summary: pyspark.sql, ~isin when columns contain null (missing rows)
(was: pyspark.sql, isin when columns
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ed Lee updated SPARK-20617:
---
Description:
Hello encountered a filtering bug using 'isin' in pyspark sql on version 2.2.0,
Ubuntu 16.04.
[
https://issues.apache.org/jira/browse/SPARK-20617?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16432463#comment-16432463
]
Ed Lee commented on SPARK-20617:
Thank you for the clarification. So conversely:
{code:java}
[
https://issues.apache.org/jira/browse/SPARK-21163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16438535#comment-16438535
]
Ed Lee commented on SPARK-21163:
Had a question: in Spark 2.2.1, if I do a .toPandas on a Spark DataFrame
21 matches
Mail list logo