[jira] [Commented] (SPARK-28182) Spark fails to download Hive 2.2+ jars from maven

2019-06-27 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-28182?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16874029#comment-16874029 ] Emlyn Corrin commented on SPARK-28182: I can work around it by adding --packages

[jira] [Created] (SPARK-28182) Spark fails to download Hive 2.2+ jars from maven

2019-06-27 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-28182: Summary: Spark fails to download Hive 2.2+ jars from maven Key: SPARK-28182 URL: https://issues.apache.org/jira/browse/SPARK-28182 Project: Spark Issue

[jira] [Commented] (SPARK-23549) Spark SQL unexpected behavior when comparing timestamp to date

2018-04-30 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-23549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16458511#comment-16458511 ] Emlyn Corrin commented on SPARK-23549: Will this be included in Spark 2.3.1? It only says 2.4.0

[jira] [Commented] (SPARK-24051) Incorrect results for certain queries using Java and Python APIs on Spark 2.3.0

2018-04-24 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16449513#comment-16449513 ] Emlyn Corrin commented on SPARK-24051: I've reshuffled the pyspark version to make it even clearer:

[jira] [Commented] (SPARK-24051) Incorrect results for certain queries using Java API on Spark 2.3.0

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16448390#comment-16448390 ] Emlyn Corrin commented on SPARK-24051: I've managed to reproduce this in {{pyspark}}: {code} from

[jira] [Updated] (SPARK-24051) Incorrect results for certain queries using Java and Python APIs on Spark 2.3.0

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-24051: Summary: Incorrect results for certain queries using Java and Python APIs on Spark 2.3.0 (was:

[jira] [Updated] (SPARK-24051) Incorrect results for certain queries using Java API on Spark 2.3.0

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-24051: Summary: Incorrect results for certain queries using Java API on Spark 2.3.0 (was: Incorrect

[jira] [Commented] (SPARK-24051) Incorrect results for certain queries in Java API on Spark 2.3.0

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16447854#comment-16447854 ] Emlyn Corrin commented on SPARK-24051: [~hyukjin.kwon] I expanded the title a bit, but feel free to

[jira] [Updated] (SPARK-24051) Incorrect results for certain queries in Java API on Spark 2.3.0

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-24051: Summary: Incorrect results for certain queries in Java API on Spark 2.3.0 (was: Incorrect

[jira] [Updated] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-24051: Description: I'm seeing Spark 2.3.0 return incorrect results for a certain (very specific)

[jira] [Commented] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16447847#comment-16447847 ] Emlyn Corrin commented on SPARK-24051: I tried again to reproduce it in the Scala API, keeping as

[jira] [Commented] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16447794#comment-16447794 ] Emlyn Corrin commented on SPARK-24051: I have tried before to reproduce it using the Scala API, and

[jira] [Updated] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-24051: Description: I'm seeing Spark 2.3.0 return incorrect results for a certain (very specific)

[jira] [Commented] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24051?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16447776#comment-16447776 ] Emlyn Corrin commented on SPARK-24051: Not at all, I was meaning to go back and improve it but

[jira] [Created] (SPARK-24051) Incorrect results

2018-04-23 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-24051: Summary: Incorrect results Key: SPARK-24051 URL: https://issues.apache.org/jira/browse/SPARK-24051 Project: Spark Issue Type: Bug Components: SQL

[jira] [Created] (SPARK-24033) LAG Window function broken in Spark 2.3

2018-04-20 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-24033: Summary: LAG Window function broken in Spark 2.3 Key: SPARK-24033 URL: https://issues.apache.org/jira/browse/SPARK-24033 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-18971) Netty issue may cause the shuffle client hang

2017-03-29 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15946882#comment-15946882 ] Emlyn Corrin commented on SPARK-18971: Will this fix go into Spark 2.1.1?

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2017-02-23 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15880338#comment-15880338 ] Emlyn Corrin commented on SPARK-8480: If anyone just wants a way to identify the RDDs in the storage

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2017-01-18 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15828512#comment-15828512 ] Emlyn Corrin commented on SPARK-8480: [~skp33] OK, I can see that could be useful, but I think it's

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2017-01-18 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15827932#comment-15827932 ] Emlyn Corrin commented on SPARK-8480: [~skp33] with this change, you can do similar: {code} scala>

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2017-01-17 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15826360#comment-15826360 ] Emlyn Corrin commented on SPARK-8480: [~skp33] I'm not sure what you mean, maybe a code snippet would

[jira] [Commented] (SPARK-13210) NPE in Sort

2016-12-16 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-13210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15754237#comment-15754237 ] Emlyn Corrin commented on SPARK-13210: I've just had this suspiciously similar stack trace in Spark

[jira] [Commented] (SPARK-18172) AnalysisException in first/last during aggregation

2016-11-17 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15673931#comment-15673931 ] Emlyn Corrin commented on SPARK-18172: I'm not sure I've got the time to build from source at the

[jira] [Commented] (SPARK-18172) AnalysisException in first/last during aggregation

2016-11-16 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15671227#comment-15671227 ] Emlyn Corrin commented on SPARK-18172: It occurs on 2.0.1 and 2.0.2 (on Mac, installed via

[jira] [Updated] (SPARK-18300) ClassCastException during count distinct

2016-11-11 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Emlyn Corrin updated SPARK-18300: Component/s: SQL

[jira] [Commented] (SPARK-18172) AnalysisException in first/last during aggregation

2016-11-07 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15644358#comment-15644358 ] Emlyn Corrin commented on SPARK-18172: I can also reproduce it in Spark SQL with the following

[jira] [Created] (SPARK-18300) ClassCastException during count distinct

2016-11-07 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-18300: Summary: ClassCastException during count distinct Key: SPARK-18300 URL: https://issues.apache.org/jira/browse/SPARK-18300 Project: Spark Issue Type: Bug

[jira] [Comment Edited] (SPARK-16648) LAST_VALUE(FALSE) OVER () throws IndexOutOfBoundsException

2016-10-30 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613656#comment-15613656 ] Emlyn Corrin edited comment on SPARK-16648 at 10/30/16 10:07 AM: Edit:

[jira] [Created] (SPARK-18172) AnalysisException in first/last during aggregation

2016-10-30 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-18172: Summary: AnalysisException in first/last during aggregation Key: SPARK-18172 URL: https://issues.apache.org/jira/browse/SPARK-18172 Project: Spark Issue

[jira] [Comment Edited] (SPARK-16648) LAST_VALUE(FALSE) OVER () throws IndexOutOfBoundsException

2016-10-27 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613656#comment-15613656 ] Emlyn Corrin edited comment on SPARK-16648 at 10/27/16 11:33 PM: Since

[jira] [Commented] (SPARK-16648) LAST_VALUE(FALSE) OVER () throws IndexOutOfBoundsException

2016-10-27 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-16648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15613656#comment-15613656 ] Emlyn Corrin commented on SPARK-16648: Since Spark 2.0.1, the following snippet fails (I believe it

[jira] [Created] (SPARK-15846) Allow passing a PrintStream to DataFrame.explain

2016-06-09 Thread Emlyn Corrin (JIRA)
Emlyn Corrin created SPARK-15846: Summary: Allow passing a PrintStream to DataFrame.explain Key: SPARK-15846 URL: https://issues.apache.org/jira/browse/SPARK-15846 Project: Spark Issue Type:

[jira] [Commented] (SPARK-8480) Add setName for Dataframe

2016-02-19 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-8480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15153988#comment-15153988 ] Emlyn Corrin commented on SPARK-8480: This would be really useful. We have a fairly large Spark

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-02-16 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15148472#comment-15148472 ] Emlyn Corrin commented on SPARK-9740: Thanks, the {{registerTempTable}} + {{sql}} workaround is fine

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-02-16 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15148458#comment-15148458 ] Emlyn Corrin commented on SPARK-9740: Any update on this? Should I open a new issue for it so it

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-26 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116977#comment-15116977 ] Emlyn Corrin commented on SPARK-9740: I've put together a minimal example to demonstrate the problem:

[jira] [Comment Edited] (SPARK-9740) first/last aggregate NULL behavior

2016-01-26 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116977#comment-15116977 ] Emlyn Corrin edited comment on SPARK-9740 at 1/26/16 9:32 AM: I've put

[jira] [Comment Edited] (SPARK-9740) first/last aggregate NULL behavior

2016-01-26 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116977#comment-15116977 ] Emlyn Corrin edited comment on SPARK-9740 at 1/26/16 9:33 AM: I've put

[jira] [Comment Edited] (SPARK-9740) first/last aggregate NULL behavior

2016-01-26 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116977#comment-15116977 ] Emlyn Corrin edited comment on SPARK-9740 at 1/26/16 9:40 AM: I've put

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115904#comment-15115904 ] Emlyn Corrin commented on SPARK-9740: Thanks for the help. I've tried with {{callUDF}} and that gives

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116297#comment-15116297 ] Emlyn Corrin commented on SPARK-9740: Here's the stack trace, it fails during compilation: {code}

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15116329#comment-15116329 ] Emlyn Corrin commented on SPARK-9740: Version 1.6.0

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15114935#comment-15114935 ] Emlyn Corrin commented on SPARK-9740: Thanks [~yhuai], that's just what I was looking for!

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-25 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15115115#comment-15115115 ] Emlyn Corrin commented on SPARK-9740: [~yhuai] actually, that doesn't seem to work, when I try it, I

[jira] [Commented] (SPARK-9740) first/last aggregate NULL behavior

2016-01-21 Thread Emlyn Corrin (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-9740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15110688#comment-15110688 ] Emlyn Corrin commented on SPARK-9740: How do you use FIRST/LAST from the Java API with ignoreNulls