[jira] [Commented] (SPARK-27039) toPandas with Arrow swallows maxResultSize errors

2020-04-23 Thread peay (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17090352#comment-17090352 ] peay commented on SPARK-27039: -- [~hyukjin.kwon] do you know if this was eventually back ported in 2.4.x?

[jira] [Commented] (SPARK-27039) toPandas with Arrow swallows maxResultSize errors

2019-03-07 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16786658#comment-16786658 ] peay commented on SPARK-27039: -- For reference, I've realized you can also get an incomplete but non-empty

[jira] [Commented] (SPARK-24624) Can not mix vectorized and non-vectorized UDFs

2019-03-06 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16786456#comment-16786456 ] peay commented on SPARK-24624: -- I mean regular aggregation functions and Pandas UDF aggregation functions

[jira] [Commented] (SPARK-24624) Can not mix vectorized and non-vectorized UDFs

2019-03-06 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16785920#comment-16785920 ] peay commented on SPARK-24624: -- Are there plans to support something similar for aggregation functions? >

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-05 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16784260#comment-16784260 ] peay commented on SPARK-27019: -- Yes, I had edited my message above shortly after posting - cannot reproduce 

[jira] [Comment Edited] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-04 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16784172#comment-16784172 ] peay edited comment on SPARK-27019 at 3/5/19 7:28 AM: -- Great! -Is that compatible

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-04 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16784172#comment-16784172 ] peay commented on SPARK-27019: -- Great! Is that compatible with my second observation above? (I tested

[jira] [Commented] (SPARK-27039) toPandas with Arrow swallows maxResultSize errors

2019-03-04 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16783711#comment-16783711 ] peay commented on SPARK-27039: -- Interesting, thanks for checking. Yes, I can definitely live without that

[jira] [Commented] (SPARK-27039) toPandas with Arrow swallows maxResultSize errors

2019-03-04 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27039?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16783352#comment-16783352 ] peay commented on SPARK-27039: -- Oops, sorry, I've edited the title. I meant _Arrow_, not Avro. Maybe

[jira] [Updated] (SPARK-27039) toPandas with Arrow swallows maxResultSize errors

2019-03-04 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27039?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-27039: - Summary: toPandas with Arrow swallows maxResultSize errors (was: toPandas with Avro swallows maxResultSize

[jira] [Created] (SPARK-27039) toPandas with Avro swallows maxResultSize errors

2019-03-04 Thread peay (JIRA)
peay created SPARK-27039: Summary: toPandas with Avro swallows maxResultSize errors Key: SPARK-27039 URL: https://issues.apache.org/jira/browse/SPARK-27039 Project: Spark Issue Type: Bug

[jira] [Updated] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-27019: - Attachment: application_1550040445209_4748 > Spark UI's SQL tab shows inconsistent values >

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781718#comment-16781718 ] peay commented on SPARK-27019: -- Attached: [^application_1550040445209_4748] > Spark UI's SQL tab shows

[jira] [Comment Edited] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781651#comment-16781651 ] peay edited comment on SPARK-27019 at 3/1/19 1:06 PM: -- Sure does. I've done one

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781651#comment-16781651 ] peay commented on SPARK-27019: -- Sure does. I've done one more test, using no executors and running the

[jira] [Updated] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-27019: - Attachment: query-job-1.png query-1-details.png query-1-list.png > Spark UI's

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781604#comment-16781604 ] peay commented on SPARK-27019: -- OK, I can actually reproduce it pretty easily with pyspark: {code:java}

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781595#comment-16781595 ] peay commented on SPARK-27019: -- I don't really have a minimal example - this uses a bunch of python jobs

[jira] [Comment Edited] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781595#comment-16781595 ] peay edited comment on SPARK-27019 at 3/1/19 11:40 AM: --- I don't really have a

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781583#comment-16781583 ] peay commented on SPARK-27019: -- Also seeing this both for the Spark UI for live jobs, and when accessing

[jira] [Updated] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-27019: - Description: Since 2.4.0, I am frequently seeing broken outputs in the SQL tab of the Spark UI, where

[jira] [Comment Edited] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781581#comment-16781581 ] peay edited comment on SPARK-27019 at 3/1/19 11:10 AM: --- Seems like the screenshots

[jira] [Updated] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-27019: - Attachment: screenshot-spark-ui-list.png screenshot-spark-ui-details.png > Spark UI's SQL tab

[jira] [Commented] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-27019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16781581#comment-16781581 ] peay commented on SPARK-27019: -- Seems like the screenshot did not embed, attaching them instead. > Spark

[jira] [Created] (SPARK-27019) Spark UI's SQL tab shows inconsistent values

2019-03-01 Thread peay (JIRA)
peay created SPARK-27019: Summary: Spark UI's SQL tab shows inconsistent values Key: SPARK-27019 URL: https://issues.apache.org/jira/browse/SPARK-27019 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-24523) InterruptedException when closing SparkContext

2018-10-10 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24523?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16644602#comment-16644602 ] peay commented on SPARK-24523: -- I've started hitting the exact same error since I upgraded to EMR 5.17.

[jira] [Updated] (SPARK-24523) InterruptedException when closing SparkContext

2018-10-10 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-24523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-24523: - Attachment: thread-dump.log > InterruptedException when closing SparkContext >

[jira] [Commented] (SPARK-10925) Exception when joining DataFrames

2017-09-20 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-10925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16173139#comment-16173139 ] peay commented on SPARK-10925: -- Same issue on Spark 2.1.0. I have been working around this using

[jira] [Created] (SPARK-21659) FileStreamSink checks for _spark_metadata even if path has globs

2017-08-07 Thread peay (JIRA)
peay created SPARK-21659: Summary: FileStreamSink checks for _spark_metadata even if path has globs Key: SPARK-21659 URL: https://issues.apache.org/jira/browse/SPARK-21659 Project: Spark Issue

[jira] [Closed] (SPARK-21550) approxQuantiles throws "next on empty iterator" on empty data

2017-07-27 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay closed SPARK-21550. Resolution: Duplicate Fix Version/s: 2.2.0 > approxQuantiles throws "next on empty iterator" on empty data

[jira] [Commented] (SPARK-21551) pyspark's collect fails when getaddrinfo is too slow

2017-07-27 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21551?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16103539#comment-16103539 ] peay commented on SPARK-21551: -- Sure, does 15 seconds sound good? > pyspark's collect fails when

[jira] [Created] (SPARK-21551) pyspark's collect fails when getaddrinfo is too slow

2017-07-27 Thread peay (JIRA)
peay created SPARK-21551: Summary: pyspark's collect fails when getaddrinfo is too slow Key: SPARK-21551 URL: https://issues.apache.org/jira/browse/SPARK-21551 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-21550) approxQuantiles throws "next on empty iterator" on empty data

2017-07-27 Thread peay (JIRA)
peay created SPARK-21550: Summary: approxQuantiles throws "next on empty iterator" on empty data Key: SPARK-21550 URL: https://issues.apache.org/jira/browse/SPARK-21550 Project: Spark Issue Type:

[jira] [Comment Edited] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15672245#comment-15672245 ] peay edited comment on SPARK-18473 at 11/17/16 12:55 AM: - Ok, I see, thanks. The

[jira] [Commented] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15672245#comment-15672245 ] peay commented on SPARK-18473: -- Ok, I see, thanks. The fix is in 2.0.3 though, not 2.0.2, correct? >

[jira] [Commented] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15671624#comment-15671624 ] peay commented on SPARK-18473: -- Ah, great, thanks. I had checked out the CHANGELOG but couldn't find

[jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-18473: - Description: I have stumbled onto a corner case where an INNER join appears to return incorrect results. I

[jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-18473: - Description: I have stumbled onto a corner case where an INNER join appears to return incorrect results. I

[jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-18473: - Description: I have stumbled onto a corner case where an INNER join appears to return incorrect results. I

[jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-18473: - Description: I have stumbled onto a corner case where an INNER join appears to return incorrect results. I

[jira] [Updated] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-18473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] peay updated SPARK-18473: - Description: I have stumbled onto a corner case where an INNER join appears to return incorrect results. I

[jira] [Created] (SPARK-18473) Correctness issue in INNER join result with window functions

2016-11-16 Thread peay (JIRA)
peay created SPARK-18473: Summary: Correctness issue in INNER join result with window functions Key: SPARK-18473 URL: https://issues.apache.org/jira/browse/SPARK-18473 Project: Spark Issue Type: