[jira] [Created] (SPARK-29281) Examples in Like/RLike doesn't consider the default value of spark.sql.parser.escapedStringLiterals

2019-09-27 Thread Jungtaek Lim (Jira)
Jungtaek Lim created SPARK-29281: Summary: Examples in Like/RLike doesn't consider the default value of spark.sql.parser.escapedStringLiterals Key: SPARK-29281 URL: https://issues.apache.org/jira/browse/SPARK-2928

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939879#comment-16939879 ] Yuming Wang commented on SPARK-29274: - Mysql also return incorrect values. {noformat

[jira] [Comment Edited] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939875#comment-16939875 ] Yuming Wang edited comment on SPARK-29274 at 9/28/19 6:22 AM:

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939875#comment-16939875 ] Yuming Wang commented on SPARK-29274: - I don't konw how DB2 work. But ir returns cor

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939867#comment-16939867 ] Yuming Wang commented on SPARK-29274: - I don't know how Vertica works. But it return

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939865#comment-16939865 ] Yuming Wang commented on SPARK-29274: - Oracle will cast varchar type to number type:

[jira] [Updated] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-29280: - Description: [DataFrameWriter|http://spark.apache.org/docs/latest/api/python/pyspark.sql

[jira] [Comment Edited] (SPARK-29102) Read gzipped file into multiple partitions without full gzip expansion on a single-node

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939856#comment-16939856 ] Nicholas Chammas edited comment on SPARK-29102 at 9/28/19 5:35 AM: ---

[jira] [Commented] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939864#comment-16939864 ] Nicholas Chammas commented on SPARK-29280: -- cc [~hyukjin.kwon], [~cloud_fan] >

[jira] [Created] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
Nicholas Chammas created SPARK-29280: Summary: DataFrameReader should support a compression option Key: SPARK-29280 URL: https://issues.apache.org/jira/browse/SPARK-29280 Project: Spark I

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939862#comment-16939862 ] Yuming Wang commented on SPARK-29274: - SQL server will cast string type to decimal t

[jira] [Comment Edited] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939862#comment-16939862 ] Yuming Wang edited comment on SPARK-29274 at 9/28/19 5:26 AM:

[jira] [Commented] (SPARK-29102) Read gzipped file into multiple partitions without full gzip expansion on a single-node

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939856#comment-16939856 ] Nicholas Chammas commented on SPARK-29102: -- I figured it out. Looks like the co

[jira] [Created] (SPARK-29279) DataSourceV2: merge SHOW NAMESPACES and SHOW DATABASES code path

2019-09-27 Thread Terry Kim (Jira)
Terry Kim created SPARK-29279: - Summary: DataSourceV2: merge SHOW NAMESPACES and SHOW DATABASES code path Key: SPARK-29279 URL: https://issues.apache.org/jira/browse/SPARK-29279 Project: Spark I

[jira] [Comment Edited] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939827#comment-16939827 ] angerszhu edited comment on SPARK-29273 at 9/28/19 3:07 AM:

[jira] [Comment Edited] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939827#comment-16939827 ] angerszhu edited comment on SPARK-29273 at 9/28/19 3:06 AM:

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939827#comment-16939827 ] angerszhu commented on SPARK-29273: --- [~UncleHuang] {code} /** * Peak memory used

[jira] [Commented] (SPARK-29186) SubqueryAlias name value is null in Spark 2.4.3 Logical plan.

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939825#comment-16939825 ] angerszhu commented on SPARK-29186: --- In spark-2.4 & and master branch.  RUN SQL  {cod

[jira] [Updated] (SPARK-29278) Implement CATALOG/NAMESPACE related SQL commands for Data Source V2

2019-09-27 Thread Terry Kim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Terry Kim updated SPARK-29278: -- Summary: Implement CATALOG/NAMESPACE related SQL commands for Data Source V2 (was: Implement NAMESPAC

[jira] [Created] (SPARK-29278) Implement NAMESPACE related SQL commands for Data Source V2

2019-09-27 Thread Terry Kim (Jira)
Terry Kim created SPARK-29278: - Summary: Implement NAMESPACE related SQL commands for Data Source V2 Key: SPARK-29278 URL: https://issues.apache.org/jira/browse/SPARK-29278 Project: Spark Issue

[jira] [Resolved] (SPARK-29221) Flaky test: SQLQueryTestSuite.sql (subquery/scalar-subquery/scalar-subquery-select.sql)

2019-09-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-29221. -- Fix Version/s: 3.0.0 Resolution: Fixed This is resolved via [https://github.com/apache/

[jira] [Created] (SPARK-29277) DataSourceV2: Add early filter and projection pushdown

2019-09-27 Thread Ryan Blue (Jira)
Ryan Blue created SPARK-29277: - Summary: DataSourceV2: Add early filter and projection pushdown Key: SPARK-29277 URL: https://issues.apache.org/jira/browse/SPARK-29277 Project: Spark Issue Type:

[jira] [Commented] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread Jim Kleckner (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939787#comment-16939787 ] Jim Kleckner commented on SPARK-29055: -- [~Geopap] How quickly does the memory grow?

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a self-cont

[jira] [Closed] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz closed SPARK-29265. > Window+collect_list causing single-task operation >

[jira] [Resolved] (SPARK-27254) Cleanup complete but becoming invalid output files in ManifestFileCommitProtocol if job is aborted

2019-09-27 Thread Shixiong Zhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-27254. -- Fix Version/s: 3.0.0 Assignee: Jungtaek Lim Resolution: Fixed > Cleanup comple

[jira] [Commented] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Jiaqi Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939712#comment-16939712 ] Jiaqi Guo commented on SPARK-29232: --- [~aman_omer], here is [an example from the Spark

[jira] [Comment Edited] (SPARK-26431) Update availableSlots by availableCpus for barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-26431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939667#comment-16939667 ] Xingbo Jiang edited comment on SPARK-26431 at 9/27/19 6:39 PM: ---

[jira] [Comment Edited] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939670#comment-16939670 ] Xingbo Jiang edited comment on SPARK-29263 at 9/27/19 6:38 PM: ---

[jira] [Updated] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang updated SPARK-29263: - Target Version/s: (was: 2.4.0) > availableSlots in scheduler can change before being checked b

[jira] [Updated] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang updated SPARK-29263: - Affects Version/s: (was: 3.0.0) 2.4.0 > availableSlots in scheduler c

[jira] [Resolved] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang resolved SPARK-29263. -- Fix Version/s: 3.0.0 Target Version/s: 2.4.0 Assignee: Juliusz Sompolski

[jira] [Resolved] (SPARK-26431) Update availableSlots by availableCpus for barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-26431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang resolved SPARK-26431. -- Fix Version/s: 3.0.0 Assignee: Juliusz Sompolski Resolution: Fixed Resolved by

[jira] [Resolved] (SPARK-29070) Make SparkLauncher log full spark-submit command line

2019-09-27 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-29070. Fix Version/s: 3.0.0 Assignee: Jeff Evans Resolution: Fixed

[jira] [Resolved] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-29240. --- Fix Version/s: 3.0.0 2.4.5 Assignee: Hyukjin Kwon Resoluti

[jira] [Updated] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29240: -- Labels: (was: beginner easyfix newbie starter) > PySpark 2.4 about sql function 'element_at'

[jira] [Updated] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29240: -- Issue Type: Bug (was: Improvement) > PySpark 2.4 about sql function 'element_at' param 'extra

[jira] [Commented] (SPARK-29245) CCE during creating HiveMetaStoreClient

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939583#comment-16939583 ] Dongjoon Hyun commented on SPARK-29245: --- Like we missed this last time, there migh

[jira] [Resolved] (SPARK-29015) Can't use 'add jar' jar's class as create table serde class on JDK 11

2019-09-27 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-29015. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25775 [https://gi

[jira] [Assigned] (SPARK-29015) Can't use 'add jar' jar's class as create table serde class on JDK 11

2019-09-27 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-29015: Assignee: angerszhu > Can't use 'add jar' jar's class as create table serde class on JDK

[jira] [Created] (SPARK-29276) Spark job fails because of timeout to Driver

2019-09-27 Thread Jochen Hebbrecht (Jira)
Jochen Hebbrecht created SPARK-29276: Summary: Spark job fails because of timeout to Driver Key: SPARK-29276 URL: https://issues.apache.org/jira/browse/SPARK-29276 Project: Spark Issue Ty

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread huangweiyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939466#comment-16939466 ] huangweiyi commented on SPARK-29273: i do the same thing like shs for replaying the

[jira] [Commented] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939423#comment-16939423 ] Florentino Sainz commented on SPARK-29265: -- Ok just realized what's happening,

[jira] [Issue Comment Deleted] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Comment: was deleted (was: Ok, after quite a bit of research, I had to change my Window

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939414#comment-16939414 ] Yuming Wang commented on SPARK-29274: - PostgreSQL need to add explicit type casts: {

[jira] [Commented] (SPARK-29267) rdd.countApprox should stop when 'timeout'

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939393#comment-16939393 ] Hyukjin Kwon commented on SPARK-29267: -- Do you mean countApprox should fail by its

[jira] [Created] (SPARK-29275) Update the SQL migration guide regarding special date/timestamp values

2019-09-27 Thread Maxim Gekk (Jira)
Maxim Gekk created SPARK-29275: -- Summary: Update the SQL migration guide regarding special date/timestamp values Key: SPARK-29275 URL: https://issues.apache.org/jira/browse/SPARK-29275 Project: Spark

[jira] [Assigned] (SPARK-21914) Running examples as tests in SQL builtin function documentation

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-21914: Assignee: Maxim Gekk > Running examples as tests in SQL builtin function documentation >

[jira] [Resolved] (SPARK-21914) Running examples as tests in SQL builtin function documentation

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21914. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25942 [https://gi

[jira] [Assigned] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-29274: --- Assignee: Pengfei Chang (was: Yuming Wang) > Can not coerce decimal type to double type wh

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Assigned] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-29274: --- Assignee: Yuming Wang > Can not coerce decimal type to double type when it's join key > ---

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939384#comment-16939384 ] Yuming Wang commented on SPARK-29274: - I'll asgin this ticket to [~pfchang]  who fou

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Attachment: image-2019-09-27-20-20-24-238.png > Can not coerce decimal type to double type when it

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Summary: Window+collect_list causing single-task operation (was: Window orderBy causing

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a self-cont

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a self-cont

[jira] [Commented] (SPARK-29265) Window orderBy causing full-DF orderBy

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939381#comment-16939381 ] Florentino Sainz commented on SPARK-29265: -- Ok, after quite a bit of research,

[jira] [Created] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-29274: --- Summary: Can not coerce decimal type to double type when it's join key Key: SPARK-29274 URL: https://issues.apache.org/jira/browse/SPARK-29274 Project: Spark

[jira] [Updated] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-29232: - Priority: Major (was: Critical) > RandomForestRegressionModel does not update the parameter map

[jira] [Resolved] (SPARK-29238) Add newColumn using withColumn to an empty Dataframe

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29238. -- Resolution: Not A Problem > Add newColumn using withColumn to an empty Dataframe > ---

[jira] [Commented] (SPARK-29238) Add newColumn using withColumn to an empty Dataframe

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939379#comment-16939379 ] Hyukjin Kwon commented on SPARK-29238: -- Since there's no record, there would be any

[jira] [Commented] (SPARK-29262) DataFrameWriter insertIntoPartition function

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939371#comment-16939371 ] Hyukjin Kwon commented on SPARK-29262: -- [~hzfeiwang] please be clear about what thi

[jira] [Resolved] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29272. -- Resolution: Invalid > dataframe.write.format("libsvm").save() take too much time > ---

[jira] [Commented] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939367#comment-16939367 ] Hyukjin Kwon commented on SPARK-29272: -- Questions should go to mailing list or stac

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939365#comment-16939365 ] Hyukjin Kwon commented on SPARK-29273: -- Can you show reproducer with the current /

[jira] [Created] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread huangweiyi (Jira)
huangweiyi created SPARK-29273: -- Summary: Spark peakExecutionMemory metrics is zero Key: SPARK-29273 URL: https://issues.apache.org/jira/browse/SPARK-29273 Project: Spark Issue Type: Improvement

[jira] [Resolved] (SPARK-29242) Check results of expression examples automatically

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29242. -- Resolution: Duplicate > Check results of expression examples automatically > -

[jira] [Resolved] (SPARK-29253) Add agg(Spark, Spark*) to SQL Dataset

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29253. -- Resolution: Won't Fix > Add agg(Spark, Spark*) to SQL Dataset > --

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Description: I used Spark 2.1.1 and I upgraded into new versions. After Spark version 2.3.3,  I o

[jira] [Commented] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16939293#comment-16939293 ] Aman Omer commented on SPARK-29232: --- I used some examples for RF regression but can't

[jira] [Updated] (SPARK-29152) Spark Executor Plugin API shutdown is not proper when dynamic allocation enabled[SPARK-24918]

2019-09-27 Thread Rakesh Raushan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh Raushan updated SPARK-29152: --- Description: *Issue Description* Spark Executor Plugin API *shutdown handling is not proper

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Affects Version/s: (was: 2.4.4) (was: 2.4.3) > Memory leak in Spark

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Summary: Memory leak in Spark (was: Memory leak in Spark Driver) > Memory leak in Spark > ---

[jira] [Updated] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张焕明 updated SPARK-29272: Description: I have a pyspark dataframe with about 10 thousand records,while using pyspark api to dump the whole

[jira] [Updated] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张焕明 updated SPARK-29272: Summary: dataframe.write.format("libsvm").save() take too much time (was: dataframe.write.format("libsvm").save()

[jira] [Created] (SPARK-29272) dataframe.write.format("libsvm").save() 保存时间太长

2019-09-27 Thread Jira
张焕明 created SPARK-29272: --- Summary: dataframe.write.format("libsvm").save() 保存时间太长 Key: SPARK-29272 URL: https://issues.apache.org/jira/browse/SPARK-29272 Project: Spark Issue Type: Question C

[jira] [Assigned] (SPARK-29213) Make it consistent when get notnull output and generate null checks in FilterExec

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-29213: --- Assignee: Wang Shuo > Make it consistent when get notnull output and generate null checks i

[jira] [Resolved] (SPARK-29213) Make it consistent when get notnull output and generate null checks in FilterExec

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-29213. - Fix Version/s: 3.0.0 2.4.5 Resolution: Fixed Issue resolved by pull re

[jira] [Resolved] (SPARK-29270) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu resolved SPARK-29270. --- Resolution: Duplicate > Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then > t

[jira] [Resolved] (SPARK-29257) All Task attempts scheduled to the same executor inevitably access the same bad disk

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-29257. - Resolution: Not A Problem > All Task attempts scheduled to the same executor inevitably access t