[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939867#comment-16939867 ] Yuming Wang commented on SPARK-29274: - I don't know how Vertica works. But it returns incorrect

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939865#comment-16939865 ] Yuming Wang commented on SPARK-29274: - Oracle will cast varchar type to number type: {noformat}

[jira] [Updated] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicholas Chammas updated SPARK-29280: - Description:

[jira] [Comment Edited] (SPARK-29102) Read gzipped file into multiple partitions without full gzip expansion on a single-node

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939856#comment-16939856 ] Nicholas Chammas edited comment on SPARK-29102 at 9/28/19 5:35 AM: --- I

[jira] [Commented] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939864#comment-16939864 ] Nicholas Chammas commented on SPARK-29280: -- cc [~hyukjin.kwon], [~cloud_fan] > DataFrameReader

[jira] [Created] (SPARK-29280) DataFrameReader should support a compression option

2019-09-27 Thread Nicholas Chammas (Jira)
Nicholas Chammas created SPARK-29280: Summary: DataFrameReader should support a compression option Key: SPARK-29280 URL: https://issues.apache.org/jira/browse/SPARK-29280 Project: Spark

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939862#comment-16939862 ] Yuming Wang commented on SPARK-29274: - SQL server will cast string type to decimal type: {noformat}

[jira] [Comment Edited] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939862#comment-16939862 ] Yuming Wang edited comment on SPARK-29274 at 9/28/19 5:26 AM: -- SQL Server

[jira] [Commented] (SPARK-29102) Read gzipped file into multiple partitions without full gzip expansion on a single-node

2019-09-27 Thread Nicholas Chammas (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939856#comment-16939856 ] Nicholas Chammas commented on SPARK-29102: -- I figured it out. Looks like the correct setting is

[jira] [Created] (SPARK-29279) DataSourceV2: merge SHOW NAMESPACES and SHOW DATABASES code path

2019-09-27 Thread Terry Kim (Jira)
Terry Kim created SPARK-29279: - Summary: DataSourceV2: merge SHOW NAMESPACES and SHOW DATABASES code path Key: SPARK-29279 URL: https://issues.apache.org/jira/browse/SPARK-29279 Project: Spark

[jira] [Comment Edited] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939827#comment-16939827 ] angerszhu edited comment on SPARK-29273 at 9/28/19 3:07 AM: [~UncleHuang]

[jira] [Comment Edited] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939827#comment-16939827 ] angerszhu edited comment on SPARK-29273 at 9/28/19 3:06 AM: [~UncleHuang]

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939827#comment-16939827 ] angerszhu commented on SPARK-29273: --- [~UncleHuang] {code} /** * Peak memory used by internal

[jira] [Commented] (SPARK-29186) SubqueryAlias name value is null in Spark 2.4.3 Logical plan.

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29186?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939825#comment-16939825 ] angerszhu commented on SPARK-29186: --- In spark-2.4 & and master branch.  RUN SQL  {code:java} create

[jira] [Updated] (SPARK-29278) Implement CATALOG/NAMESPACE related SQL commands for Data Source V2

2019-09-27 Thread Terry Kim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Terry Kim updated SPARK-29278: -- Summary: Implement CATALOG/NAMESPACE related SQL commands for Data Source V2 (was: Implement

[jira] [Created] (SPARK-29278) Implement NAMESPACE related SQL commands for Data Source V2

2019-09-27 Thread Terry Kim (Jira)
Terry Kim created SPARK-29278: - Summary: Implement NAMESPACE related SQL commands for Data Source V2 Key: SPARK-29278 URL: https://issues.apache.org/jira/browse/SPARK-29278 Project: Spark Issue

[jira] [Resolved] (SPARK-29221) Flaky test: SQLQueryTestSuite.sql (subquery/scalar-subquery/scalar-subquery-select.sql)

2019-09-27 Thread Jungtaek Lim (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jungtaek Lim resolved SPARK-29221. -- Fix Version/s: 3.0.0 Resolution: Fixed This is resolved via 

[jira] [Created] (SPARK-29277) DataSourceV2: Add early filter and projection pushdown

2019-09-27 Thread Ryan Blue (Jira)
Ryan Blue created SPARK-29277: - Summary: DataSourceV2: Add early filter and projection pushdown Key: SPARK-29277 URL: https://issues.apache.org/jira/browse/SPARK-29277 Project: Spark Issue Type:

[jira] [Commented] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread Jim Kleckner (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939787#comment-16939787 ] Jim Kleckner commented on SPARK-29055: -- [~Geopap] How quickly does the memory grow? > Memory leak

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a

[jira] [Closed] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz closed SPARK-29265. > Window+collect_list causing single-task operation >

[jira] [Resolved] (SPARK-27254) Cleanup complete but becoming invalid output files in ManifestFileCommitProtocol if job is aborted

2019-09-27 Thread Shixiong Zhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shixiong Zhu resolved SPARK-27254. -- Fix Version/s: 3.0.0 Assignee: Jungtaek Lim Resolution: Fixed > Cleanup

[jira] [Commented] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Jiaqi Guo (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939712#comment-16939712 ] Jiaqi Guo commented on SPARK-29232: --- [~aman_omer], here is [an example from the Spark

[jira] [Comment Edited] (SPARK-26431) Update availableSlots by availableCpus for barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-26431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939667#comment-16939667 ] Xingbo Jiang edited comment on SPARK-26431 at 9/27/19 6:39 PM: --- Resolved

[jira] [Comment Edited] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939670#comment-16939670 ] Xingbo Jiang edited comment on SPARK-29263 at 9/27/19 6:38 PM: --- Resolved

[jira] [Updated] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang updated SPARK-29263: - Target Version/s: (was: 2.4.0) > availableSlots in scheduler can change before being checked

[jira] [Updated] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang updated SPARK-29263: - Affects Version/s: (was: 3.0.0) 2.4.0 > availableSlots in scheduler

[jira] [Resolved] (SPARK-29263) availableSlots in scheduler can change before being checked by barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang resolved SPARK-29263. -- Fix Version/s: 3.0.0 Target Version/s: 2.4.0 Assignee: Juliusz Sompolski

[jira] [Resolved] (SPARK-26431) Update availableSlots by availableCpus for barrier taskset

2019-09-27 Thread Xingbo Jiang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-26431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xingbo Jiang resolved SPARK-26431. -- Fix Version/s: 3.0.0 Assignee: Juliusz Sompolski Resolution: Fixed Resolved

[jira] [Resolved] (SPARK-29070) Make SparkLauncher log full spark-submit command line

2019-09-27 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29070?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-29070. Fix Version/s: 3.0.0 Assignee: Jeff Evans Resolution:

[jira] [Resolved] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-29240. --- Fix Version/s: 3.0.0 2.4.5 Assignee: Hyukjin Kwon

[jira] [Updated] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29240: -- Labels: (was: beginner easyfix newbie starter) > PySpark 2.4 about sql function

[jira] [Updated] (SPARK-29240) PySpark 2.4 about sql function 'element_at' param 'extraction'

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29240: -- Issue Type: Bug (was: Improvement) > PySpark 2.4 about sql function 'element_at' param

[jira] [Commented] (SPARK-29245) CCE during creating HiveMetaStoreClient

2019-09-27 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939583#comment-16939583 ] Dongjoon Hyun commented on SPARK-29245: --- Like we missed this last time, there might be more code

[jira] [Resolved] (SPARK-29015) Can't use 'add jar' jar's class as create table serde class on JDK 11

2019-09-27 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-29015. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25775

[jira] [Assigned] (SPARK-29015) Can't use 'add jar' jar's class as create table serde class on JDK 11

2019-09-27 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-29015: Assignee: angerszhu > Can't use 'add jar' jar's class as create table serde class on JDK

[jira] [Created] (SPARK-29276) Spark job fails because of timeout to Driver

2019-09-27 Thread Jochen Hebbrecht (Jira)
Jochen Hebbrecht created SPARK-29276: Summary: Spark job fails because of timeout to Driver Key: SPARK-29276 URL: https://issues.apache.org/jira/browse/SPARK-29276 Project: Spark Issue

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread huangweiyi (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939466#comment-16939466 ] huangweiyi commented on SPARK-29273: i do the same thing like shs for replaying the spark-event-log, 

[jira] [Commented] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939423#comment-16939423 ] Florentino Sainz commented on SPARK-29265: -- Ok just realized what's happening, we did have one

[jira] [Issue Comment Deleted] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Comment: was deleted (was: Ok, after quite a bit of research, I had to change my Window

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939414#comment-16939414 ] Yuming Wang commented on SPARK-29274: - PostgreSQL need to add explicit type casts: {code:sql}

[jira] [Commented] (SPARK-29267) rdd.countApprox should stop when 'timeout'

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29267?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939393#comment-16939393 ] Hyukjin Kwon commented on SPARK-29267: -- Do you mean countApprox should fail by its timeout?

[jira] [Created] (SPARK-29275) Update the SQL migration guide regarding special date/timestamp values

2019-09-27 Thread Maxim Gekk (Jira)
Maxim Gekk created SPARK-29275: -- Summary: Update the SQL migration guide regarding special date/timestamp values Key: SPARK-29275 URL: https://issues.apache.org/jira/browse/SPARK-29275 Project: Spark

[jira] [Assigned] (SPARK-21914) Running examples as tests in SQL builtin function documentation

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-21914: Assignee: Maxim Gekk > Running examples as tests in SQL builtin function documentation >

[jira] [Resolved] (SPARK-21914) Running examples as tests in SQL builtin function documentation

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-21914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-21914. -- Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25942

[jira] [Assigned] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-29274: --- Assignee: Pengfei Chang (was: Yuming Wang) > Can not coerce decimal type to double type

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Assigned] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang reassigned SPARK-29274: --- Assignee: Yuming Wang > Can not coerce decimal type to double type when it's join key >

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Commented] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939384#comment-16939384 ] Yuming Wang commented on SPARK-29274: - I'll asgin this ticket to [~pfchang]  who found this issue.

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Description: How to reproduce this issue: {code:sql} create table t1 (incdata_id decimal(21,0), v

[jira] [Updated] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29274: Attachment: image-2019-09-27-20-20-24-238.png > Can not coerce decimal type to double type when

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Summary: Window+collect_list causing single-task operation (was: Window orderBy

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a

[jira] [Updated] (SPARK-29265) Window+collect_list causing single-task operation

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florentino Sainz updated SPARK-29265: - Description: Hi,   I had this problem in "real" environments and also made a

[jira] [Commented] (SPARK-29265) Window orderBy causing full-DF orderBy

2019-09-27 Thread Florentino Sainz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29265?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939381#comment-16939381 ] Florentino Sainz commented on SPARK-29265: -- Ok, after quite a bit of research, I had to change

[jira] [Created] (SPARK-29274) Can not coerce decimal type to double type when it's join key

2019-09-27 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-29274: --- Summary: Can not coerce decimal type to double type when it's join key Key: SPARK-29274 URL: https://issues.apache.org/jira/browse/SPARK-29274 Project: Spark

[jira] [Updated] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon updated SPARK-29232: - Priority: Major (was: Critical) > RandomForestRegressionModel does not update the parameter

[jira] [Resolved] (SPARK-29238) Add newColumn using withColumn to an empty Dataframe

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29238. -- Resolution: Not A Problem > Add newColumn using withColumn to an empty Dataframe >

[jira] [Commented] (SPARK-29238) Add newColumn using withColumn to an empty Dataframe

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939379#comment-16939379 ] Hyukjin Kwon commented on SPARK-29238: -- Since there's no record, there would be anything will be

[jira] [Commented] (SPARK-29262) DataFrameWriter insertIntoPartition function

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939371#comment-16939371 ] Hyukjin Kwon commented on SPARK-29262: -- [~hzfeiwang] please be clear about what this JIRA means.

[jira] [Resolved] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29272. -- Resolution: Invalid > dataframe.write.format("libsvm").save() take too much time >

[jira] [Commented] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939367#comment-16939367 ] Hyukjin Kwon commented on SPARK-29272: -- Questions should go to mailing list or stackoverflow. You

[jira] [Commented] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939365#comment-16939365 ] Hyukjin Kwon commented on SPARK-29273: -- Can you show reproducer with the current / expected & input

[jira] [Created] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-09-27 Thread huangweiyi (Jira)
huangweiyi created SPARK-29273: -- Summary: Spark peakExecutionMemory metrics is zero Key: SPARK-29273 URL: https://issues.apache.org/jira/browse/SPARK-29273 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-29242) Check results of expression examples automatically

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29242. -- Resolution: Duplicate > Check results of expression examples automatically >

[jira] [Resolved] (SPARK-29253) Add agg(Spark, Spark*) to SQL Dataset

2019-09-27 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-29253. -- Resolution: Won't Fix > Add agg(Spark, Spark*) to SQL Dataset >

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Description: I used Spark 2.1.1 and I upgraded into new versions. After Spark version 2.3.3,  I

[jira] [Commented] (SPARK-29232) RandomForestRegressionModel does not update the parameter maps of the DecisionTreeRegressionModels underneath

2019-09-27 Thread Aman Omer (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939293#comment-16939293 ] Aman Omer commented on SPARK-29232: --- I used some examples for RF regression but can't use

[jira] [Updated] (SPARK-29152) Spark Executor Plugin API shutdown is not proper when dynamic allocation enabled[SPARK-24918]

2019-09-27 Thread Rakesh Raushan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rakesh Raushan updated SPARK-29152: --- Description: *Issue Description* Spark Executor Plugin API *shutdown handling is not

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Affects Version/s: (was: 2.4.4) (was: 2.4.3) > Memory leak in

[jira] [Updated] (SPARK-29055) Memory leak in Spark

2019-09-27 Thread George Papa (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29055?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] George Papa updated SPARK-29055: Summary: Memory leak in Spark (was: Memory leak in Spark Driver) > Memory leak in Spark >

[jira] [Updated] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张焕明 updated SPARK-29272: Description: I have a pyspark dataframe with about 10 thousand records,while using pyspark api to dump the whole

[jira] [Updated] (SPARK-29272) dataframe.write.format("libsvm").save() take too much time

2019-09-27 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-29272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] 张焕明 updated SPARK-29272: Summary: dataframe.write.format("libsvm").save() take too much time (was:

[jira] [Created] (SPARK-29272) dataframe.write.format("libsvm").save() 保存时间太长

2019-09-27 Thread Jira
张焕明 created SPARK-29272: --- Summary: dataframe.write.format("libsvm").save() 保存时间太长 Key: SPARK-29272 URL: https://issues.apache.org/jira/browse/SPARK-29272 Project: Spark Issue Type: Question

[jira] [Assigned] (SPARK-29213) Make it consistent when get notnull output and generate null checks in FilterExec

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-29213: --- Assignee: Wang Shuo > Make it consistent when get notnull output and generate null checks

[jira] [Resolved] (SPARK-29213) Make it consistent when get notnull output and generate null checks in FilterExec

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-29213. - Fix Version/s: 3.0.0 2.4.5 Resolution: Fixed Issue resolved by pull

[jira] [Resolved] (SPARK-29270) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu resolved SPARK-29270. --- Resolution: Duplicate > Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then >

[jira] [Resolved] (SPARK-29257) All Task attempts scheduled to the same executor inevitably access the same bad disk

2019-09-27 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-29257. - Resolution: Not A Problem > All Task attempts scheduled to the same executor inevitably access

[jira] [Updated] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-29271: -- Description: Reproduce in master, run this UT

[jira] [Updated] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-29271: -- Description: Reproduce in master, run this UT !WeChat33ff9dee6d78fe7be280d5f3b974e6ac.png!

[jira] [Updated] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-29271: -- Description: Reproduce in master, run this UT (was: Reproduce in master, run this UT {code:java}

[jira] [Updated] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-29271: -- Attachment: WeChat33ff9dee6d78fe7be280d5f3b974e6ac.png > Run HiveSparkSubmitSuite with hive jars use

[jira] [Updated] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] angerszhu updated SPARK-29271: -- Description: Reproduce in master, run this UT {code:java} {code} was: Reproduce in master, run

[jira] [Created] (SPARK-29271) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
angerszhu created SPARK-29271: - Summary: Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout Key: SPARK-29271 URL: https://issues.apache.org/jira/browse/SPARK-29271 Project:

[jira] [Created] (SPARK-29270) Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout

2019-09-27 Thread angerszhu (Jira)
angerszhu created SPARK-29270: - Summary: Run HiveSparkSubmitSuite with hive jars use maven 3.1.2 won't stop then timeout Key: SPARK-29270 URL: https://issues.apache.org/jira/browse/SPARK-29270 Project:

[jira] [Updated] (SPARK-29203) Reduce shuffle partitions in SQLQueryTestSuite

2019-09-27 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-29203: Parent: SPARK-25604 Issue Type: Sub-task (was: Improvement) > Reduce shuffle partitions

[jira] [Commented] (SPARK-29195) Can't config orc.compress.size option for native ORC writer

2019-09-27 Thread Eric Sun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29195?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16939155#comment-16939155 ] Eric Sun commented on SPARK-29195: -- It is very likely in Spark -