[jira] [Created] (SPARK-29326) ANSI store assignment policy: throw exception on insertion failure

2019-10-02 Thread Gengliang Wang (Jira)
Gengliang Wang created SPARK-29326: Summary: ANSI store assignment policy: throw exception on insertion failure Key: SPARK-29326 URL: https://issues.apache.org/jira/browse/SPARK-29326 Project: Spark

[jira] [Created] (SPARK-29327) Support specifying features via multiple columns in Predictor and PredictionModel

2019-10-02 Thread Liangcai Li (Jira)
Liangcai Li created SPARK-29327: Summary: Support specifying features via multiple columns in Predictor and PredictionModel Key: SPARK-29327 URL: https://issues.apache.org/jira/browse/SPARK-29327 Proje

[jira] [Created] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Maxim Gekk (Jira)
Maxim Gekk created SPARK-29328: Summary: Incorrect calculation mean seconds per month Key: SPARK-29328 URL: https://issues.apache.org/jira/browse/SPARK-29328 Project: Spark Issue Type: Bug

[jira] [Created] (SPARK-29329) maven incremental builds not working

2019-10-02 Thread Thomas Graves (Jira)
Thomas Graves created SPARK-29329: Summary: maven incremental builds not working Key: SPARK-29329 URL: https://issues.apache.org/jira/browse/SPARK-29329 Project: Spark Issue Type: Bug

[jira] [Commented] (SPARK-28759) Upgrade scala-maven-plugin to 4.2.0

2019-10-02 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942787#comment-16942787 ] Thomas Graves commented on SPARK-28759: I rolled back this commit and the increme

[jira] [Updated] (SPARK-29329) maven incremental builds not working

2019-10-02 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Thomas Graves updated SPARK-29329: Description: It looks like since we Upgraded scala-maven-plugin to 4.2.0 https://issues.apach

[jira] [Commented] (SPARK-29329) maven incremental builds not working

2019-10-02 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942790#comment-16942790 ] Thomas Graves commented on SPARK-29329: there are few comments on SPARK-28759 in

[jira] [Commented] (SPARK-29212) Add common classes without using JVM backend

2019-10-02 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942809#comment-16942809 ] Maciej Szymkiewicz commented on SPARK-29212: [~podongfeng] It sounds about r

[jira] [Comment Edited] (SPARK-29212) Add common classes without using JVM backend

2019-10-02 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942809#comment-16942809 ] Maciej Szymkiewicz edited comment on SPARK-29212 at 10/2/19 1:41 PM:

[jira] [Assigned] (SPARK-28970) implement USE CATALOG/NAMESPACE for Data Source V2

2019-10-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-28970: Assignee: Terry Kim > implement USE CATALOG/NAMESPACE for Data Source V2 >

[jira] [Resolved] (SPARK-28970) implement USE CATALOG/NAMESPACE for Data Source V2

2019-10-02 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-28970. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25771 [https://gith

[jira] [Created] (SPARK-29330) Allow users to chose the name of Spark Shuffle service

2019-10-02 Thread Alexander Bessonov (Jira)
Alexander Bessonov created SPARK-29330: Summary: Allow users to chose the name of Spark Shuffle service Key: SPARK-29330 URL: https://issues.apache.org/jira/browse/SPARK-29330 Project: Spark

[jira] [Updated] (SPARK-29330) Allow users to chose the name of Spark Shuffle service

2019-10-02 Thread Alexander Bessonov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexander Bessonov updated SPARK-29330: Description: As of now, Spark uses hardcoded value {{spark_shuffle}} as the name of

[jira] [Created] (SPARK-29331) create DS v2 Write at physical plan

2019-10-02 Thread Wenchen Fan (Jira)
Wenchen Fan created SPARK-29331: Summary: create DS v2 Write at physical plan Key: SPARK-29331 URL: https://issues.apache.org/jira/browse/SPARK-29331 Project: Spark Issue Type: Improvement

[jira] [Commented] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2019-10-02 Thread Simeon H.K. Fitch (Jira)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942906#comment-16942906 ] Simeon H.K. Fitch commented on SPARK-13802: Is there a workaround to this pro

[jira] [Commented] (SPARK-13802) Fields order in Row(**kwargs) is not consistent with Schema.toInternal method

2019-10-02 Thread Maciej Szymkiewicz (Jira)
[ https://issues.apache.org/jira/browse/SPARK-13802?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16942912#comment-16942912 ] Maciej Szymkiewicz commented on SPARK-13802: [~metasim] namedtuples are the

[jira] [Resolved] (SPARK-29273) Spark peakExecutionMemory metrics is zero

2019-10-02 Thread Marcelo Masiero Vanzin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Marcelo Masiero Vanzin resolved SPARK-29273. Fix Version/s: 3.0.0 Assignee: huangweiyi Resolution: Fixed

[jira] [Created] (SPARK-29332) Upgrade zstd-jni library to 1.4.3

2019-10-02 Thread Dongjoon Hyun (Jira)
Dongjoon Hyun created SPARK-29332: Summary: Upgrade zstd-jni library to 1.4.3 Key: SPARK-29332 URL: https://issues.apache.org/jira/browse/SPARK-29332 Project: Spark Issue Type: Improvement

[jira] [Created] (SPARK-29333) Sample weight in RandomForestRegressor

2019-10-02 Thread Jiaqi Guo (Jira)
Jiaqi Guo created SPARK-29333: Summary: Sample weight in RandomForestRegressor Key: SPARK-29333 URL: https://issues.apache.org/jira/browse/SPARK-29333 Project: Spark Issue Type: New Feature

[jira] [Created] (SPARK-29334) Supported vector operators in scala should have parity with pySpark

2019-10-02 Thread Patrick Pisciuneri (Jira)
Patrick Pisciuneri created SPARK-29334: Summary: Supported vector operators in scala should have parity with pySpark Key: SPARK-29334 URL: https://issues.apache.org/jira/browse/SPARK-29334 Proj

[jira] [Updated] (SPARK-29334) Supported vector operators in scala should have parity with pySpark

2019-10-02 Thread Patrick Pisciuneri (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Patrick Pisciuneri updated SPARK-29334: Description: pySpark supports various overloaded operators for the DenseVector type

[jira] [Updated] (SPARK-28917) Jobs can hang because of race of RDD.dependencies

2019-10-02 Thread Imran Rashid (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Imran Rashid updated SPARK-28917: Description: {{RDD.dependencies}} stores the precomputed cache value, but it is not thread-safe

[jira] [Assigned] (SPARK-29332) Upgrade zstd-jni library to 1.4.3

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-29332: Assignee: Dongjoon Hyun > Upgrade zstd-jni library to 1.4.3 >

[jira] [Resolved] (SPARK-29332) Upgrade zstd-jni library to 1.4.3

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-29332. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 26002 [https://

[jira] [Commented] (SPARK-29078) Spark shell fails if read permission is not granted to hive warehouse directory

2019-10-02 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943095#comment-16943095 ] Peter Toth commented on SPARK-29078: [~misutoth], if we look closer at the stacktrac

[jira] [Assigned] (SPARK-27297) Add higher order functions to Scala API

2019-10-02 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin reassigned SPARK-27297: Assignee: Nikolas Vanderhoof > Add higher order functions to Scala API >

[jira] [Resolved] (SPARK-27297) Add higher order functions to Scala API

2019-10-02 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-27297. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 24232 https://g

[jira] [Commented] (SPARK-29078) Spark shell fails if read permission is not granted to hive warehouse directory

2019-10-02 Thread Mihaly Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943118#comment-16943118 ] Mihaly Toth commented on SPARK-29078: But if the user has access to that directory

[jira] [Resolved] (SPARK-28962) High-order function: filter(array, function) → array

2019-10-02 Thread Takuya Ueshin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28962?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takuya Ueshin resolved SPARK-28962. Fix Version/s: 3.0.0 Assignee: Henry Davidge Resolution: Fixed Issue resolve

[jira] [Commented] (SPARK-29078) Spark shell fails if read permission is not granted to hive warehouse directory

2019-10-02 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943127#comment-16943127 ] Peter Toth commented on SPARK-29078: I don't think there should be other databases u

[jira] [Comment Edited] (SPARK-29078) Spark shell fails if read permission is not granted to hive warehouse directory

2019-10-02 Thread Peter Toth (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943127#comment-16943127 ] Peter Toth edited comment on SPARK-29078 at 10/2/19 8:15 PM:

[jira] [Created] (SPARK-29335) Cost Based Optimizer stats are not used while evaluating query plans in Spark Sql

2019-10-02 Thread Srini E (Jira)
Srini E created SPARK-29335: Summary: Cost Based Optimizer stats are not used while evaluating query plans in Spark Sql Key: SPARK-29335 URL: https://issues.apache.org/jira/browse/SPARK-29335 Project: Spar

[jira] [Created] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
Guilherme Souza created SPARK-29336: Summary: The implementation of QuantileSummaries.merge does not guarantee the relativeError will be respected Key: SPARK-29336 URL: https://issues.apache.org/jira/browse/S

[jira] [Updated] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guilherme Souza updated SPARK-29336: Shepherd: (was: Sean Zhong) Description: (sorry for the early submission, I'm stil

[jira] [Updated] (SPARK-29335) Cost Based Optimizer stats are not used while evaluating query plans in Spark Sql

2019-10-02 Thread Srini E (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srini E updated SPARK-29335: Attachment: explain_plan_cbo_spark.txt > Cost Based Optimizer stats are not used while evaluating query pl

[jira] [Updated] (SPARK-29337) How to Cache Table and Pin it in Memory and should not Spill to Disk on Thrift Server

2019-10-02 Thread Srini E (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Srini E updated SPARK-29337: Attachment: Cache+Image.png > How to Cache Table and Pin it in Memory and should not Spill to Disk on > T

[jira] [Created] (SPARK-29337) How to Cache Table and Pin it in Memory and should not Spill to Disk on Thrift Server

2019-10-02 Thread Srini E (Jira)
Srini E created SPARK-29337: Summary: How to Cache Table and Pin it in Memory and should not Spill to Disk on Thrift Server Key: SPARK-29337 URL: https://issues.apache.org/jira/browse/SPARK-29337 Project:

[jira] [Updated] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee that the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guilherme Souza updated SPARK-29336: Summary: The implementation of QuantileSummaries.merge does not guarantee that the relati

[jira] [Updated] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guilherme Souza updated SPARK-29336: Description: Hello Spark maintainers, I was experimenting with my own implementation of t

[jira] [Updated] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee that the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guilherme Souza updated SPARK-29336: Description: Hello Spark maintainers, I was experimenting with my own implementation of t

[jira] [Updated] (SPARK-29336) The implementation of QuantileSummaries.merge does not guarantee that the relativeError will be respected

2019-10-02 Thread Guilherme Souza (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Guilherme Souza updated SPARK-29336: Description: Hello Spark maintainers, I was experimenting with my own implementation of t

[jira] [Commented] (SPARK-18748) UDF multiple evaluations causes very poor performance

2019-10-02 Thread Anton Baranau (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943176#comment-16943176 ] Anton Baranau commented on SPARK-18748: I got the same problem having the code be

[jira] [Comment Edited] (SPARK-18748) UDF multiple evaluations causes very poor performance

2019-10-02 Thread Anton Baranau (Jira)
[ https://issues.apache.org/jira/browse/SPARK-18748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943176#comment-16943176 ] Anton Baranau edited comment on SPARK-18748 at 10/2/19 9:32 PM:

[jira] [Commented] (SPARK-28725) Spark ML not able to de-serialize Logistic Regression model saved in previous version of Spark

2019-10-02 Thread Sharad Varshney (Jira)
[ https://issues.apache.org/jira/browse/SPARK-28725?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16943192#comment-16943192 ] Sharad Varshney commented on SPARK-28725: Even the same version of Spark 2.4.3

[jira] [Created] (SPARK-29338) Add overload for filter with index to Scala/Java API

2019-10-02 Thread Nikolas Vanderhoof (Jira)
Nikolas Vanderhoof created SPARK-29338: Summary: Add overload for filter with index to Scala/Java API Key: SPARK-29338 URL: https://issues.apache.org/jira/browse/SPARK-29338 Project: Spark

[jira] [Resolved] (SPARK-29338) Add overload for filter with index to Scala/Java API

2019-10-02 Thread Nikolas Vanderhoof (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nikolas Vanderhoof resolved SPARK-29338. Resolution: Duplicate > Add overload for filter with index to Scala/Java API >

[jira] [Created] (SPARK-29339) Support Arrow 0.14 in vectoried dapply and gapply (test it in AppVeyor build)

2019-10-02 Thread Hyukjin Kwon (Jira)
Hyukjin Kwon created SPARK-29339: Summary: Support Arrow 0.14 in vectoried dapply and gapply (test it in AppVeyor build) Key: SPARK-29339 URL: https://issues.apache.org/jira/browse/SPARK-29339 Project

[jira] [Assigned] (SPARK-29322) History server is stuck reading incomplete event log file compressed with zstd

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned SPARK-29322: Assignee: Jungtaek Lim > History server is stuck reading incomplete event log file comp

[jira] [Resolved] (SPARK-29322) History server is stuck reading incomplete event log file compressed with zstd

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun resolved SPARK-29322. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25996 [https://

[jira] [Updated] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29328: Affects Version/s: 2.3.4 > Incorrect calculation mean seconds per month >

[jira] [Updated] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29328: Affects Version/s: 2.2.3 > Incorrect calculation mean seconds per month >

[jira] [Updated] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29328: Description: Existing implementation assumes 31 days per month or 372 days per year which is

[jira] [Updated] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun updated SPARK-29328: Affects Version/s: 2.1.3 > Incorrect calculation mean seconds per month >

[jira] [Assigned] (SPARK-29305) Update LICENSE and NOTICE for hadoop 3.2

2019-10-02 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen reassigned SPARK-29305: Assignee: angerszhu > Update LICENSE and NOTICE for hadoop 3.2 >

[jira] [Resolved] (SPARK-29305) Update LICENSE and NOTICE for hadoop 3.2

2019-10-02 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-29305. Fix Version/s: 3.0.0 Resolution: Fixed Issue resolved by pull request 25978 [https://gi

[jira] [Updated] (SPARK-29305) Update LICENSE and NOTICE for hadoop 3.2

2019-10-02 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen updated SPARK-29305: Priority: Minor (was: Major) > Update LICENSE and NOTICE for hadoop 3.2 >

[jira] [Updated] (SPARK-29328) Incorrect calculation mean seconds per month

2019-10-02 Thread Maxim Gekk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-29328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Gekk updated SPARK-29328: Labels: correctness (was: ) > Incorrect calculation mean seconds per month >