[jira] [Commented] (SPARK-32619) converting dataframe to dataset for the json schema

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178731#comment-17178731 ] Hyukjin Kwon commented on SPARK-32619: -- I don't think this is a bug in a Spark. We can confirm once

[jira] [Resolved] (SPARK-32619) converting dataframe to dataset for the json schema

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32619. -- Resolution: Invalid > converting dataframe to dataset for the json schema >

[jira] [Resolved] (SPARK-32633) GenericRowWithSchema cannot be cast to GenTraversableOnce

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32633. -- Resolution: Not A Problem > GenericRowWithSchema cannot be cast to GenTraversableOnce >

[jira] [Commented] (SPARK-32633) GenericRowWithSchema cannot be cast to GenTraversableOnce

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178730#comment-17178730 ] Hyukjin Kwon commented on SPARK-32633: -- {{GenericRowWithSchema}} isn't supposed to use it as an

[jira] [Comment Edited] (SPARK-25390) Data source V2 API refactoring

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178729#comment-17178729 ] Hyukjin Kwon edited comment on SPARK-25390 at 8/17/20, 5:44 AM:

[jira] [Commented] (SPARK-25390) Data source V2 API refactoring

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-25390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178729#comment-17178729 ] Hyukjin Kwon commented on SPARK-25390: -- [~Kyrdan], let's interact at

[jira] [Resolved] (SPARK-32611) Querying ORC table in Spark3 using spark.sql.orc.impl=hive produces incorrect when timestamp is present in predicate

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32611. -- Resolution: Cannot Reproduce > Querying ORC table in Spark3 using spark.sql.orc.impl=hive

[jira] [Comment Edited] (SPARK-32187) User Guide - Shipping Python Package

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178728#comment-17178728 ] Hyukjin Kwon edited comment on SPARK-32187 at 8/17/20, 5:39 AM: The

[jira] [Commented] (SPARK-32187) User Guide - Shipping Python Package

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32187?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178728#comment-17178728 ] Hyukjin Kwon commented on SPARK-32187: -- The draft looks good as a start. A couple of comments from

[jira] [Resolved] (SPARK-32626) Do not increase the input metrics when read rdd from cache

2020-08-16 Thread Udbhav Agrawal (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Udbhav Agrawal resolved SPARK-32626. Resolution: Not A Problem > Do not increase the input metrics when read rdd from cache >

[jira] [Commented] (SPARK-32018) Fix UnsafeRow set overflowed decimal

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178714#comment-17178714 ] Apache Spark commented on SPARK-32018: -- User 'gengliangwang' has created a pull request for this

[jira] [Commented] (SPARK-32018) Fix UnsafeRow set overflowed decimal

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178713#comment-17178713 ] Apache Spark commented on SPARK-32018: -- User 'gengliangwang' has created a pull request for this

[jira] [Resolved] (SPARK-32601) Issue in converting an RDD of Arrow RecordBatches in v3.0.0

2020-08-16 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-32601. -- Resolution: Not A Problem > Issue in converting an RDD of Arrow RecordBatches in v3.0.0 >

[jira] [Created] (SPARK-32633) GenericRowWithSchema cannot be cast to GenTraversableOnce

2020-08-16 Thread ImportMengjie (Jira)
ImportMengjie created SPARK-32633: - Summary: GenericRowWithSchema cannot be cast to GenTraversableOnce Key: SPARK-32633 URL: https://issues.apache.org/jira/browse/SPARK-32633 Project: Spark

[jira] [Comment Edited] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread chanduhawk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178560#comment-17178560 ] chanduhawk edited comment on SPARK-32614 at 8/17/20, 3:49 AM: -- [~srowen]

[jira] [Commented] (SPARK-32624) Replace getClass.getName with getClass.getCanonicalName in CodegenContext.addReferenceObj

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178699#comment-17178699 ] Yuming Wang commented on SPARK-32624: - Thank you [~srowen] I have updated the description. >

[jira] [Updated] (SPARK-32624) Replace getClass.getName with getClass.getCanonicalName in CodegenContext.addReferenceObj

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32624: Description: {code:java} scala> Array[Byte](1, 2).getClass.getName res13: String = [B scala>

[jira] [Updated] (SPARK-32624) Replace getClass.getName with getClass.getCanonicalName in CodegenContext.addReferenceObj

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32624: Description: {code:java} scala> Array[Byte](1, 2).getClass.getName res13: String = [B scala>

[jira] [Updated] (SPARK-32624) Replace getClass.getName with getClass.getCanonicalName in CodegenContext.addReferenceObj

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32624: Description: Not all error messages are in {{CodeGenerator}}, such as: {noformat} 20:49:54.885

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When I use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Resolved] (SPARK-32289) Chinese characters are garbled when opening csv files with Excel

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang resolved SPARK-32289. - Resolution: Workaround !Workaround.png! > Chinese characters are garbled when opening csv

[jira] [Commented] (SPARK-32601) Issue in converting an RDD of Arrow RecordBatches in v3.0.0

2020-08-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178691#comment-17178691 ] L. C. Hsieh commented on SPARK-32601: - I think that we changed to use Arrow stream format in

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When i use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When i use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When i use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When i use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Updated] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Liu Dinghua updated SPARK-32632: Description: When i use the jdbc methed {code:java} def jdbc( url: String, table: String,

[jira] [Created] (SPARK-32632) Bad partitioning in spark jdbc method with parameter lowerBound and upperBound

2020-08-16 Thread Liu Dinghua (Jira)
Liu Dinghua created SPARK-32632: --- Summary: Bad partitioning in spark jdbc method with parameter lowerBound and upperBound Key: SPARK-32632 URL: https://issues.apache.org/jira/browse/SPARK-32632

[jira] [Assigned] (SPARK-32631) Handle Null error message in hive ThriftServer UI

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32631: Assignee: Apache Spark > Handle Null error message in hive ThriftServer UI >

[jira] [Commented] (SPARK-32631) Handle Null error message in hive ThriftServer UI

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178680#comment-17178680 ] Apache Spark commented on SPARK-32631: -- User 'tianhanhu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32631) Handle Null error message in hive ThriftServer UI

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32631: Assignee: (was: Apache Spark) > Handle Null error message in hive ThriftServer UI >

[jira] [Assigned] (SPARK-32627) Add showSessionLink parameter to SqlStatsPagedTable class in ThriftServerPage

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32627: Assignee: Apache Spark > Add showSessionLink parameter to SqlStatsPagedTable class in

[jira] [Assigned] (SPARK-32627) Add showSessionLink parameter to SqlStatsPagedTable class in ThriftServerPage

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32627: Assignee: Apache Spark > Add showSessionLink parameter to SqlStatsPagedTable class in

[jira] [Commented] (SPARK-32627) Add showSessionLink parameter to SqlStatsPagedTable class in ThriftServerPage

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178679#comment-17178679 ] Apache Spark commented on SPARK-32627: -- User 'tianhanhu' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32627) Add showSessionLink parameter to SqlStatsPagedTable class in ThriftServerPage

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32627: Assignee: (was: Apache Spark) > Add showSessionLink parameter to SqlStatsPagedTable

[jira] [Created] (SPARK-32631) Handle Null error message in hive ThriftServer UI

2020-08-16 Thread Tianhan Hu (Jira)
Tianhan Hu created SPARK-32631: -- Summary: Handle Null error message in hive ThriftServer UI Key: SPARK-32631 URL: https://issues.apache.org/jira/browse/SPARK-32631 Project: Spark Issue Type:

[jira] [Commented] (SPARK-32342) Kafka events are missing magic byte

2020-08-16 Thread Sridhar Baddela (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178668#comment-17178668 ] Sridhar Baddela commented on SPARK-32342: - This magic byte is specific to Confluent schema

[jira] [Resolved] (SPARK-32399) Support full outer join in shuffled hash join

2020-08-16 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32399?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro resolved SPARK-32399. -- Fix Version/s: 3.1.0 Assignee: Cheng Su Resolution: Fixed Resolved by

[jira] [Commented] (SPARK-32630) Reduce user confusion and subtle bugs by optionally preventing date & timestamp comparison

2020-08-16 Thread Simeon Simeonov (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178624#comment-17178624 ] Simeon Simeonov commented on SPARK-32630: - [~rxin] fyi, one of the subtle issues that add

[jira] [Created] (SPARK-32630) Reduce user confusion and subtle bugs by optionally preventing date & timestamp comparison

2020-08-16 Thread Simeon Simeonov (Jira)
Simeon Simeonov created SPARK-32630: --- Summary: Reduce user confusion and subtle bugs by optionally preventing date & timestamp comparison Key: SPARK-32630 URL: https://issues.apache.org/jira/browse/SPARK-32630

[jira] [Commented] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178588#comment-17178588 ] Sean R. Owen commented on SPARK-32385: -- This requires us fixing every version of every transitive

[jira] [Commented] (SPARK-27708) Add documentation for v2 data sources

2020-08-16 Thread Rafael (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27708?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178583#comment-17178583 ] Rafael commented on SPARK-27708: [~rdblue] [~jlaskowski] Hey guys, I'm trying to migrate my package

[jira] [Commented] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-08-16 Thread Vladimir Matveev (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178581#comment-17178581 ] Vladimir Matveev commented on SPARK-32385: -- [~srowen] a BOM descriptor can be used as a

[jira] [Commented] (SPARK-32611) Querying ORC table in Spark3 using spark.sql.orc.impl=hive produces incorrect when timestamp is present in predicate

2020-08-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178576#comment-17178576 ] L. C. Hsieh commented on SPARK-32611: - I also tested on branch-3.0, but still cannot reproduce it.

[jira] [Commented] (SPARK-32611) Querying ORC table in Spark3 using spark.sql.orc.impl=hive produces incorrect when timestamp is present in predicate

2020-08-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178571#comment-17178571 ] L. C. Hsieh commented on SPARK-32611: - Hm, I build from current master branch, but cannot reproduce

[jira] [Commented] (SPARK-32619) converting dataframe to dataset for the json schema

2020-08-16 Thread L. C. Hsieh (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32619?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178563#comment-17178563 ] L. C. Hsieh commented on SPARK-32619: - Could you show the schema of the dataframe? By

[jira] [Resolved] (SPARK-27249) Developers API for Transformers beyond UnaryTransformer

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-27249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-27249. -- Resolution: Won't Fix > Developers API for Transformers beyond UnaryTransformer >

[jira] [Updated] (SPARK-32629) Record metrics of extra BitSet/HashSet in full outer shuffled hash join

2020-08-16 Thread Cheng Su (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32629?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Cheng Su updated SPARK-32629: - Parent: SPARK-32461 Issue Type: Sub-task (was: Improvement) > Record metrics of extra

[jira] [Created] (SPARK-32629) Record metrics of extra BitSet/HashSet in full outer shuffled hash join

2020-08-16 Thread Cheng Su (Jira)
Cheng Su created SPARK-32629: Summary: Record metrics of extra BitSet/HashSet in full outer shuffled hash join Key: SPARK-32629 URL: https://issues.apache.org/jira/browse/SPARK-32629 Project: Spark

[jira] [Resolved] (SPARK-32205) Writing timestamp in mysql gets fails

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32205. -- Resolution: Not A Problem > Writing timestamp in mysql gets fails >

[jira] [Comment Edited] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread chanduhawk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178560#comment-17178560 ] chanduhawk edited comment on SPARK-32614 at 8/16/20, 5:39 PM: -- *currently

[jira] [Comment Edited] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread chanduhawk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178560#comment-17178560 ] chanduhawk edited comment on SPARK-32614 at 8/16/20, 5:37 PM: -- *currently

[jira] [Comment Edited] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread chanduhawk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178560#comment-17178560 ] chanduhawk edited comment on SPARK-32614 at 8/16/20, 5:37 PM: -- *currently

[jira] [Commented] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread chanduhawk (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178560#comment-17178560 ] chanduhawk commented on SPARK-32614: If one of the rows the data file(CSV) starts with null or

[jira] [Commented] (SPARK-32502) Please fix CVE related to Guava 14.0.1

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178559#comment-17178559 ] Sean R. Owen commented on SPARK-32502: -- Yes it's shaded. The problem is that Hadoop < 3.2.1 and

[jira] [Resolved] (SPARK-32502) Please fix CVE related to Guava 14.0.1

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32502?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32502. -- Resolution: Duplicate > Please fix CVE related to Guava 14.0.1 >

[jira] [Commented] (SPARK-32534) Cannot load a Pipeline Model on a stopped Spark Context

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32534?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178558#comment-17178558 ] Sean R. Owen commented on SPARK-32534: -- Generally speaking, it's not going to work on stop and

[jira] [Commented] (SPARK-32569) Gaussian can not handle data close to MaxDouble

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178557#comment-17178557 ] Sean R. Owen commented on SPARK-32569: -- Please, if you can, narrow down the actual error in the

[jira] [Commented] (SPARK-32578) PageRank not sending the correct values in Pergel sendMessage

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32578?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178554#comment-17178554 ] Sean R. Owen commented on SPARK-32578: -- I feel like this has come up a couple times over the years

[jira] [Commented] (SPARK-32604) Bug in ALSModel Python Documentation

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32604?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178552#comment-17178552 ] Sean R. Owen commented on SPARK-32604: -- Yeah, the docs are generated from the code. The change to

[jira] [Resolved] (SPARK-32604) Bug in ALSModel Python Documentation

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32604. -- Resolution: Duplicate > Bug in ALSModel Python Documentation >

[jira] [Resolved] (SPARK-32610) Fix the link to metrics.dropwizard.io in monitoring.md to refer the proper version

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32610. -- Fix Version/s: 3.1.0 3.0.1 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (SPARK-32612) int columns produce inconsistent results on pandas UDFs

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178543#comment-17178543 ] Sean R. Owen commented on SPARK-32612: -- I don't think it's correct to upgrade it to float in all

[jira] [Commented] (SPARK-32614) Support for treating the line as valid record if it starts with \u0000 or null character, or starts with any character mentioned as comment

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32614?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178537#comment-17178537 ] Sean R. Owen commented on SPARK-32614: -- I dont' really understand this. Are you just saying you

[jira] [Resolved] (SPARK-32618) ORC writer doesn't support colon in column names

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32618. -- Resolution: Invalid Maybe, this is likely a duplicate. You'd have to test, and at least on

[jira] [Commented] (SPARK-32624) Replace getClass.getName with getClass.getCanonicalName in CodegenContext.addReferenceObj

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32624?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178534#comment-17178534 ] Sean R. Owen commented on SPARK-32624: -- Lots of your JIRAs are lacking descriptions or no detail

[jira] [Resolved] (SPARK-32336) 11 Critical & 4 High severity issues in Apcahe Spark 3.0.0 - dependency libraries

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32336. -- Resolution: Invalid Some of these are _Spark_ CVEs that are already resolved. Some do not

[jira] [Commented] (SPARK-32342) Kafka events are missing magic byte

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32342?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178531#comment-17178531 ] Sean R. Owen commented on SPARK-32342: -- Is the magic byte supposed to be part of Avro's spec or

[jira] [Resolved] (SPARK-32359) Implement max_error metric evaluator for spark regression mllib

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean R. Owen resolved SPARK-32359. -- Resolution: Invalid > Implement max_error metric evaluator for spark regression mllib >

[jira] [Commented] (SPARK-32385) Publish a "bill of materials" (BOM) descriptor for Spark with correct versions of various dependencies

2020-08-16 Thread Sean R. Owen (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32385?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178528#comment-17178528 ] Sean R. Owen commented on SPARK-32385: -- What does this record that isn't available from a POM

[jira] [Assigned] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32628: Assignee: (was: Apache Spark) > Use bloom filter to improve dynamicPartitionPruning

[jira] [Assigned] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32628: Assignee: Apache Spark > Use bloom filter to improve dynamicPartitionPruning >

[jira] [Commented] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178515#comment-17178515 ] Apache Spark commented on SPARK-32628: -- User 'wangyum' has created a pull request for this issue:

[jira] [Commented] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178511#comment-17178511 ] Yuming Wang commented on SPARK-32628: - Benchmark: {code:scala} spark.range(200L)

[jira] [Commented] (SPARK-32092) CrossvalidatorModel does not save all submodels (it saves only 3)

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178484#comment-17178484 ] Apache Spark commented on SPARK-32092: -- User 'Louiszr' has created a pull request for this issue:

[jira] [Commented] (SPARK-32092) CrossvalidatorModel does not save all submodels (it saves only 3)

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32092?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178485#comment-17178485 ] Apache Spark commented on SPARK-32092: -- User 'Louiszr' has created a pull request for this issue:

[jira] [Assigned] (SPARK-32092) CrossvalidatorModel does not save all submodels (it saves only 3)

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32092: Assignee: (was: Apache Spark) > CrossvalidatorModel does not save all submodels (it

[jira] [Assigned] (SPARK-32092) CrossvalidatorModel does not save all submodels (it saves only 3)

2020-08-16 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-32092: Assignee: Apache Spark > CrossvalidatorModel does not save all submodels (it saves only

[jira] [Updated] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yuming Wang updated SPARK-32628: Description: It will throw exception when 

[jira] [Created] (SPARK-32628) Use bloom filter to improve dynamicPartitionPruning

2020-08-16 Thread Yuming Wang (Jira)
Yuming Wang created SPARK-32628: --- Summary: Use bloom filter to improve dynamicPartitionPruning Key: SPARK-32628 URL: https://issues.apache.org/jira/browse/SPARK-32628 Project: Spark Issue

[jira] [Commented] (SPARK-32542) Add an optimizer rule to split an Expand into multiple Expands for aggregates

2020-08-16 Thread karl wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178477#comment-17178477 ] karl wang commented on SPARK-32542: --- ok > Add an optimizer rule to split an Expand into multiple

[jira] [Commented] (SPARK-32542) Add an optimizer rule to split an Expand into multiple Expands for aggregates

2020-08-16 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178475#comment-17178475 ] Takeshi Yamamuro commented on SPARK-32542: -- I unset the target/fix version. Please do not set

[jira] [Updated] (SPARK-32542) Add an optimizer rule to split an Expand into multiple Expands for aggregates

2020-08-16 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32542: - Fix Version/s: (was: 3.0.0) > Add an optimizer rule to split an Expand into

[jira] [Updated] (SPARK-32542) Add an optimizer rule to split an Expand into multiple Expands for aggregates

2020-08-16 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32542: - Component/s: (was: Optimizer) SQL > Add an optimizer rule to split

[jira] [Updated] (SPARK-32542) Add an optimizer rule to split an Expand into multiple Expands for aggregates

2020-08-16 Thread Takeshi Yamamuro (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Takeshi Yamamuro updated SPARK-32542: - Target Version/s: (was: 3.0.0) > Add an optimizer rule to split an Expand into

[jira] [Commented] (SPARK-32582) Spark SQL Infer Schema Performance

2020-08-16 Thread Jarred Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32582?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17178445#comment-17178445 ] Jarred Li commented on SPARK-32582: --- ??I am not sure it would be helpful since there is no API in

[jira] [Updated] (SPARK-32125) [UI] Support get taskList by status in Web UI and SHS Rest API

2020-08-16 Thread Xiao Li (Jira)
[ https://issues.apache.org/jira/browse/SPARK-32125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xiao Li updated SPARK-32125: Fix Version/s: 3.1.0 > [UI] Support get taskList by status in Web UI and SHS Rest API >