[jira] [Resolved] (SPARK-37150) Migrate DESCRIBE NAMESPACE to use V2 command by default

2021-10-31 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-37150. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34429

[jira] [Assigned] (SPARK-37150) Migrate DESCRIBE NAMESPACE to use V2 command by default

2021-10-31 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan reassigned SPARK-37150: --- Assignee: Terry Kim > Migrate DESCRIBE NAMESPACE to use V2 command by default >

[jira] [Resolved] (SPARK-37160) Add a config to optionally disable paddin for char type

2021-10-31 Thread Wenchen Fan (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wenchen Fan resolved SPARK-37160. - Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34436

[jira] [Commented] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-10-31 Thread Chungmin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436605#comment-17436605 ] Chungmin commented on SPARK-37172: -- I'm implementing data skipping in Delta Lake and it can handle

[jira] [Commented] (SPARK-37173) Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern

2021-10-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436600#comment-17436600 ] Apache Spark commented on SPARK-37173: -- User 'AngersZh' has created a pull request for this

[jira] [Assigned] (SPARK-37173) Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern

2021-10-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37173: Assignee: (was: Apache Spark) > Optimize GetFunctionsOperation to get builtin

[jira] [Commented] (SPARK-37173) Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern

2021-10-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436599#comment-17436599 ] Apache Spark commented on SPARK-37173: -- User 'AngersZh' has created a pull request for this

[jira] [Assigned] (SPARK-37173) Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern

2021-10-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Spark reassigned SPARK-37173: Assignee: Apache Spark > Optimize GetFunctionsOperation to get builtin functions only

[jira] [Commented] (SPARK-37175) Performance improvement to hash joins with many duplicate keys

2021-10-31 Thread Sumeet (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436588#comment-17436588 ] Sumeet commented on SPARK-37175: I am working on this. > Performance improvement to hash joins with

[jira] [Resolved] (SPARK-37170) Pin PySpark version installed in the Binder environment for tagged commit

2021-10-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37170. -- Fix Version/s: 3.3.0 3.2.1 Assignee: Kousuke Saruta (was: Apache

[jira] [Commented] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-10-31 Thread Yuming Wang (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436574#comment-17436574 ] Yuming Wang commented on SPARK-37172: - Is there a data source can handle these filters? > Push down

[jira] [Resolved] (SPARK-37129) Supplement all micro benchmark results use to Java 17

2021-10-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon resolved SPARK-37129. -- Fix Version/s: 3.3.0 Resolution: Fixed Issue resolved by pull request 34418

[jira] [Assigned] (SPARK-37129) Supplement all micro benchmark results use to Java 17

2021-10-31 Thread Hyukjin Kwon (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hyukjin Kwon reassigned SPARK-37129: Assignee: Yang Jie > Supplement all micro benchmark results use to Java 17 >

[jira] [Updated] (SPARK-37175) Performance improvement to hash joins with many duplicate keys

2021-10-31 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-37175: -- Description: I noticed that HashedRelations with many duplicate keys perform significantly

[jira] [Updated] (SPARK-37175) Performance improvement to hash joins with many duplicate keys

2021-10-31 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-37175: -- Description: I noticed that HashedRelations with many duplicate keys perform significantly

[jira] [Updated] (SPARK-37175) Performance improvement to hash joins with many duplicate keys

2021-10-31 Thread Bruce Robbins (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bruce Robbins updated SPARK-37175: -- Attachment: hash_rel_examples.txt > Performance improvement to hash joins with many duplicate

[jira] [Created] (SPARK-37175) Performance improvement to hash joins with many duplicate keys

2021-10-31 Thread Bruce Robbins (Jira)
Bruce Robbins created SPARK-37175: - Summary: Performance improvement to hash joins with many duplicate keys Key: SPARK-37175 URL: https://issues.apache.org/jira/browse/SPARK-37175 Project: Spark

[jira] [Updated] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2021-10-31 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-37174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-37174: Description: Hi I use this code   {code:java} f01 =

[jira] [Updated] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2021-10-31 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-37174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-37174: Description: Hi I use this code  {code:java} f01 =

[jira] [Updated] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2021-10-31 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-37174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-37174: Description: Hi I use this code  {code:java} f01 =

[jira] [Updated] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2021-10-31 Thread Jira
[ https://issues.apache.org/jira/browse/SPARK-37174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bjørn Jørgensen updated SPARK-37174: Description: Hi I use this code  {code:java} // f01 =

[jira] [Created] (SPARK-37174) WARN WindowExec: No Partition Defined is being printed 4 times.

2021-10-31 Thread Jira
Bjørn Jørgensen created SPARK-37174: --- Summary: WARN WindowExec: No Partition Defined is being printed 4 times. Key: SPARK-37174 URL: https://issues.apache.org/jira/browse/SPARK-37174 Project:

[jira] [Resolved] (SPARK-37171) Addition of forany and forall semantics to Spark Dataframes

2021-10-31 Thread Dhiren Navani (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dhiren Navani resolved SPARK-37171. --- Resolution: Won't Fix > Addition of forany and forall semantics to Spark Dataframes >

[jira] [Commented] (SPARK-37038) Sample push down in DS v2

2021-10-31 Thread Apache Spark (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37038?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436502#comment-17436502 ] Apache Spark commented on SPARK-37038: -- User 'huaxingao' has created a pull request for this issue:

[jira] [Created] (SPARK-37173) Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern

2021-10-31 Thread angerszhu (Jira)
angerszhu created SPARK-37173: - Summary: Optimize GetFunctionsOperation to get builtin functions only once when use wild function name pattern Key: SPARK-37173 URL: https://issues.apache.org/jira/browse/SPARK-37173

[jira] [Updated] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-10-31 Thread Chungmin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chungmin updated SPARK-37172: - Description: Currently, filters having both partitioning and non-partitioning columns are lost during

[jira] [Updated] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-10-31 Thread Chungmin (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chungmin updated SPARK-37172: - Description: Currently, filters having both partitioning and non-partitioning columns are lost during

[jira] [Created] (SPARK-37172) Push down filters having both partitioning and non-partitioning columns

2021-10-31 Thread Chungmin (Jira)
Chungmin created SPARK-37172: Summary: Push down filters having both partitioning and non-partitioning columns Key: SPARK-37172 URL: https://issues.apache.org/jira/browse/SPARK-37172 Project: Spark

[jira] [Updated] (SPARK-37170) Pin PySpark version installed in the Binder environment for tagged commit

2021-10-31 Thread Kousuke Saruta (Jira)
[ https://issues.apache.org/jira/browse/SPARK-37170?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kousuke Saruta updated SPARK-37170: --- Summary: Pin PySpark version installed in the Binder environment for tagged commit (was: