[
https://issues.apache.org/jira/browse/SPARK-44379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-44379:
Description:
Context: After migrating to Spark 3 with AQE, we saw a significant increase
[
https://issues.apache.org/jira/browse/SPARK-44379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17742126#comment-17742126
]
Shardul Mahadik commented on SPARK-44379:
-
cc: [~cloud_fan] [~joshrosen] [~mridul] Would be
[
https://issues.apache.org/jira/browse/SPARK-44379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-44379:
Description:
Context: After migrating to Spark 3 with AQE, we saw a significant increase
[
https://issues.apache.org/jira/browse/SPARK-44379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-44379:
Attachment: screenshot-1.png
> Broadcast Joins taking up too much memory
>
[
https://issues.apache.org/jira/browse/SPARK-44379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-44379:
Attachment: screenshot-2.png
> Broadcast Joins taking up too much memory
>
Shardul Mahadik created SPARK-44379:
---
Summary: Broadcast Joins taking up too much memory
Key: SPARK-44379
URL: https://issues.apache.org/jira/browse/SPARK-44379
Project: Spark
Issue Type:
Shardul Mahadik created SPARK-42290:
---
Summary: Spark Driver hangs on OOM during Broadcast when AQE is
enabled
Key: SPARK-42290
URL: https://issues.apache.org/jira/browse/SPARK-42290
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-41557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17679004#comment-17679004
]
Shardul Mahadik commented on SPARK-41557:
-
Confirmed that the test works fine in master now.
[
https://issues.apache.org/jira/browse/SPARK-41557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik resolved SPARK-41557.
-
Resolution: Fixed
> Union of tables with and without metadata column fails when used in
[
https://issues.apache.org/jira/browse/SPARK-41557?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648866#comment-17648866
]
Shardul Mahadik commented on SPARK-41557:
-
cc: [~Gengliang.Wang] [~cloud_fan]
> Union of
[
https://issues.apache.org/jira/browse/SPARK-41557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-41557:
Description:
Here is a test case that can be added to {{MetadataColumnSuite}} to
[
https://issues.apache.org/jira/browse/SPARK-41557?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-41557:
Description:
Here is a test case that can be added to {{MetadataColumnSuite}} to
Shardul Mahadik created SPARK-41557:
---
Summary: Union of tables with and without metadata column fails
when used in join
Key: SPARK-41557
URL: https://issues.apache.org/jira/browse/SPARK-41557
[
https://issues.apache.org/jira/browse/SPARK-41162?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-41162:
Labels: correctness (was: )
> Anti-join must not be pushed below aggregation with
[
https://issues.apache.org/jira/browse/SPARK-41162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648751#comment-17648751
]
Shardul Mahadik edited comment on SPARK-41162 at 12/16/22 6:27 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-41162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17648751#comment-17648751
]
Shardul Mahadik commented on SPARK-41162:
-
[~cloud_fan] Can you help take a look at this? This
[
https://issues.apache.org/jira/browse/SPARK-40262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17601021#comment-17601021
]
Shardul Mahadik commented on SPARK-40262:
-
[~cloud_fan] [~viirya] [~joshrosen] Gentle ping on
[
https://issues.apache.org/jira/browse/SPARK-40262?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17597436#comment-17597436
]
Shardul Mahadik commented on SPARK-40262:
-
cc: [~cloud_fan] [~xkrogen] [~mridulm80]
>
Shardul Mahadik created SPARK-40262:
---
Summary: Expensive UDF evaluation pushed down past a join leads to
performance issues
Key: SPARK-40262
URL: https://issues.apache.org/jira/browse/SPARK-40262
[
https://issues.apache.org/jira/browse/SPARK-35253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17524638#comment-17524638
]
Shardul Mahadik edited comment on SPARK-35253 at 4/20/22 12:31 AM:
---
Hi
[
https://issues.apache.org/jira/browse/SPARK-35253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17524638#comment-17524638
]
Shardul Mahadik commented on SPARK-35253:
-
Hi folks. This issue found in SPARK-35578 is now
[
https://issues.apache.org/jira/browse/SPARK-35253?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17513801#comment-17513801
]
Shardul Mahadik commented on SPARK-35253:
-
Hi folks, what is the path forward for this ticket?
Shardul Mahadik created SPARK-38510:
---
Summary: Failure fetching JSON representation of Spark plans with
Hive UDFs
Key: SPARK-38510
URL: https://issues.apache.org/jira/browse/SPARK-38510
Project:
[
https://issues.apache.org/jira/browse/SPARK-38030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17489016#comment-17489016
]
Shardul Mahadik commented on SPARK-38030:
-
During the PR reviews, we used a different approach
[
https://issues.apache.org/jira/browse/SPARK-38030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17482233#comment-17482233
]
Shardul Mahadik commented on SPARK-38030:
-
I plan to create a PR to change the canonicalization
Shardul Mahadik created SPARK-38030:
---
Summary: Query with cast containing non-nullable columns fails
with AQE on Spark 3.1.1
Key: SPARK-38030
URL: https://issues.apache.org/jira/browse/SPARK-38030
Shardul Mahadik created SPARK-37822:
---
Summary: SQL function `split` should return an array of
non-nullable elements
Key: SPARK-37822
URL: https://issues.apache.org/jira/browse/SPARK-37822
Project:
Shardul Mahadik created SPARK-37602:
---
Summary: Add config property to set default Spark listeners
Key: SPARK-37602
URL: https://issues.apache.org/jira/browse/SPARK-37602
Project: Spark
Shardul Mahadik created SPARK-37569:
---
Summary: View Analysis incorrectly marks nested fields as nullable
Key: SPARK-37569
URL: https://issues.apache.org/jira/browse/SPARK-37569
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17434515#comment-17434515
]
Shardul Mahadik commented on SPARK-36877:
-
Was able to get around this by re-using the RDD for
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik resolved SPARK-36877.
-
Resolution: Not A Problem
> Calling ds.rdd with AQE enabled leads to jobs being run,
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17427825#comment-17427825
]
Shardul Mahadik commented on SPARK-36877:
-
{quote} Getting RDD means the physical plan is
[
https://issues.apache.org/jira/browse/SPARK-36905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17422966#comment-17422966
]
Shardul Mahadik commented on SPARK-36905:
-
This cannot be reproduced with a view created from
[
https://issues.apache.org/jira/browse/SPARK-36905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17422958#comment-17422958
]
Shardul Mahadik edited comment on SPARK-36905 at 9/30/21, 6:21 PM:
---
[
https://issues.apache.org/jira/browse/SPARK-36905?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17422958#comment-17422958
]
Shardul Mahadik commented on SPARK-36905:
-
This worked fine prior to
[
https://issues.apache.org/jira/browse/SPARK-36905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36905:
Description:
Consider a Hive view in which some columns are not explicitly named
[
https://issues.apache.org/jira/browse/SPARK-36905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36905:
Description:
Consider a Hive view in which some columns are not explicitly named
Shardul Mahadik created SPARK-36905:
---
Summary: Reading Hive view without explicit column names fails in
Spark
Key: SPARK-36905
URL: https://issues.apache.org/jira/browse/SPARK-36905
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-35874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17421780#comment-17421780
]
Shardul Mahadik commented on SPARK-35874:
-
[~dongjoon] Should this be linked in SPARK-33828?
>
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36877:
Summary: Calling ds.rdd with AQE enabled leads to jobs being run,
eventually causing
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17421510#comment-17421510
]
Shardul Mahadik commented on SPARK-36877:
-
cc: [~cloud_fan] [~mridulm80]
> Calling ds.rdd with
Shardul Mahadik created SPARK-36877:
---
Summary: Calling ds.rdd with AQE enabled leads to jobs being
run, eventually causing reruns
Key: SPARK-36877
URL: https://issues.apache.org/jira/browse/SPARK-36877
[
https://issues.apache.org/jira/browse/SPARK-36877?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36877:
Attachment: Screen Shot 2021-09-28 at 09.32.20.png
> Calling ds.rdd with AQE enabled
[
https://issues.apache.org/jira/browse/SPARK-36673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17410477#comment-17410477
]
Shardul Mahadik commented on SPARK-36673:
-
[~mgaido] [~cloud_fan] Since you guys were involved
[
https://issues.apache.org/jira/browse/SPARK-36673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36673:
Description:
If a nested field has different casing on two sides of the union, the
[
https://issues.apache.org/jira/browse/SPARK-36673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-36673:
Description:
If a nested field has different casing on two sides of the union, the
Shardul Mahadik created SPARK-36673:
---
Summary: Incorrect Unions of struct with mismatched field name case
Key: SPARK-36673
URL: https://issues.apache.org/jira/browse/SPARK-36673
Project: Spark
Shardul Mahadik created SPARK-36215:
---
Summary: Add logging for slow fetches to diagnose external shuffle
service issues
Key: SPARK-36215
URL: https://issues.apache.org/jira/browse/SPARK-36215
[
https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380170#comment-17380170
]
Shardul Mahadik edited comment on SPARK-28266 at 7/13/21, 9:27 PM:
---
I
[
https://issues.apache.org/jira/browse/SPARK-28266?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380170#comment-17380170
]
Shardul Mahadik commented on SPARK-28266:
-
I would like to propose another angle to look at the
Shardul Mahadik created SPARK-35074:
---
Summary: spark.jars.xxx configs should be moved to
config/package.scala
Key: SPARK-35074
URL: https://issues.apache.org/jira/browse/SPARK-35074
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-35072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-35072:
Description:
During reviews of SPARK-34472, there was a desire to support local:// and
Shardul Mahadik created SPARK-35073:
---
Summary: SparkContext.addJar with an ivy path should not fail with
a custom ivySettings file in non-YARN cluster modes
Key: SPARK-35073
URL:
Shardul Mahadik created SPARK-35072:
---
Summary: spark.jars.ivysettings should support local:// and
hdfs:// schemes
Key: SPARK-35072
URL: https://issues.apache.org/jira/browse/SPARK-35072
Project:
Shardul Mahadik created SPARK-34624:
---
Summary: Filter non-jar dependencies from ivy/maven coordinates
Key: SPARK-34624
URL: https://issues.apache.org/jira/browse/SPARK-34624
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-34472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17289258#comment-17289258
]
Shardul Mahadik commented on SPARK-34472:
-
[~xkrogen] raised a good point at
[
https://issues.apache.org/jira/browse/SPARK-34506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-34506:
Description:
SPARK-33084 added the ability to use ivy coordinates with
[
https://issues.apache.org/jira/browse/SPARK-34506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-34506:
Summary: ADD JAR with ivy coordinates should be compatible with Hive
transitive behavior
[
https://issues.apache.org/jira/browse/SPARK-34506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shardul Mahadik updated SPARK-34506:
Summary: ADD JAR with ivy coordinates should be compatible with Hive
behavior (was: ADD
Shardul Mahadik created SPARK-34506:
---
Summary: ADD JAR with ivy coordinates should transitively fetch
dependencies by default
Key: SPARK-34506
URL: https://issues.apache.org/jira/browse/SPARK-34506
Shardul Mahadik created SPARK-34477:
---
Summary: Kryo NPEs when serializing Avro GenericData objects
(except GenericRecord)
Key: SPARK-34477
URL: https://issues.apache.org/jira/browse/SPARK-34477
[
https://issues.apache.org/jira/browse/SPARK-34472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17286977#comment-17286977
]
Shardul Mahadik commented on SPARK-34472:
-
I will be sending a PR for this soon.
>
Shardul Mahadik created SPARK-34472:
---
Summary: SparkContext.addJar with an ivy path fails in cluster
mode with a custom ivySettings file
Key: SPARK-34472
URL: https://issues.apache.org/jira/browse/SPARK-34472