[
https://issues.apache.org/jira/browse/SPARK-37068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433211#comment-17433211
]
Sean R. Owen commented on SPARK-37068:
--
Yes, too late to change it, but the 'hadoop
[
https://issues.apache.org/jira/browse/SPARK-37084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon resolved SPARK-37084.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34353
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-37084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon reassigned SPARK-37084:
Assignee: Yang He
> Set spark.sql.files.openCostInBytes to bytesConf
> --
[
https://issues.apache.org/jira/browse/SPARK-37068?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433205#comment-17433205
]
Hyukjin Kwon commented on SPARK-37068:
--
The name of the tar file would have to be c
[
https://issues.apache.org/jira/browse/SPARK-37096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433201#comment-17433201
]
Hyukjin Kwon commented on SPARK-37096:
--
cc [~cloud_fan] FYI
> Where clause and whe
[
https://issues.apache.org/jira/browse/SPARK-37100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon updated SPARK-37100:
-
Fix Version/s: (was: 3.2.1)
> Pandas groupby UDFs would benefit from automatically redistrib
[
https://issues.apache.org/jira/browse/SPARK-37096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon updated SPARK-37096:
-
Priority: Major (was: Critical)
> Where clause and where operator will report error on varchar
Richard Williamson created SPARK-37100:
--
Summary: Pandas groupby UDFs would benefit from automatically
redistributing data on the groupby key in order to prevent network issues
running udf
Key: SPARK-37100
U
[
https://issues.apache.org/jira/browse/SPARK-36554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433137#comment-17433137
]
Nicolas Azrak commented on SPARK-36554:
---
[~lekshmiii] I've added a test to validat
[
https://issues.apache.org/jira/browse/SPARK-36554?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432714#comment-17432714
]
Lekshmi Ramachandran edited comment on SPARK-36554 at 10/22/21, 5:27 PM:
-
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean R. Owen updated SPARK-37091:
-
Priority: Trivial (was: Major)
> Support Java 17 in SparkR SystemRequirements
> ---
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433078#comment-17433078
]
Dongjoon Hyun edited comment on SPARK-37091 at 10/22/21, 5:13 PM:
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun updated SPARK-37091:
--
Fix Version/s: (was: 3.2.1)
> Support Java 17 in SparkR SystemRequirements
> -
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433078#comment-17433078
]
Dongjoon Hyun edited comment on SPARK-37091 at 10/22/21, 5:13 PM:
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433078#comment-17433078
]
Dongjoon Hyun commented on SPARK-37091:
---
BTW, [~Bidek]. Please don't set `Target V
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun updated SPARK-37091:
--
Target Version/s: (was: 3.3.0)
> Support Java 17 in SparkR SystemRequirements
>
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun updated SPARK-37091:
--
Summary: Support Java 17 in SparkR SystemRequirements (was: Bump
SystemRequirements to use Ja
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Darek updated SPARK-37091:
--
Description:
Please bump Java version to <= 17 in
[DESCRIPTION|https://github.com/apache/spark/blob/f9f95686c
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Darek updated SPARK-37091:
--
Description:
Please bump Java version to <= 17 in
[DESCRIPTION|https://github.com/apache/spark/blob/f9f95686c
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Darek updated SPARK-37091:
--
Target Version/s: 3.3.0 (was: 3.2.0)
Affects Version/s: (was: 3.2.0)
3.3.0
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Darek updated SPARK-37091:
--
Parent: SPARK-33772
Issue Type: Sub-task (was: Improvement)
> Bump SystemRequirements to use Java > 1
[
https://issues.apache.org/jira/browse/SPARK-35703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-35703:
-
Summary: Relax constraint for Spark bucket join and remove
HashClusteredDistribution (was: Remove HashC
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37091:
Assignee: Apache Spark
> Bump SystemRequirements to use Java > 11
> -
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433051#comment-17433051
]
Apache Spark commented on SPARK-37091:
--
User 'Bidek56' has created a pull request f
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433053#comment-17433053
]
Apache Spark commented on SPARK-37091:
--
User 'Bidek56' has created a pull request f
[
https://issues.apache.org/jira/browse/SPARK-37091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37091:
Assignee: (was: Apache Spark)
> Bump SystemRequirements to use Java > 11
> --
[
https://issues.apache.org/jira/browse/SPARK-37047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433048#comment-17433048
]
Apache Spark commented on SPARK-37047:
--
User 'cloud-fan' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-37089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17433020#comment-17433020
]
Apache Spark commented on SPARK-37089:
--
User 'ankurdave' has created a pull request
[
https://issues.apache.org/jira/browse/SPARK-37067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wenchen Fan resolved SPARK-37067.
-
Fix Version/s: 3.3.0
3.2.1
Assignee: Linhong Liu
Resolution: F
[
https://issues.apache.org/jira/browse/SPARK-37072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432964#comment-17432964
]
Apache Spark commented on SPARK-37072:
--
User 'LuciferYang' has created a pull reque
[
https://issues.apache.org/jira/browse/SPARK-37072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37072:
Assignee: (was: Apache Spark)
> Pass all UTs in `repl` with Java 17
> ---
[
https://issues.apache.org/jira/browse/SPARK-37072?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432963#comment-17432963
]
Apache Spark commented on SPARK-37072:
--
User 'LuciferYang' has created a pull reque
[
https://issues.apache.org/jira/browse/SPARK-37072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37072:
Assignee: Apache Spark
> Pass all UTs in `repl` with Java 17
> --
[
https://issues.apache.org/jira/browse/SPARK-37006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432914#comment-17432914
]
jinhai commented on SPARK-37006:
hi [~Ngone51], can you review this issue for me?
> Map
[
https://issues.apache.org/jira/browse/SPARK-37006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jinhai updated SPARK-37006:
---
Comment: was deleted
(was: hi [~Ngone51], can you review this issue for me?)
> MapStatus adds localDirs to
[
https://issues.apache.org/jira/browse/SPARK-37006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17429079#comment-17429079
]
jinhai edited comment on SPARK-37006 at 10/22/21, 11:01 AM:
[
https://issues.apache.org/jira/browse/SPARK-37006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
jinhai updated SPARK-37006:
---
Description:
When executing the ShuffleBlockFetcherIterator.fetchHostLocalBlocks method, in
order to obtain
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37099:
Assignee: (was: Apache Spark)
> Impl a rank-based filter to optimize top-k computatio
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432910#comment-17432910
]
Apache Spark commented on SPARK-37099:
--
User 'zhengruifeng' has created a pull requ
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37099:
Assignee: Apache Spark
> Impl a rank-based filter to optimize top-k computation
> ---
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-37099:
-
Description:
in JD, we found that more than 80% usage of window function follows this
pattern:
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-37099:
-
Attachment: skewed_window.png
> Impl a rank-based filter to optimize top-k computation
> ---
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-37099:
-
Description:
in JD, we found that more than 80% usage of window function follows this
pattern:
zhengruifeng created SPARK-37099:
Summary: Impl a rank-based filter to optimize top-k computation
Key: SPARK-37099
URL: https://issues.apache.org/jira/browse/SPARK-37099
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-37099?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
zhengruifeng updated SPARK-37099:
-
Description:
in JD, we found that more than 80% usage of window function follows this
pattern:
[
https://issues.apache.org/jira/browse/SPARK-37016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432902#comment-17432902
]
dohongdayi commented on SPARK-37016:
Anyone care about this issue?
> Publicise Uppe
[
https://issues.apache.org/jira/browse/SPARK-37097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37097:
Assignee: Apache Spark
> yarn-cluster mode, unregister timeout cause spark retry but AM c
[
https://issues.apache.org/jira/browse/SPARK-37097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37097:
Assignee: (was: Apache Spark)
> yarn-cluster mode, unregister timeout cause spark ret
[
https://issues.apache.org/jira/browse/SPARK-37098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37098:
Assignee: Apache Spark
> Alter table properties should invalidate cache
> ---
[
https://issues.apache.org/jira/browse/SPARK-37098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-37098:
Assignee: (was: Apache Spark)
> Alter table properties should invalidate cache
>
[
https://issues.apache.org/jira/browse/SPARK-37097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432892#comment-17432892
]
Apache Spark commented on SPARK-37097:
--
User 'AngersZh' has created a pull requ
[
https://issues.apache.org/jira/browse/SPARK-37098?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17432891#comment-17432891
]
Apache Spark commented on SPARK-37098:
--
User 'ulysses-you' has created a pull reque
XiDuo You created SPARK-37098:
-
Summary: Alter table properties should invalidate cache
Key: SPARK-37098
URL: https://issues.apache.org/jira/browse/SPARK-37098
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-37097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
angerszhu updated SPARK-37097:
--
Description:
1. Cluster mode AM shutdown hook triggered
2. am unregister from RM timeout, but AM shut
[
https://issues.apache.org/jira/browse/SPARK-37073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon resolved SPARK-37073.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34364
[https://gi
[
https://issues.apache.org/jira/browse/SPARK-37073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hyukjin Kwon reassigned SPARK-37073:
Assignee: Yang Jie
> Pass all UTs in `external/avro` with Java 17
> -
angerszhu created SPARK-37097:
-
Summary: yarn-cluster mode, unregister timeout cause spark retry
but AM container exit with code 0
Key: SPARK-37097
URL: https://issues.apache.org/jira/browse/SPARK-37097
P
[
https://issues.apache.org/jira/browse/SPARK-37096?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ye Li updated SPARK-37096:
--
Description:
create table test1(col1 int, col2 varchar(120)) stored as orc;
insert into test1 values(123, 'ab
Ye Li created SPARK-37096:
-
Summary: Where clause and where operator will report error on
varchar column type
Key: SPARK-37096
URL: https://issues.apache.org/jira/browse/SPARK-37096
Project: Spark
I
59 matches
Mail list logo