[
https://issues.apache.org/jira/browse/SPARK-36679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-36679.
--
Fix Version/s: 3.3.0
Resolution: Duplicate
> Remove lz4 hadoop wrapper classes after Hadoop
[
https://issues.apache.org/jira/browse/SPARK-38179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-38179.
--
Resolution: Won't Fix
> Improve WritableColumnVector to better support null struct
>
[
https://issues.apache.org/jira/browse/SPARK-38237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-38237:
Assignee: Cheng Su
> Introduce a new config to require all cluster keys on Aggregate
>
[
https://issues.apache.org/jira/browse/SPARK-38237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-38237.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 35574
Chao Sun created SPARK-38179:
Summary: Improve WritableColumnVector to better support null struct
Key: SPARK-38179
URL: https://issues.apache.org/jira/browse/SPARK-38179
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-38077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17484894#comment-17484894
]
Chao Sun commented on SPARK-38077:
--
BTW [~thesamet] it seems Spark only guarantees API compatibility,
[
https://issues.apache.org/jira/browse/SPARK-38077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17484873#comment-17484873
]
Chao Sun commented on SPARK-38077:
--
Sorry for breaking the binary compatibility. I wasn't aware that
[
https://issues.apache.org/jira/browse/SPARK-37994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17483399#comment-17483399
]
Chao Sun commented on SPARK-37994:
--
Glad it helped [~tanvu]!
{quote}
We can omit the
[
https://issues.apache.org/jira/browse/SPARK-37994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17482632#comment-17482632
]
Chao Sun commented on SPARK-37994:
--
[~tanvu] Hmm in that case maybe you can try:
{code}
[
https://issues.apache.org/jira/browse/SPARK-37994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17481327#comment-17481327
]
Chao Sun commented on SPARK-37994:
--
I considered to add a new Maven profile for Hadoop versions <= 2.x
[
https://issues.apache.org/jira/browse/SPARK-37994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17481326#comment-17481326
]
Chao Sun commented on SPARK-37994:
--
Yes, thanks [~xkrogen] for pinging me. [~tanvu]: can you try this
[
https://issues.apache.org/jira/browse/SPARK-37957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37957:
-
Fix Version/s: 3.2.1
> Deterministic flag is not handled for V2 functions
>
[
https://issues.apache.org/jira/browse/SPARK-37957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37957.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 35243
[
https://issues.apache.org/jira/browse/SPARK-37928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37928:
Assignee: Yang Jie
> Add Parquet Data Page V2 bench scenario to DataSourceReadBenchmark
>
[
https://issues.apache.org/jira/browse/SPARK-37928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37928.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 35226
Chao Sun created SPARK-37957:
Summary: Deterministic flag is not handled for V2 functions
Key: SPARK-37957
URL: https://issues.apache.org/jira/browse/SPARK-37957
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-37864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37864.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 35163
[
https://issues.apache.org/jira/browse/SPARK-37864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37864:
Assignee: Yang Jie
> Support Parquet v2 data page RLE encoding (for Boolean Values) for the
>
[
https://issues.apache.org/jira/browse/SPARK-36879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-36879:
Assignee: Parth Chandra
> Support Parquet v2 data page encodings for the vectorized path
>
[
https://issues.apache.org/jira/browse/SPARK-36879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-36879.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34471
[
https://issues.apache.org/jira/browse/SPARK-37633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37633:
-
Affects Version/s: (was: 3.0.3)
> Unwrap cast should skip if downcast failed with ansi enabled
>
[
https://issues.apache.org/jira/browse/SPARK-37633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37633:
Assignee: Manu Zhang
> Unwrap cast should skip if downcast failed with ansi enabled
>
[
https://issues.apache.org/jira/browse/SPARK-37633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37633.
--
Fix Version/s: 3.3.0
3.2.1
Resolution: Fixed
Issue resolved by pull request
[
https://issues.apache.org/jira/browse/SPARK-37217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37217:
-
Fix Version/s: 3.2.1
> The number of dynamic partitions should early check when writing to external
>
[
https://issues.apache.org/jira/browse/SPARK-37481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37481:
-
Fix Version/s: 3.2.1
(was: 3.2.0)
> Disappearance of skipped stages mislead the
[
https://issues.apache.org/jira/browse/SPARK-37217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37217:
Assignee: dzcxzl
> The number of dynamic partitions should early check when writing to external
[
https://issues.apache.org/jira/browse/SPARK-37217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37217.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34493
[
https://issues.apache.org/jira/browse/SPARK-37573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37573.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34830
[
https://issues.apache.org/jira/browse/SPARK-37573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37573:
Assignee: angerszhu
> IsolatedClient fallbackVersion should be build in version, not always
Chao Sun created SPARK-37600:
Summary: Upgrade to Hadoop 3.3.2
Key: SPARK-37600
URL: https://issues.apache.org/jira/browse/SPARK-37600
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-37561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37561:
Assignee: dzcxzl
> Avoid loading all functions when obtaining hive's DelegationToken
>
[
https://issues.apache.org/jira/browse/SPARK-37561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37561.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34822
[
https://issues.apache.org/jira/browse/SPARK-37205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37205.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34635
[
https://issues.apache.org/jira/browse/SPARK-37205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37205:
Assignee: Chao Sun
> Support mapreduce.job.send-token-conf when starting containers in YARN
>
[
https://issues.apache.org/jira/browse/SPARK-37445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37445:
Assignee: angerszhu
> Update hadoop-profile
> -
>
> Key:
[
https://issues.apache.org/jira/browse/SPARK-37445?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37445.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34715
[
https://issues.apache.org/jira/browse/SPARK-36529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36529:
-
Attachment: (was: image.png)
> Decouple CPU with IO work in vectorized Parquet reader
>
[
https://issues.apache.org/jira/browse/SPARK-36529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36529:
-
Attachment: image.png
> Decouple CPU with IO work in vectorized Parquet reader
>
[
https://issues.apache.org/jira/browse/SPARK-35867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-35867.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34611
[
https://issues.apache.org/jira/browse/SPARK-35867?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-35867:
Assignee: Kazuyuki Tanimura
> Enable vectorized read for
Chao Sun created SPARK-37378:
Summary: Convert V2 Transform expressions into catalyst
expressions and load their associated functions from V2 FunctionCatalog
Key: SPARK-37378
URL:
Chao Sun created SPARK-37377:
Summary: Refactor V2 Partitioning interface and remove deprecated
usage of Distribution
Key: SPARK-37377
URL: https://issues.apache.org/jira/browse/SPARK-37377
Project:
Chao Sun created SPARK-37376:
Summary: Introduce a new DataSource V2 interface HasPartitionKey
Key: SPARK-37376
URL: https://issues.apache.org/jira/browse/SPARK-37376
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37166:
-
Parent: SPARK-37375
Issue Type: Sub-task (was: New Feature)
> SPIP: Storage Partitioned Join
>
Chao Sun created SPARK-37375:
Summary: Umbrella: Storage Partitioned Join
Key: SPARK-37375
URL: https://issues.apache.org/jira/browse/SPARK-37375
Project: Spark
Issue Type: New Feature
[
https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37166.
--
Fix Version/s: 3.3.0
Assignee: Chao Sun
Resolution: Fixed
> SPIP: Storage Partitioned
[
https://issues.apache.org/jira/browse/SPARK-37342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37342:
-
Component/s: Build
(was: Spark Core)
> Upgrade Apache Arrow to 6.0.0
>
Chao Sun created SPARK-37342:
Summary: Upgrade Apache Arrow to 6.0.0
Key: SPARK-37342
URL: https://issues.apache.org/jira/browse/SPARK-37342
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-37239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37239.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34520
[
https://issues.apache.org/jira/browse/SPARK-37239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-37239:
Assignee: Yang Jie
> Avoid unnecessary `setReplication` in Yarn mode
>
[
https://issues.apache.org/jira/browse/SPARK-35437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-35437:
-
Priority: Major (was: Minor)
> Use expressions to filter Hive partitions at client side
>
[
https://issues.apache.org/jira/browse/SPARK-35437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-35437.
--
Resolution: Fixed
Issue resolved by pull request 34431
[https://github.com/apache/spark/pull/34431]
[
https://issues.apache.org/jira/browse/SPARK-35437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-35437:
Assignee: dzcxzl
> Use expressions to filter Hive partitions at client side
>
[
https://issues.apache.org/jira/browse/SPARK-36998?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17440066#comment-17440066
]
Chao Sun commented on SPARK-36998:
--
Fixed
> Handle concurrent eviction of same application in SHS
>
[
https://issues.apache.org/jira/browse/SPARK-36998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun reassigned SPARK-36998:
Assignee: Thejdeep Gudivada (was: Thejdeep)
> Handle concurrent eviction of same application in
[
https://issues.apache.org/jira/browse/SPARK-37220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17440042#comment-17440042
]
Chao Sun commented on SPARK-37220:
--
Thanks [~hyukjin.kwon]!
> Do not split input file for Parquet
[
https://issues.apache.org/jira/browse/SPARK-37220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37220.
--
Fix Version/s: 3.3.0
Resolution: Fixed
> Do not split input file for Parquet reader with
[
https://issues.apache.org/jira/browse/SPARK-37218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17439554#comment-17439554
]
Chao Sun commented on SPARK-37218:
--
[~dongjoon] please assign this to yourself - somehow I can't do it.
[
https://issues.apache.org/jira/browse/SPARK-37218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun resolved SPARK-37218.
--
Fix Version/s: 3.3.0
Resolution: Fixed
Issue resolved by pull request 34496
[
https://issues.apache.org/jira/browse/SPARK-37205?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-37205:
-
Description: {{mapreduce.job.send-token-conf}} is a useful feature in
Hadoop (see
Chao Sun created SPARK-37205:
Summary: Support mapreduce.job.send-token-conf when starting
containers in YARN
Key: SPARK-37205
URL: https://issues.apache.org/jira/browse/SPARK-37205
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-37166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17436963#comment-17436963
]
Chao Sun commented on SPARK-37166:
--
[~xkrogen] sure just linked.
> SPIP: Storage Partitioned Join
>
Chao Sun created SPARK-37166:
Summary: SPIP: Storage Partitioned Join
Key: SPARK-37166
URL: https://issues.apache.org/jira/browse/SPARK-37166
Project: Spark
Issue Type: New Feature
Chao Sun created SPARK-37113:
Summary: Upgrade Parquet to 1.12.2
Key: SPARK-37113
URL: https://issues.apache.org/jira/browse/SPARK-37113
Project: Spark
Issue Type: Improvement
[
https://issues.apache.org/jira/browse/SPARK-35703?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-35703:
-
Summary: Relax constraint for Spark bucket join and remove
HashClusteredDistribution (was: Remove
[
https://issues.apache.org/jira/browse/SPARK-37069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17432624#comment-17432624
]
Chao Sun commented on SPARK-37069:
--
Thanks for the ping [~zhouyifan279]! yes this is a bug, and let me
[
https://issues.apache.org/jira/browse/SPARK-35640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17428522#comment-17428522
]
Chao Sun commented on SPARK-35640:
--
[~catalinii] this change seems unrelated since it's only in Spark
[
https://issues.apache.org/jira/browse/SPARK-36936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17426255#comment-17426255
]
Chao Sun commented on SPARK-36936:
--
[~colin.williams] Spark 3.2.0 is not released yet - it will be
[
https://issues.apache.org/jira/browse/SPARK-36936?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17425162#comment-17425162
]
Chao Sun commented on SPARK-36936:
--
[~colin.williams] which version of {{spark-hadoop-cloud}} you were
[
https://issues.apache.org/jira/browse/SPARK-36891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36891:
-
Parent: SPARK-35743
Issue Type: Sub-task (was: Test)
> Refactor
Chao Sun created SPARK-36935:
Summary: Enhance ParquetSchemaConverter to capture Parquet
repetition & definition level
Key: SPARK-36935
URL: https://issues.apache.org/jira/browse/SPARK-36935
Project:
Chao Sun created SPARK-36891:
Summary: Add new test suite to cover Parquet decoding
Key: SPARK-36891
URL: https://issues.apache.org/jira/browse/SPARK-36891
Project: Spark
Issue Type: Test
Chao Sun created SPARK-36879:
Summary: Support Parquet v2 data page encodings for the vectorized
path
Key: SPARK-36879
URL: https://issues.apache.org/jira/browse/SPARK-36879
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-36873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36873:
-
Issue Type: Bug (was: Improvement)
> Add provided Guava dependency for network-yarn module
>
[
https://issues.apache.org/jira/browse/SPARK-36873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36873:
-
Description:
In Spark 3.1 and earlier the network-yarn module implicitly relies on guava
from
[
https://issues.apache.org/jira/browse/SPARK-36873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36873:
-
Description:
In Spark 3.1 and earlier the network-yarn module implicitly relies on guava
from
Chao Sun created SPARK-36873:
Summary: Add provided Guava dependency for network-yarn module
Key: SPARK-36873
URL: https://issues.apache.org/jira/browse/SPARK-36873
Project: Spark
Issue Type:
Chao Sun created SPARK-36863:
Summary: Update dependency manifests for all released artifacts
Key: SPARK-36863
URL: https://issues.apache.org/jira/browse/SPARK-36863
Project: Spark
Issue Type:
[
https://issues.apache.org/jira/browse/SPARK-36835?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17419499#comment-17419499
]
Chao Sun commented on SPARK-36835:
--
Sorry for the regression [~joshrosen]. I forgot exactly why I added
[
https://issues.apache.org/jira/browse/SPARK-36828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36828:
-
Issue Type: Improvement (was: Bug)
> Remove Guava from Spark binary distribution
>
Chao Sun created SPARK-36828:
Summary: Remove Guava from Spark binary distribution
Key: SPARK-36828
URL: https://issues.apache.org/jira/browse/SPARK-36828
Project: Spark
Issue Type: Bug
Chao Sun created SPARK-36820:
Summary: Disable LZ4 test for Hadoop 2.7 profile
Key: SPARK-36820
URL: https://issues.apache.org/jira/browse/SPARK-36820
Project: Spark
Issue Type: Bug
[
https://issues.apache.org/jira/browse/SPARK-36820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36820:
-
Issue Type: Test (was: Bug)
> Disable LZ4 test for Hadoop 2.7 profile
>
[
https://issues.apache.org/jira/browse/SPARK-36726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36726:
-
Priority: Blocker (was: Major)
> Upgrade Parquet to 1.12.1
> -
>
>
Chao Sun created SPARK-36726:
Summary: Upgrade Parquet to 1.12.1
Key: SPARK-36726
URL: https://issues.apache.org/jira/browse/SPARK-36726
Project: Spark
Issue Type: Bug
Components: SQL
[
https://issues.apache.org/jira/browse/SPARK-35959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17412897#comment-17412897
]
Chao Sun commented on SPARK-35959:
--
[~hyukjin.kwon] No I don't think it qualifies as blocker anymore.
[
https://issues.apache.org/jira/browse/SPARK-35959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-35959:
-
Priority: Major (was: Blocker)
> Add a new Maven profile "no-shaded-client" for older Hadoop 3.x
[
https://issues.apache.org/jira/browse/SPARK-36696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17412167#comment-17412167
]
Chao Sun commented on SPARK-36696:
--
[
https://issues.apache.org/jira/browse/SPARK-36696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17412164#comment-17412164
]
Chao Sun commented on SPARK-36696:
--
This looks like the same issue as in PARQUET-2078. The file offset
Chao Sun created SPARK-36695:
Summary: Allow passing V2 functions to data sources via V2 filters
Key: SPARK-36695
URL: https://issues.apache.org/jira/browse/SPARK-36695
Project: Spark
Issue
[
https://issues.apache.org/jira/browse/SPARK-36676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17410726#comment-17410726
]
Chao Sun commented on SPARK-36676:
--
Will post a PR soon
> Create shaded Hive module and upgrade to
Chao Sun created SPARK-36676:
Summary: Create shaded Hive module and upgrade to higher version
of Guava
Key: SPARK-36676
URL: https://issues.apache.org/jira/browse/SPARK-36676
Project: Spark
[
https://issues.apache.org/jira/browse/SPARK-34276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17408324#comment-17408324
]
Chao Sun commented on SPARK-34276:
--
I did some study on the code and it seems this will only affect
[
https://issues.apache.org/jira/browse/SPARK-34276?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17407554#comment-17407554
]
Chao Sun commented on SPARK-34276:
--
[~smilegator] yea seems like Spark will be affected. cc
[
https://issues.apache.org/jira/browse/SPARK-36528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36528:
-
Description: Currently Spark first decode (e.g., RLE/bit-packed, PLAIN)
into column vector and then
[
https://issues.apache.org/jira/browse/SPARK-36528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36528:
-
Description: Currently Spark first decode (e.g., RLE/bit-packed, PLAIN)
into column vector and then
[
https://issues.apache.org/jira/browse/SPARK-36527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chao Sun updated SPARK-36527:
-
Description: At the moment the Parquet vectorized reader will eagerly
decode all the columns that are
Chao Sun created SPARK-36529:
Summary: Decouple CPU with IO work in vectorized Parquet reader
Key: SPARK-36529
URL: https://issues.apache.org/jira/browse/SPARK-36529
Project: Spark
Issue Type:
Chao Sun created SPARK-36528:
Summary: Implement lazy decoding for the vectorized Parquet reader
Key: SPARK-36528
URL: https://issues.apache.org/jira/browse/SPARK-36528
Project: Spark
Issue
Chao Sun created SPARK-36527:
Summary: Implement lazy materialization for the vectorized Parquet
reader
Key: SPARK-36527
URL: https://issues.apache.org/jira/browse/SPARK-36527
Project: Spark
201 - 300 of 471 matches
Mail list logo