[jira] [Resolved] (HIVE-28102) Invoke validateDataFilesExist for RowDelta operations

2024-03-04 Thread Ayush Saxena (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ayush Saxena resolved HIVE-28102.
-
Fix Version/s: 4.1.0
   Resolution: Fixed

> Invoke validateDataFilesExist for RowDelta operations
> -
>
> Key: HIVE-28102
> URL: https://issues.apache.org/jira/browse/HIVE-28102
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltán Borók-Nagy
>Assignee: Ayush Saxena
>Priority: Major
>  Labels: iceberg, pull-request-available
> Fix For: 4.1.0
>
>
> Hive must invoke validateDataFilesExist for RowDelta operations 
> (DELETE/UPDATE/MERGE).
> Without this a concurrent RewriteFiles (compaction) and RowDelta can corrupt 
> a table.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HIVE-28102) Iceberg: Invoke validateDataFilesExist for RowDelta operations

2024-03-04 Thread Ayush Saxena (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ayush Saxena updated HIVE-28102:

Summary: Iceberg: Invoke validateDataFilesExist for RowDelta operations  
(was: Invoke validateDataFilesExist for RowDelta operations)

> Iceberg: Invoke validateDataFilesExist for RowDelta operations
> --
>
> Key: HIVE-28102
> URL: https://issues.apache.org/jira/browse/HIVE-28102
> Project: Hive
>  Issue Type: Bug
>Reporter: Zoltán Borók-Nagy
>Assignee: Ayush Saxena
>Priority: Major
>  Labels: iceberg, pull-request-available
> Fix For: 4.1.0
>
>
> Hive must invoke validateDataFilesExist for RowDelta operations 
> (DELETE/UPDATE/MERGE).
> Without this a concurrent RewriteFiles (compaction) and RowDelta can corrupt 
> a table.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HIVE-28097) Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, HIVE-24113, HIVE-25458, HI

2024-03-04 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-28097:
--
Labels: pull-request-available  (was: )

> Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, 
> HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, 
> HIVE-24113, HIVE-25458, HIVE-25577, HIVE-25403
> --
>
> Key: HIVE-28097
> URL: https://issues.apache.org/jira/browse/HIVE-28097
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 3.2.0
>Reporter: Aman Raj
>Assignee: Aman Raj
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Resolved] (HIVE-28097) Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, HIVE-24113, HIVE-25458, H

2024-03-04 Thread Sankar Hariappan (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sankar Hariappan resolved HIVE-28097.
-
Fix Version/s: 3.2.0
   Resolution: Fixed

> Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, 
> HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, 
> HIVE-24113, HIVE-25458, HIVE-25577, HIVE-25403
> --
>
> Key: HIVE-28097
> URL: https://issues.apache.org/jira/browse/HIVE-28097
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 3.2.0
>Reporter: Aman Raj
>Assignee: Aman Raj
>Priority: Major
>  Labels: pull-request-available
> Fix For: 3.2.0
>
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HIVE-28098) Fails to copy empty column statistics of materialized CTE

2024-03-04 Thread okumin (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

okumin updated HIVE-28098:
--
Status: Patch Available  (was: Open)

I created a pull request.

https://github.com/apache/hive/pull/5106

> Fails to copy empty column statistics of materialized CTE
> -
>
> Key: HIVE-28098
> URL: https://issues.apache.org/jira/browse/HIVE-28098
> Project: Hive
>  Issue Type: Bug
>  Components: Query Planning
>Reporter: okumin
>Assignee: okumin
>Priority: Major
>  Labels: pull-request-available
>
> HIVE-28080 introduced the optimization of materialized CTEs, but it turned 
> out that it failed when statistics were empty.
> This query reproduces the issue.
> {code:java}
> set hive.stats.autogather=false;
> CREATE TABLE src_no_stats AS SELECT '123' as key, 'val123' as value UNION ALL 
> SELECT '9' as key, 'val9' as value;
> set hive.optimize.cte.materialize.threshold=2;
> set hive.optimize.cte.materialize.full.aggregate.only=false;
> EXPLAIN WITH materialized_cte1 AS (
>   SELECT * FROM src_no_stats
> ),
> materialized_cte2 AS (
>   SELECT a.key
>   FROM materialized_cte1 a
>   JOIN materialized_cte1 b ON (a.key = b.key)
> )
> SELECT a.key
> FROM materialized_cte2 a
> JOIN materialized_cte2 b ON (a.key = b.key); {code}
> It throws an error.
> {code:java}
> Error: Error while compiling statement: FAILED: IllegalStateException The 
> size of col stats must be equal to that of schema. Stats = [], Schema = [key] 
> (state=42000,code=4) {code}
> Attaching a debugger, FSO of materialized_cte2 has empty stats as 
> JoinOperator loses stats.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HIVE-28098) Fails to copy empty column statistics of materialized CTE

2024-03-04 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28098?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-28098:
--
Labels: pull-request-available  (was: )

> Fails to copy empty column statistics of materialized CTE
> -
>
> Key: HIVE-28098
> URL: https://issues.apache.org/jira/browse/HIVE-28098
> Project: Hive
>  Issue Type: Bug
>  Components: Query Planning
>Reporter: okumin
>Assignee: okumin
>Priority: Major
>  Labels: pull-request-available
>
> HIVE-28080 introduced the optimization of materialized CTEs, but it turned 
> out that it failed when statistics were empty.
> This query reproduces the issue.
> {code:java}
> set hive.stats.autogather=false;
> CREATE TABLE src_no_stats AS SELECT '123' as key, 'val123' as value UNION ALL 
> SELECT '9' as key, 'val9' as value;
> set hive.optimize.cte.materialize.threshold=2;
> set hive.optimize.cte.materialize.full.aggregate.only=false;
> EXPLAIN WITH materialized_cte1 AS (
>   SELECT * FROM src_no_stats
> ),
> materialized_cte2 AS (
>   SELECT a.key
>   FROM materialized_cte1 a
>   JOIN materialized_cte1 b ON (a.key = b.key)
> )
> SELECT a.key
> FROM materialized_cte2 a
> JOIN materialized_cte2 b ON (a.key = b.key); {code}
> It throws an error.
> {code:java}
> Error: Error while compiling statement: FAILED: IllegalStateException The 
> size of col stats must be equal to that of schema. Stats = [], Schema = [key] 
> (state=42000,code=4) {code}
> Attaching a debugger, FSO of materialized_cte2 has empty stats as 
> JoinOperator loses stats.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Updated] (HIVE-28097) Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, HIVE-24113, HIVE-25458, HI

2024-03-04 Thread Aman Raj (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-28097?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aman Raj updated HIVE-28097:

Summary: Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, 
HIVE-20207, HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, 
HIVE-22331, HIVE-24113, HIVE-25458, HIVE-25577, HIVE-25403  (was: Backport of 
HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, HIVE-20245, 
HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, HIVE-24113, 
HIVE-25458)

> Backport of HIVE-20082, HIVE-19564, HIVE-20174, HIVE-20197, HIVE-20207, 
> HIVE-20245, HIVE-20294, HIVE-26447, HIVE-20315, HIVE-22170, HIVE-22331, 
> HIVE-24113, HIVE-25458, HIVE-25577, HIVE-25403
> --
>
> Key: HIVE-28097
> URL: https://issues.apache.org/jira/browse/HIVE-28097
> Project: Hive
>  Issue Type: Sub-task
>Affects Versions: 3.2.0
>Reporter: Aman Raj
>Assignee: Aman Raj
>Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)