[Impala-ASF-CR] IMPALA-8592: Add support for insert events for 'LOAD DATA' statements from Impala

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19052 )

Change subject: IMPALA-8592: Add support for insert events for 'LOAD DATA' 
statements from Impala
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8648/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19052
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7f1b470f40e0aaf891c9f3f327af393b2f9c74bc
Gerrit-Change-Number: 19052
Gerrit-PatchSet: 3
Gerrit-Owner: Yu-Wen Lai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Yu-Wen Lai 
Gerrit-Comment-Date: Sat, 01 Oct 2022 03:40:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8592: Add support for insert events for 'LOAD DATA' statements from Impala

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19052 )

Change subject: IMPALA-8592: Add support for insert events for 'LOAD DATA' 
statements from Impala
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11510/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19052

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7f1b470f40e0aaf891c9f3f327af393b2f9c74bc
Gerrit-Change-Number: 19052
Gerrit-PatchSet: 3
Gerrit-Owner: Yu-Wen Lai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Yu-Wen Lai 
Gerrit-Comment-Date: Sat, 01 Oct 2022 02:10:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-8592: Add support for insert events for 'LOAD DATA' statements from Impala

2022-09-30 Thread Yu-Wen Lai (Code Review)
Yu-Wen Lai has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19052 )

Change subject: IMPALA-8592: Add support for insert events for 'LOAD DATA' 
statements from Impala
..


Patch Set 3:

(3 comments)

> Patch Set 1:
>
> (3 comments)
>
> This is a pretty nice fix!

http://gerrit.cloudera.org:8080/#/c/19052/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/19052/1//COMMIT_MSG@16
PS1, Line 16: - Run existing test_load.py
> We also need tests to verify the INSERT events. Could you add some tests in
I realized that replication cannot be used to verify insert events for 
external tables, because Hive replication for external tables relies on 
distcp instead of insert events. Given that LOAD DATA is only applicable to 
external tables, we need another way to verify the INSERT events. 
Therefore, I added a test that uses the number of skipped events as an 
implicit indicator. Let me know if you have a better idea.


http://gerrit.cloudera.org:8080/#/c/19052/1/be/src/service/client-request-state.cc
File be/src/service/client-request-state.cc:

http://gerrit.cloudera.org:8080/#/c/19052/1/be/src/service/client-request-state.cc@806
PS1, Line 806: string for unpartitione
> nit: Could you add a comment mentioning that the partition_name is an empty
Done


http://gerrit.cloudera.org:8080/#/c/19052/1/be/src/service/client-request-state.cc@809
PS1, Line 809:   catalog_update.__set_sync_ddl(exec_request_->query_options.sync_ddl);
 :   catalog_update.__set_header(GetCatalogServiceRequestHeader());
 :   catalog_update.target_table = exec_request_->load_data_request.table_name.table_name;
 :   catalog_update.db_name = exec_request_->load_data_request.table_name.db_name;
 :   catalog_update.is_overwrite = exec_request_->load_data_request.overwrite;
 :
 :   const TNetworkAddress& address =
> nit: these duplicate the code in ClientRequestState::ExecLoadDataRequestImp
Done



--
To view, visit http://gerrit.cloudera.org:8080/19052

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7f1b470f40e0aaf891c9f3f327af393b2f9c74bc
Gerrit-Change-Number: 19052
Gerrit-PatchSet: 3
Gerrit-Owner: Yu-Wen Lai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Yu-Wen Lai 
Gerrit-Comment-Date: Sat, 01 Oct 2022 01:59:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-8592: Add support for insert events for 'LOAD DATA' statements from Impala

2022-09-30 Thread Yu-Wen Lai (Code Review)
Yu-Wen Lai has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/19052 )

Change subject: IMPALA-8592: Add support for insert events for 'LOAD DATA' 
statements from Impala
..

IMPALA-8592: Add support for insert events for 'LOAD DATA' statements
from Impala

In this patch, we use TUpdateCatalogRequest to refresh metadata after
'LOAD DATA' instead of TResetMetadataRequest so that we can reuse the
code for 'INSERT' statements. It fires an insert event, just as we
do for 'INSERT' statements.

Testing:
- Run existing test_load.py
- Added test_load_data_from_impala() in test_event_processing.py

Change-Id: I7f1b470f40e0aaf891c9f3f327af393b2f9c74bc
---
M be/src/service/client-request-state.cc
M be/src/service/client-request-state.h
M common/thrift/Frontend.thrift
M fe/src/main/java/org/apache/impala/common/FileSystemUtil.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M tests/metadata/test_event_processing.py
6 files changed, 129 insertions(+), 35 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/52/19052/3
--
To view, visit http://gerrit.cloudera.org:8080/19052

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7f1b470f40e0aaf891c9f3f327af393b2f9c74bc
Gerrit-Change-Number: 19052
Gerrit-PatchSet: 3
Gerrit-Owner: Yu-Wen Lai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Yu-Wen Lai 


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11509/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 3
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Fri, 30 Sep 2022 23:04:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Yida Wu (Code Review)
Yida Wu has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/19087/1/be/src/exec/topn-node.cc
File be/src/exec/topn-node.cc:

http://gerrit.cloudera.org:8080/#/c/19087/1/be/src/exec/topn-node.cc@695
PS1, Line 695:
> Would be good to add a comment before the second loop otherwise someone cou
Done



--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 3
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Fri, 30 Sep 2022 22:45:04 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Yida Wu (Code Review)
Yida Wu has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..

IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

The bug was introduced by IMPALA-9979. If RematerializeTuples()
fails in ReclaimTuplePool(), it returns immediately with an error.
However, some Heap unique_ptrs in partition_heaps_ may already have
been moved to rematerialized_heaps. Close() of the TopNNode doesn't
know which Heap unique_ptrs in partition_heaps_ have been released,
so it still calls Close() on a released Heap unique_ptr, which leads
to a crash.

Previously, each Heap object was moved as soon as its own
rematerialization succeeded. With this patch, we move all the Heap
objects out of partition_heaps_ only after every rematerialization
has succeeded. Therefore, partition_heaps_ is never left half
released.

Because it is difficult for a test case to inject an error that
triggers this crash, I hacked the code to inject a memory allocation
failure in certain cases, reproduced the issue, and manually verified
that the patch fixes it.
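
The two-phase approach described above can be sketched roughly as follows.
This is a minimal illustration, not the code in topn-node.cc: the Heap
struct, its fail_rematerialize flag, and the ReclaimAll() and CloseAll()
helpers are hypothetical stand-ins, and the containers are plain vectors
chosen for brevity.

```cpp
#include <cassert>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical stand-in for TopNNode::Heap; only what the sketch needs.
struct Heap {
  bool fail_rematerialize = false;  // hook to simulate an allocation failure
  bool closed = false;
  bool Rematerialize() { return !fail_rematerialize; }
  void Close() { closed = true; }
};

using HeapVec = std::vector<std::unique_ptr<Heap>>;

// Phase 1 rematerializes every heap in place and returns false on the first
// failure, leaving ownership in partition_heaps untouched. Phase 2 transfers
// ownership only after every heap has succeeded.
bool ReclaimAll(HeapVec& partition_heaps, HeapVec& rematerialized_heaps) {
  for (auto& heap : partition_heaps) {
    assert(heap != nullptr);
    if (!heap->Rematerialize()) return false;
  }
  for (auto& heap : partition_heaps) {
    rematerialized_heaps.push_back(std::move(heap));
  }
  partition_heaps.clear();
  return true;
}

// Mirrors a Close() that walks every entry. It dereferences each pointer,
// so it is only safe when no entry has been moved out.
void CloseAll(HeapVec& heaps) {
  for (auto& heap : heaps) heap->Close();
  heaps.clear();
}
```

If any heap fails in the first loop, the function returns before any
unique_ptr is moved, so a later walk such as CloseAll() only ever sees live
pointers.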

Tests:
Ran core tests.
Passed manual test.

Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
---
M be/src/exec/topn-node.cc
1 file changed, 5 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/87/19087/3
--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 3
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-11558: Ensure one Kudu client created (FE) for the specified Kudu master addresses

2022-09-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19046 )

Change subject: IMPALA-11558: Ensure one Kudu client created (FE) for the 
specified Kudu master addresses
..


Patch Set 5:

Xianqing, verification failed in your new test case.


--
To view, visit http://gerrit.cloudera.org:8080/19046

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1003556d3afc8e8216142cac4007a4c99046caeb
Gerrit-Change-Number: 19046
Gerrit-PatchSet: 5
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 22:08:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Abhishek Rawat (Code Review)
Abhishek Rawat has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..


Patch Set 1:

(1 comment)

The fix looks good. Ideally, we could use smart pointers properly and not run 
into these issues. But, that would require some restructuring.

For instance, TopNNode::Heap could have a destructor, ~Heap(), which would 
basically call TopNNode::Heap::Close(). TopNNode::Close() could then simply 
call partition_heaps_.clear() instead of looping through them and calling 
TopNNode::Heap::Close().
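
A rough sketch of that restructuring, under stated assumptions: the classes
below are simplified, hypothetical stand-ins (TopNNodeSketch, the
close_count counter, and ReleaseHeap() exist only for illustration), and
Close() is made idempotent so that an explicit call followed by destruction
stays safe.

```cpp
#include <cassert>
#include <cstddef>
#include <memory>
#include <utility>
#include <vector>

// Hypothetical, simplified heap that cleans up in its destructor.
class Heap {
 public:
  explicit Heap(int* close_count) : close_count_(close_count) {
    assert(close_count_ != nullptr);
  }
  ~Heap() { Close(); }
  Heap(const Heap&) = delete;
  Heap& operator=(const Heap&) = delete;

  // Idempotent: a second call (for example from the destructor) is a no-op.
  void Close() {
    if (closed_) return;
    closed_ = true;
    ++*close_count_;
  }

 private:
  int* close_count_;  // used only to observe cleanup in this sketch
  bool closed_ = false;
};

class TopNNodeSketch {
 public:
  void AddHeap(int* close_count) {
    partition_heaps_.push_back(std::make_unique<Heap>(close_count));
  }

  // Simulates ownership being moved elsewhere, leaving a null slot behind.
  std::unique_ptr<Heap> ReleaseHeap(std::size_t i) {
    return std::move(partition_heaps_[i]);
  }

  // clear() destroys every remaining Heap, and each destructor calls Close().
  // Null (moved-from) entries are destroyed without touching any Heap.
  void Close() { partition_heaps_.clear(); }

 private:
  std::vector<std::unique_ptr<Heap>> partition_heaps_;
};
```

With this shape, the node never dereferences a moved-from unique_ptr, because
it no longer calls Close() through the pointers itself.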

http://gerrit.cloudera.org:8080/#/c/19087/1/be/src/exec/topn-node.cc
File be/src/exec/topn-node.cc:

http://gerrit.cloudera.org:8080/#/c/19087/1/be/src/exec/topn-node.cc@695
PS1, Line 695:
It would be good to add a comment before the second loop; otherwise someone 
could try to optimize it in the future. So something like:
"Only move heaps from partition_heaps_ to rematerialized_heaps once all have 
been rematerialized. Otherwise, in case of an error, we could double free 
entries in partition_heaps_."



--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 1
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 22:06:54 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11558: Ensure one Kudu client created (FE) for the specified Kudu master addresses

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19046 )

Change subject: IMPALA-11558: Ensure one Kudu client created (FE) for the 
specified Kudu master addresses
..


Patch Set 5: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8647/


--
To view, visit http://gerrit.cloudera.org:8080/19046

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1003556d3afc8e8216142cac4007a4c99046caeb
Gerrit-Change-Number: 19046
Gerrit-PatchSet: 5
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 21:25:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11508/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 2
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 20:16:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Yida Wu (Code Review)
Yida Wu has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..

IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

The bug was introduced by IMPALA-9979. If RematerializeTuples()
fails in ReclaimTuplePool(), it returns immediately with an error.
However, some Heap unique_ptrs in partition_heaps_ may already have
been moved to rematerialized_heaps. Close() of the TopNNode doesn't
know which Heap unique_ptrs in partition_heaps_ have been released,
so it still calls Close() on a released Heap unique_ptr, which leads
to a crash.

Previously, each Heap object was moved as soon as its own
rematerialization succeeded. With this patch, we move all the Heap
objects out of partition_heaps_ only after every rematerialization
has succeeded. Therefore, partition_heaps_ is never left half
released.

Because it is difficult for a test case to inject an error that
triggers this crash, I hacked the code to inject a memory allocation
failure in certain cases, reproduced the issue, and manually verified
that the patch fixes it.

Tests:
Ran core tests.
Passed manual test.

Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
---
M be/src/exec/topn-node.cc
1 file changed, 2 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/87/19087/2
--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 2
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19087 )

Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11507/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19087

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 1
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 19:33:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11506/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19002

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 7
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 30 Sep 2022 19:27:21 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11565: Support IF NOT EXISTS in alter table add columns for kudu table

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18953 )

Change subject: IMPALA-11565: Support IF NOT EXISTS in alter table add columns 
for kudu table
..


Patch Set 8: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8646/


--
To view, visit http://gerrit.cloudera.org:8080/18953

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82590e5372e881f2e81d4ed3dd0d32a2d3ddb517
Gerrit-Change-Number: 18953
Gerrit-PatchSet: 8
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:57:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..


Patch Set 6:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/19002/6/fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/19002/6/fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java@73
PS6, Line 73:   public static final Logger LOG = LoggerFactory.getLogger(IcebergCatalogOpExecutor.class);
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/19002/6/fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
File fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java:

http://gerrit.cloudera.org:8080/#/c/19002/6/fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java@4188
PS6, Line 4188: "be an integer type or a timestamp, but is 'DECIMAL(3,2)': EXECUTE rollback(3.14)");
line too long (92 > 90)



--
To view, visit http://gerrit.cloudera.org:8080/19002

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:57:51 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11505/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19002

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 30 Sep 2022 19:18:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..


Patch Set 6:

(7 comments)

Thanks for the review comments.
This change is blocked on 
https://github.com/apache/iceberg/issues/5882, which we need in order to 
include the rollback in our Iceberg transaction.

http://gerrit.cloudera.org:8080/#/c/19002/5/common/thrift/JniCatalog.thrift
File common/thrift/JniCatalog.thrift:

http://gerrit.cloudera.org:8080/#/c/19002/5/common/thrift/JniCatalog.thrift@428
PS5, Line 428:   // Is rollback to a date or snapshot id.
 :   1: required TRollbackType kind
 :
 :   // If kind is TIME_ID this is the date to rollback to.
 :   2: optional i64 timestamp_millis
 :
 :   // If kind is VERSION_ID this is the id to rollback to.
 :   3: optional i64 snapshot_id
 : }
 :
 : /
> Just an idea: can we embed this under TAlterTableExecuteParams?
Thanks, I had to rework but this does make the change better and will allow 
'EXECUTE' functions to be added more easily.


http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/cup/sql-parser.cup
File fe/src/main/cup/sql-parser.cup:

http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/cup/sql-parser.cup@1323
PS5, Line 1323:
> And maybe this could be 'AlterTableExecuteStmt.createExecuteStmt(table, exp
Done


http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteRollbackStmt.java
File fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteRollbackStmt.java:

http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteRollbackStmt.java@34
PS5, Line 34:
> nit: shouldn't we name it to 'rollbackSpec'?
yes!


http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/analysis/TableRef.java
File fe/src/main/java/org/apache/impala/analysis/TableRef.java:

http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/analysis/TableRef.java@478
PS5, Line 478:
> nit: indentation is off
Done


http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@1336
PS5, Line 1336:
> This will have the same problem as https://gerrit.cloudera.org/#/c/19036/
This is not done. I believe we need 
https://github.com/apache/iceberg/issues/5882 to do this.


http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/19002/5/fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java@199
PS5, Line 199: "Execute rollback "
> Do we need this MessageFormat.format here?
Thanks, yes. I had to change
 LOG = Logger.getLogger to
 LOG = LoggerFactory.getLogger
to get this to work (with {}).


http://gerrit.cloudera.org:8080/#/c/19002/5/tests/query_test/test_iceberg.py
File tests/query_test/test_iceberg.py:

http://gerrit.cloudera.org:8080/#/c/19002/5/tests/query_test/test_iceberg.py@258
PS5, Line 258: # We rolled back, but that creates a new snapshot, so now there are 4.
 : snapshots = self.get_snapshots(self.client, tbl_name, expected_result_size=4)
> It's interesting that non-ancestor snapshots are still in the output of DES
This is just what Iceberg does.
But it makes sense to me as it makes the rollback non-destructive



--
To view, visit http://gerrit.cloudera.org:8080/19002

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:57:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..

IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

Iceberg table modifications cause new table snapshots to be created;
these snapshots represent an earlier version of the table. The Iceberg
API provides a way to roll back the table to a previous snapshot.

This change adds the ability to execute a rollback on Iceberg tables
using the following statements:

- ALTER TABLE <tbl> EXECUTE ROLLBACK(<snapshot id>)
- ALTER TABLE <tbl> EXECUTE ROLLBACK('<timestamp>')

The latter form of the command rolls back to the latest snapshot
that has a creation timestamp that is older than the specified
timestamp.
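
The selection rule for the timestamp form can be illustrated with a small
sketch. It is not Impala's or Iceberg's implementation: the Snapshot struct,
its field names, and FindRollbackTarget() are hypothetical, and "older than"
is assumed to mean strictly earlier than the given cutoff.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical record; field names are illustrative, not Iceberg's API.
struct Snapshot {
  int64_t snapshot_id;
  int64_t creation_millis;
};

// Returns a pointer to the most recently created snapshot whose creation
// time is strictly older than cutoff_millis, or nullptr if none qualifies.
// The pointer refers into `history`, so it is valid only while that vector
// is alive and unmodified.
const Snapshot* FindRollbackTarget(const std::vector<Snapshot>& history,
                                   int64_t cutoff_millis) {
  const Snapshot* best = nullptr;
  for (const Snapshot& s : history) {
    if (s.creation_millis >= cutoff_millis) continue;  // not older: skip
    if (best == nullptr || s.creation_millis > best->creation_millis) {
      best = &s;
    }
  }
  return best;
}
```

The scan does not assume the history is sorted by creation time, which keeps
the sketch independent of how the snapshot list is ordered.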

Note that when a table is rolled back to a snapshot, a new snapshot is
created with the same snapshot id, but with a new creation timestamp.

Testing:
 - Added analysis unit tests.
 - Added e2e tests.
 - Added a Snapshot class to the pytests to encapsulate the output of
   DESCRIBE HISTORY.

Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
---
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
A fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteExpireSnapshotsStmt.java
A fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteRollbackStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteStmt.java
M fe/src/main/java/org/apache/impala/catalog/FeFsTable.java
M fe/src/main/java/org/apache/impala/common/TransactionKeepalive.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
A testdata/workloads/functional-query/queries/QueryTest/iceberg-rollback.test
M tests/common/iceberg_test_suite.py
M tests/query_test/test_iceberg.py
14 files changed, 547 insertions(+), 110 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/02/19002/7
--
To view, visit http://gerrit.cloudera.org:8080/19002

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 7
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

2022-09-30 Thread Andrew Sherman (Code Review)
Andrew Sherman has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/19002 )

Change subject: IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.
..

IMPALA-11482: Alter Table Execute Rollback for Iceberg tables.

Iceberg table modifications cause new table snapshots to be created;
these snapshots represent an earlier version of the table. The Iceberg
API provides a way to roll back the table to a previous snapshot.

This change adds the ability to execute a rollback on Iceberg tables
using the following statements:

- ALTER TABLE <tbl> EXECUTE ROLLBACK(<snapshot id>)
- ALTER TABLE <tbl> EXECUTE ROLLBACK('<timestamp>')

The latter form of the command rolls back to the latest snapshot
that has a creation timestamp that is older than the specified
timestamp.

Note that when a table is rolled back to a snapshot, a new snapshot is
created with the same snapshot id, but with a new creation timestamp.

Testing:
 - Added analysis unit tests.
 - Added e2e tests.
 - Added a Snapshot class to the pytests to encapsulate the output of
   DESCRIBE HISTORY.

Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
---
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
A fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteExpireSnapshotsStmt.java
A fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteRollbackStmt.java
M fe/src/main/java/org/apache/impala/analysis/AlterTableExecuteStmt.java
M fe/src/main/java/org/apache/impala/catalog/FeFsTable.java
M fe/src/main/java/org/apache/impala/common/TransactionKeepalive.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/IcebergCatalogOpExecutor.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
A testdata/workloads/functional-query/queries/QueryTest/iceberg-rollback.test
M tests/common/iceberg_test_suite.py
M tests/query_test/test_iceberg.py
14 files changed, 545 insertions(+), 110 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/02/19002/6
--
To view, visit http://gerrit.cloudera.org:8080/19002
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ic74913d3b81103949ffb5eef7cc936303494f8b9
Gerrit-Change-Number: 19002
Gerrit-PatchSet: 6
Gerrit-Owner: Andrew Sherman 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

2022-09-30 Thread Yida Wu (Code Review)
Yida Wu has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19087


Change subject: IMPALA-11631 Fix impala crashes in 
impala::TopNNode::Heap::Close()
..

IMPALA-11631 Fix impala crashes in impala::TopNNode::Heap::Close()

The bug is introduced by IMPALA-11631. If RematerializeTuples()
fails in ReclaimTuplePool(), it returns immediately with an error.
However, some Heap unique_ptrs in partition_heaps_ may already
have been moved to rematerialized_heaps. Close() of the TopNNode
does not know which Heap unique_ptrs in partition_heaps_ have been
released and still calls Close() on them, which leads to a crash.

The patch no longer moves each Heap object as soon as its
rematerialization succeeds. Instead, all the Heap objects in
partition_heaps_ are moved only after every rematerialization
succeeds. Therefore, partition_heaps_ is never left half released.
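
The all-or-nothing idea can be sketched outside of C++ as follows.
This is only an illustration of the pattern, not the code in
topn-node.cc: reclaim() and flaky() are hypothetical names, and plain
lists stand in for Heap objects.

```python
from typing import Callable, Dict, List


def reclaim(partition_heaps: Dict[str, List[int]],
            rematerialize: Callable[[List[int]], List[int]]) -> Dict[str, List[int]]:
    """Rebuild every heap; hand back the new set only if all rebuilds succeed."""
    rebuilt = {}
    for key, heap in partition_heaps.items():
        # May raise. The original heap stays where it is, so on failure the
        # caller still owns every entry of partition_heaps.
        rebuilt[key] = rematerialize(heap)
    # Commit point: reached only when every rebuild succeeded.
    return rebuilt


def flaky(heap: List[int]) -> List[int]:
    """Hypothetical rebuild step that simulates an allocation failure."""
    if 13 in heap:
        raise MemoryError("simulated allocation failure")
    return sorted(heap)


heaps = {"a": [3, 1, 2], "b": [13, 5]}
try:
    heaps = reclaim(heaps, flaky)
except MemoryError:
    # heaps was never reassigned, so nothing is half released.
    pass
```

Because the swap happens only at the commit point, cleanup code can
always assume every original heap is still present after a failure.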

Because it is difficult for a test case to inject an error that
triggers this crash, I temporarily hacked the code to inject a
memory allocation failure in certain cases, reproduced the issue,
and manually verified that the patch fixes it.

Tests:
Ran core tests.
Passed manual test.

Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
---
M be/src/exec/topn-node.cc
1 file changed, 2 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/87/19087/1
--
To view, visit http://gerrit.cloudera.org:8080/19087
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Iaf45b6ef777f68e1843c076a935e4189acc6990b
Gerrit-Change-Number: 19087
Gerrit-PatchSet: 1
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 


[Impala-ASF-CR] IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts

2022-09-30 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19085 )

Change subject: IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts
..


Patch Set 1:

Hi all, please let me know if you have any comments on the patch. Thanks!


--
To view, visit http://gerrit.cloudera.org:8080/19085
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I72bf989a4fa04b23daf9093df8764670cc5e974d
Gerrit-Change-Number: 19085
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Yida Wu 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:40:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19085 )

Change subject: IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11504/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19085
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I72bf989a4fa04b23daf9093df8764670cc5e974d
Gerrit-Change-Number: 19085
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:36:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9448: (Addendum) Avoid generic recreate for Ozone bucket

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19082 )

Change subject: IMPALA-9448: (Addendum) Avoid generic recreate for Ozone bucket
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11503/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I31a3813fac6f2d68859ba7fed53b5c5a82502780
Gerrit-Change-Number: 19082
Gerrit-PatchSet: 3
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 18:18:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts

2022-09-30 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19085


Change subject: IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts
..

IMPALA-11632: Exclude log4j-1.2-api in some Ranger artifacts

After RANGER-3498, Ranger's ranger-plugins-audit and
ranger-plugins-common started pulling in log4j-1.2-api, which is
currently banned by Impala's frontend. To be able to compile Impala
after RANGER-3498, this patch excludes log4j-1.2-api when adding the
Ranger dependencies mentioned above.

Change-Id: I72bf989a4fa04b23daf9093df8764670cc5e974d
---
M fe/pom.xml
1 file changed, 14 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/85/19085/1
--
To view, visit http://gerrit.cloudera.org:8080/19085
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I72bf989a4fa04b23daf9093df8764670cc5e974d
Gerrit-Change-Number: 19085
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 


[Impala-ASF-CR] IMPALA-9448: (Addendum) Avoid generic recreate for Ozone bucket

2022-09-30 Thread Michael Smith (Code Review)
Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/19082

to look at the new patch set (#3).

Change subject: IMPALA-9448: (Addendum) Avoid generic recreate for Ozone bucket
..

IMPALA-9448: (Addendum) Avoid generic recreate for Ozone bucket

Avoids using `hadoop fs` to recreate an Ozone bucket during snapshot
load, as this removes the encryption key.

Change-Id: I31a3813fac6f2d68859ba7fed53b5c5a82502780
---
M testdata/bin/load-test-warehouse-snapshot.sh
M tests/custom_cluster/test_exchange_delays.py
M tests/util/filesystem_utils.py
3 files changed, 11 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/82/19082/3
--
To view, visit http://gerrit.cloudera.org:8080/19082
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I31a3813fac6f2d68859ba7fed53b5c5a82502780
Gerrit-Change-Number: 19082
Gerrit-PatchSet: 3
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-11558: Ensure one Kudu client created (FE) for the specified Kudu master addresses

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19046 )

Change subject: IMPALA-11558: Ensure one Kudu client created (FE) for the 
specified Kudu master addresses
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/19046
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1003556d3afc8e8216142cac4007a4c99046caeb
Gerrit-Change-Number: 19046
Gerrit-PatchSet: 5
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 16:19:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11558: Ensure one Kudu client created (FE) for the specified Kudu master addresses

2022-09-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19046 )

Change subject: IMPALA-11558: Ensure one Kudu client created (FE) for the 
specified Kudu master addresses
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/19046
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1003556d3afc8e8216142cac4007a4c99046caeb
Gerrit-Change-Number: 19046
Gerrit-PatchSet: 4
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 16:19:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11558: Ensure one Kudu client created (FE) for the specified Kudu master addresses

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19046 )

Change subject: IMPALA-11558: Ensure one Kudu client created (FE) for the 
specified Kudu master addresses
..


Patch Set 5:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8647/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/19046
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1003556d3afc8e8216142cac4007a4c99046caeb
Gerrit-Change-Number: 19046
Gerrit-PatchSet: 5
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 16:19:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Kurt Deschler (Code Review)
Kurt Deschler has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 11:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/19033/12/fe/src/main/java/org/apache/impala/analysis/SortInfo.java
File fe/src/main/java/org/apache/impala/analysis/SortInfo.java:

http://gerrit.cloudera.org:8080/#/c/19033/12/fe/src/main/java/org/apache/impala/analysis/SortInfo.java@323
PS12, Line 323: / Math.max(numInstances, 1);
Still not comfortable with the division (by instances/cores) here. This is 
better done at the fragment level, where the overall contention within the 
fragment can be assessed and applied uniformly to operators in the fragment. At 
a minimum, there should be some non-linear adjustment in the divisor, i.e. 
numInstances^(0.8). Having the division here is also misleading since the 
function returns a per-thread estimate, not the overall processing cost.
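
To make the suggested adjustment concrete, here is a rough sketch
comparing a linear divisor with the damped one. per_instance_cost() is
a hypothetical helper, not a function in SortInfo.java, and the 0.8
exponent is simply the value mentioned in the comment above.

```python
def per_instance_cost(total_cost: float, num_instances: int,
                      exponent: float = 1.0) -> float:
    """Divide total_cost by max(num_instances, 1) raised to exponent.

    exponent=1.0 is the plain division; exponent<1.0 assumes that adding
    instances gives less than linear speedup.
    """
    divisor = max(num_instances, 1) ** exponent
    return total_cost / divisor


linear = per_instance_cost(1000.0, 16)       # plain division by 16
damped = per_instance_cost(1000.0, 16, 0.8)  # division by 16^0.8
```

With 16 instances the damped estimate is noticeably larger than the
linear one, which is the effect the comment asks for.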



--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 15:50:52 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18861 )

Change subject: IMPALA-11504: Specializing 
DecimalUtil::GetScaleMultiplier().
..


Patch Set 6: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8645/


--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 6
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 15:48:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10436: Require lower privilege for external Kudu table creation

2022-09-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/17640 )

Change subject: IMPALA-10436: Require lower privilege for external Kudu table 
creation
..


Patch Set 7:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/17640/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17640/5//COMMIT_MSG@29
PS5, Line 29:
> Thanks Csaba! Prefixes provided in your example are supported. I will try t
Were these tests added somewhere? Sorry if I have just missed them.


http://gerrit.cloudera.org:8080/#/c/17640/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/17640/6//COMMIT_MSG@7
PS6, Line 7: IMPALA-10436: Require lower privilege for external Kudu table 
creation
To me the title suggests that creating external Kudu tables will simply need 
less privilege, while actually a new kind of privilege was added for this.

Maybe a title like "Support storage handler privileges for external Kudu table 
creation" would be more descriptive.


http://gerrit.cloudera.org:8080/#/c/17640/5/fe/src/main/java/org/apache/impala/authorization/ranger/RangerCatalogdAuthorizationManager.java
File 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerCatalogdAuthorizationManager.java:

http://gerrit.cloudera.org:8080/#/c/17640/5/fe/src/main/java/org/apache/impala/authorization/ranger/RangerCatalogdAuthorizationManager.java@406
PS5, Line 406:   // Server is used by column, function, and URI resources.
Can you update the comment?


http://gerrit.cloudera.org:8080/#/c/17640/5/fe/src/main/java/org/apache/impala/authorization/ranger/RangerCatalogdAuthorizationManager.java@445
PS5, Line 445: }
is it valid if we don't go to the if?


http://gerrit.cloudera.org:8080/#/c/17640/5/fe/src/main/java/org/apache/impala/authorization/ranger/RangerImpaladAuthorizationManager.java
File 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerImpaladAuthorizationManager.java:

http://gerrit.cloudera.org:8080/#/c/17640/5/fe/src/main/java/org/apache/impala/authorization/ranger/RangerImpaladAuthorizationManager.java@325
PS5, Line 325:   // Server is used by column, function, and URI resources.
stale comment



--
To view, visit http://gerrit.cloudera.org:8080/17640
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7936e1d8c48696169f7ad7ad92abe44a26eea3c4
Gerrit-Change-Number: 17640
Gerrit-PatchSet: 7
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 15:29:17 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 12:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11502/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 12
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 15:30:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 12:

Fix a build error.


--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 12
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 15:09:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#12). ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..

IMPALA-11604 Planner changes for CPU usage

This patch augments IMPALA-10992 by establishing an infrastructure
to allow the weighted total amount of data to process per instance
to be used as a new factor in the definition and selection of an
executor group. In this patch the weight component is set to 1.

The weighted amount of data processed is the sum of that in every
fragment in the query to reflect the worst case of minimal parallel
execution among fragments. A fragment's total is the sum of that of
every node in the fragment to reflect that these nodes are executed
in a single thread. For each node, the weighted amount of data
processed is computed with a general formula as follows.

  D = (I * C * W) / N

  where D is the weighted amount of data processed
        I is input cardinality
        C is expression evaluation cost per row, set to 1
        W is average row size
        N is number of instances
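
As a quick worked instance of the general formula, a minimal sketch is
shown here. node_cost() is a hypothetical name, not the planner's
method, and clamping N to at least 1 is an assumption made to avoid
division by zero.

```python
def node_cost(input_cardinality: float, cost_per_row: float,
              avg_row_size: float, num_instances: int) -> float:
    """D = (I * C * W) / N, with N clamped to at least 1 (assumption)."""
    return (input_cardinality * cost_per_row * avg_row_size) / max(num_instances, 1)


# 1,000,000 rows, C = 1, 32-byte rows, 8 instances.
d = node_cost(1_000_000, 1, 32, 8)
```

Here D works out to 4,000,000 per instance.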

A description of the computation for each kind of plan node is
given below.

1. Aggregation node:
C and W are the sum of the costs and partial row widths for each
AggregateInfo object.

2. AnalyticEval node:
C is sum of the evaluation costs for analytic functions, partition
by equal and order by equal predicate;

3. CardinalityCheck node:
Both C and I are 1;

4. DataSource scan node:
C is computed from a subset of the selection predicates excluding
data source accepted predicates;

5. EmptySet node:
I is 0;

6. Exchange node:
A modification of the general formula when in broadcast mode:
D = (I * C * W / N) * number of receivers;

7. Hash join node:
C is sum of the evaluation cost for equi-join predicate and for
other join predicate, for both probe and build side;

8. Hbase scan node:
N is 1

9. Hdfs and Kudu scan node:
N is mt_dop when query option mt_dop >= 1, otherwise
N is number of nodes * max scan threads;

10. Nested loop join node:
When the right child is not a SingularRowSrc node, C is sum of
the evaluation cost for equi-join predicate and for other join
predicate, for both probe and build side.

When the right child is a SingularRowSrc node, the cost for build
side is multiplied by the cardinality from the probe side;

11. Select node:
Use the general formula;

12. SingularRowSrc node:
I is 1. Since the node is invoked once per input in nested loop
join, the total cost of this node is computed in nested loop join;

13. Sort node:
C is the evaluation cost for the sort expression and W is the width
of the intermediate tuple being sorted;

14. Subplan node:
C is 1. I is the multiplication of the cardinality of the left and
the right child;

15. Union node:
C is the cost of materializing rows from all non pass-through
children. W is the width of all non pass-through children;

16. Unnest node:
I is the cardinality of the containing subplan node and C is 1.
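
The roll-up described above (node to fragment to query) and the
broadcast variant of the Exchange node might be sketched as follows.
All function names and the sample numbers are illustrative
assumptions, not taken from the patch.

```python
from typing import List


def node_cost(i: float, c: float, w: float, n: int) -> float:
    """General formula D = (I * C * W) / N, with N clamped to at least 1."""
    return (i * c * w) / max(n, 1)


def broadcast_exchange_cost(i: float, c: float, w: float, n: int,
                            num_receivers: int) -> float:
    """Broadcast mode: D = (I * C * W / N) * number of receivers."""
    return node_cost(i, c, w, n) * num_receivers


def fragment_cost(node_costs: List[float]) -> float:
    """Nodes of a fragment run in one thread, so their costs add up."""
    return sum(node_costs)


def query_cost(fragment_costs: List[float]) -> float:
    """Worst case of minimal parallelism among fragments: sum them."""
    return sum(fragment_costs)


scan = node_cost(1000, 1, 10, 2)                        # 5000.0
exchange = broadcast_exchange_cost(1000, 1, 10, 2, 3)   # 15000.0
join = node_cost(2000, 2, 10, 4)                        # 10000.0
total = query_cost([fragment_cost([scan]),
                    fragment_cost([exchange, join])])
```

In this made-up plan the query total is 30000.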

Testing:
  [TBD]

Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
---
M common/thrift/Frontend.thrift
M common/thrift/Query.thrift
M fe/src/main/java/org/apache/impala/analysis/AggregateInfo.java
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/SortInfo.java
M fe/src/main/java/org/apache/impala/planner/AggregationNode.java
M fe/src/main/java/org/apache/impala/planner/AnalyticEvalNode.java
M fe/src/main/java/org/apache/impala/planner/CardinalityCheckNode.java
M fe/src/main/java/org/apache/impala/planner/DataSourceScanNode.java
M fe/src/main/java/org/apache/impala/planner/EmptySetNode.java
M fe/src/main/java/org/apache/impala/planner/ExchangeNode.java
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
M fe/src/main/java/org/apache/impala/planner/HashJoinNode.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
M fe/src/main/java/org/apache/impala/planner/NestedLoopJoinNode.java
M fe/src/main/java/org/apache/impala/planner/PlanFragment.java
M fe/src/main/java/org/apache/impala/planner/PlanNode.java
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/planner/ResourceProfile.java
M fe/src/main/java/org/apache/impala/planner/ResourceProfileBuilder.java
M fe/src/main/java/org/apache/impala/planner/ScanNode.java
M fe/src/main/java/org/apache/impala/planner/SelectNode.java
M fe/src/main/java/org/apache/impala/planner/SingularRowSrcNode.java
M fe/src/main/java/org/apache/impala/planner/SortNode.java
M fe/src/main/java/org/apache/impala/planner/SubplanNode.java
M fe/src/main/java/org/apache/impala/planner/UnionNode.java
M fe/src/main/java/org/apache/impala/planner/UnnestNode.java
M fe/src/main/java/org/apache/impala/service/Frontend

[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 11:

Build Failed

https://jenkins.impala.io/job/gerrit-code-review-checks/11501/ : Initial code 
review checks failed. See linked job for details on the failure.


--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 14:35:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#11). ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..

IMPALA-11604 Planner changes for CPU usage

This patch augments IMPALA-10992 by establishing an infrastructure
to allow the weighted total amount of data to process per instance
to be used as a new factor in the definition and selection of an
executor group. In this patch the weight component is set to 1.

The weighted amount of data processed is the sum of that in every
fragment in the query to reflect the worst case of minimal parallel
execution among fragments. A fragment's total is the sum of that of
every node in the fragment to reflect that these nodes are executed
in a single thread. For each node, the weighted amount of data
processed is computed with a general formula as follows.

  D = (I * C * W) / N

  where D is the weighted amount of data processed
        I is input cardinality
        C is expression evaluation cost per row, set to 1
        W is average row size
        N is number of instances

A description of the computation for each kind of plan node is
given below.

1. Aggregation node:
C and W are the sum of the costs and partial row widths for each
AggregateInfo object.

2. AnalyticEval node:
C is sum of the evaluation costs for analytic functions, partition
by equal and order by equal predicate;

3. CardinalityCheck node:
Both C and I are 1;

4. DataSource scan node:
C is computed from a subset of the selection predicates excluding
data source accepted predicates;

5. EmptySet node:
I is 0;

6. Exchange node:
A modification of the general formula when in broadcast mode:
D = (I * C * W / N) * number of receivers;

7. Hash join node:
C is sum of the evaluation cost for equi-join predicate and for
other join predicate, for both probe and build side;

8. Hbase scan node:
N is 1

9. Hdfs and Kudu scan node:
N is mt_dop when query option mt_dop >= 1, otherwise
N is number of nodes * max scan threads;

10. Nested loop join node:
When the right child is not a SingularRowSrc node, C is sum of
the evaluation cost for equi-join predicate and for other join
predicate, for both probe and build side.

When the right child is a SingularRowSrc node, the cost for build
side is multiplied by the cardinality from the probe side;

11. Select node:
Use the general formula;

12. SingularRowSrc node:
I is 1. Since the node is invoked once per input in nested loop
join, the total cost of this node is computed in nested loop join;

13. Sort node:
C is the evaluation cost for the sort expression and W is the width
of the intermediate tuple being sorted;

14. Subplan node:
C is 1. I is the multiplication of the cardinality of the left and
the right child;

15. Union node:
C is the cost of materializing rows from all non pass-through
children. W is the width of all non pass-through children;

16. Unnest node:
I is the cardinality of the containing subplan node and C is 1.
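
Item 9 above (the choice of N for Hdfs and Kudu scan nodes) could be
read as the following sketch. scan_instances() and its parameter names
are hypothetical; only the rule itself comes from the commit message.

```python
def scan_instances(mt_dop: int, num_nodes: int, max_scan_threads: int) -> int:
    """N for an Hdfs or Kudu scan node.

    mt_dop >= 1: N is mt_dop.
    Otherwise:   N is number of nodes * max scan threads.
    """
    if mt_dop >= 1:
        return mt_dop
    return num_nodes * max_scan_threads


n_with_mt_dop = scan_instances(4, 10, 8)     # uses mt_dop
n_without_mt_dop = scan_instances(0, 10, 8)  # uses nodes * scan threads
```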

Testing:
  [TBD]

Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
---
M common/thrift/Frontend.thrift
M common/thrift/Query.thrift
M fe/src/main/java/org/apache/impala/analysis/AggregateInfo.java
M fe/src/main/java/org/apache/impala/analysis/Expr.java
M fe/src/main/java/org/apache/impala/analysis/SortInfo.java
M fe/src/main/java/org/apache/impala/planner/AggregationNode.java
M fe/src/main/java/org/apache/impala/planner/AnalyticEvalNode.java
M fe/src/main/java/org/apache/impala/planner/CardinalityCheckNode.java
M fe/src/main/java/org/apache/impala/planner/DataSourceScanNode.java
M fe/src/main/java/org/apache/impala/planner/EmptySetNode.java
M fe/src/main/java/org/apache/impala/planner/ExchangeNode.java
M fe/src/main/java/org/apache/impala/planner/HBaseScanNode.java
M fe/src/main/java/org/apache/impala/planner/HashJoinNode.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
M fe/src/main/java/org/apache/impala/planner/NestedLoopJoinNode.java
M fe/src/main/java/org/apache/impala/planner/PlanFragment.java
M fe/src/main/java/org/apache/impala/planner/PlanNode.java
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/planner/ResourceProfile.java
M fe/src/main/java/org/apache/impala/planner/ResourceProfileBuilder.java
M fe/src/main/java/org/apache/impala/planner/ScanNode.java
M fe/src/main/java/org/apache/impala/planner/SelectNode.java
M fe/src/main/java/org/apache/impala/planner/SingularRowSrcNode.java
M fe/src/main/java/org/apache/impala/planner/SortNode.java
M fe/src/main/java/org/apache/impala/planner/SubplanNode.java
M fe/src/main/java/org/apache/impala/planner/UnionNode.java
M fe/src/main/java/org/apache/impala/planner/UnnestNode.java
M fe/src/main/java/org/apache/impala/service/Frontend

[Impala-ASF-CR] IMPALA-11604 Planner changes for CPU usage

2022-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19033 )

Change subject: IMPALA-11604 Planner changes for CPU usage
..


Patch Set 11:

(6 comments)

http://gerrit.cloudera.org:8080/#/c/19033/10/common/thrift/Frontend.thrift
File common/thrift/Frontend.thrift:

http://gerrit.cloudera.org:8080/#/c/19033/10/common/thrift/Frontend.thrift@752
PS10, Line 752:   5: optional i64 max_processing_cost_limit
> It's probably best to stick with vcores here as what the executor group pro
This thrift structure defines an executor group set for internal use. Using 
processing cost limit here is better since it hides the details of translating 
#vcores to processing cost.


http://gerrit.cloudera.org:8080/#/c/19033/10/common/thrift/Query.thrift
File common/thrift/Query.thrift:

http://gerrit.cloudera.org:8080/#/c/19033/10/common/thrift/Query.thrift@873
PS10, Line 873:   12: optional i64 dedicated_coord_mem_estimate;
> Vcores here also.
This thrift structure contains the relevant info for a query. Using processing 
cost limit here is better since it hides the details of translating #vcores to 
processing cost.


http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/AggregationNode.java
File fe/src/main/java/org/apache/impala/planner/AggregationNode.java:

http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/AggregationNode.java@627
PS10, Line 627: .setProcessingCost(
> Use (abstract) processingCost naming instead of processedBytes.
Done


http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/AnalyticEvalNode.java
File fe/src/main/java/org/apache/impala/planner/AnalyticEvalNode.java:

http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/AnalyticEvalNode.java@362
PS10, Line 362:   public long computeProcessingCost() {
> rename all of these to computeProcessingCost or similar
Done


http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java
File fe/src/main/java/org/apache/impala/planner/HashJoinNode.java:

http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@308
PS10, Line 308: float eqJoinPredicateEvalCost =
> eqJoinPredicateEvalCost
Done


http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/util/ExprUtil.java
File fe/src/main/java/org/apache/impala/util/ExprUtil.java:

http://gerrit.cloudera.org:8080/#/c/19033/10/fe/src/main/java/org/apache/impala/util/ExprUtil.java@109
PS10, Line 109:   public static float computeExprsTotalCost(List exprs) {
> Use List to share the same function.
Done



--
To view, visit http://gerrit.cloudera.org:8080/19033
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: If32dc770dfffcdd0be2ba789a7720952c68a
Gerrit-Change-Number: 19033
Gerrit-PatchSet: 11
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 14:22:36 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11606: add 'untracked memory' metric.

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19029 )

Change subject: IMPALA-11606: add 'untracked memory' metric.
..


Patch Set 4: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8644/


--
To view, visit http://gerrit.cloudera.org:8080/19029
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib16e00109d732f759c96c7a967eb1cc32124a03f
Gerrit-Change-Number: 19029
Gerrit-PatchSet: 4
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 13:58:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11565: Support IF NOT EXISTS in alter table add columns for kudu table

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18953 )

Change subject: IMPALA-11565: Support IF NOT EXISTS in alter table add columns 
for kudu table
..


Patch Set 8:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8646/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18953
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82590e5372e881f2e81d4ed3dd0d32a2d3ddb517
Gerrit-Change-Number: 18953
Gerrit-PatchSet: 8
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 13:58:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11565: Support IF NOT EXISTS in alter table add columns for kudu table

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18953 )

Change subject: IMPALA-11565: Support IF NOT EXISTS in alter table add columns 
for kudu table
..


Patch Set 8:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11500/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18953
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I82590e5372e881f2e81d4ed3dd0d32a2d3ddb517
Gerrit-Change-Number: 18953
Gerrit-PatchSet: 8
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 11:16:08 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11565: Support IF NOT EXISTS in alter table add columns for kudu table

2022-09-30 Thread Baike Xia (Code Review)
Baike Xia has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/18953 )

Change subject: IMPALA-11565: Support IF NOT EXISTS in alter table add columns 
for kudu table
..

IMPALA-11565: Support IF NOT EXISTS in alter table add columns for kudu table

Impala already supports IF NOT EXISTS in ALTER TABLE ADD COLUMNS for
regular Hive tables (IMPALA-7832), but not for Kudu tables. This patch
adds the same semantics for Kudu tables.
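
A minimal sketch of the IF NOT EXISTS semantics described above, assuming
that columns which are already present are skipped rather than rejected.
It is not the code from this patch: the helper name ColumnsToAdd and its
parameters are hypothetical and exist only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper (not from the patch): returns the requested columns
// that still have to be added to a table whose current columns are `existing`.
std::vector<std::string> ColumnsToAdd(const std::vector<std::string>& existing,
                                      const std::vector<std::string>& requested,
                                      bool if_not_exists) {
  std::vector<std::string> to_add;
  for (const std::string& col : requested) {
    const bool present =
        std::find(existing.begin(), existing.end(), col) != existing.end();
    if (!present) {
      to_add.push_back(col);
    } else if (!if_not_exists) {
      // Without IF NOT EXISTS, a duplicate column is reported as an error.
      throw std::invalid_argument("Column already exists: " + col);
    }
    // With IF NOT EXISTS, the duplicate column is silently skipped.
  }
  return to_add;
}
```

With existing columns {id, name} and a request for {name, age}, the sketch
returns {age} when if_not_exists is true and throws otherwise.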

Testing:
- Updated E2E DDL tests

Change-Id: I82590e5372e881f2e81d4ed3dd0d32a2d3ddb517
---
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/KuduCatalogOpExecutor.java
M testdata/workloads/functional-query/queries/QueryTest/kudu_alter.test
3 files changed, 64 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/53/18953/8
--
To view, visit http://gerrit.cloudera.org:8080/18953
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I82590e5372e881f2e81d4ed3dd0d32a2d3ddb517
Gerrit-Change-Number: 18953
Gerrit-PatchSet: 8
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18861 )

Change subject: IMPALA-11504: Specializing 
DecimalUtil::GetScaleMultiplier().
..


Patch Set 6:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8645/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 6
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 10:41:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18861 )

Change subject: IMPALA-11504: Specializing 
DecimalUtil::GetScaleMultiplier().
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11499/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 5
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 10:39:01 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Xiang Yang (Code Review)
Xiang Yang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18861 )

Change subject: IMPALA-11504: Specializing 
DecimalUtil::GetScaleMultiplier().
..


Patch Set 6:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18861/4//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18861/4//COMMIT_MSG@7
PS4, Line 7:
> nit: Lack of space
Done



--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 6
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 10:40:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] [IMPALA-11625] Support create/drop materialized view syntax on IMPALA

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19050 )

Change subject: [IMPALA-11625] Support create/drop materialized view syntax on 
IMPALA
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11498/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19050
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
Gerrit-Change-Number: 19050
Gerrit-PatchSet: 6
Gerrit-Owner: pengdou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 10:37:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Xiang Yang (Code Review)
Hello Daniel Becker, lipeng...@sensorsdata.cn, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18861

to look at the new patch set (#6).

Change subject: IMPALA-11504: Specializing 
DecimalUtil::GetScaleMultiplier().
..

IMPALA-11504: Specializing DecimalUtil::GetScaleMultiplier().

Currently decimal-util.h does not specialize
DecimalUtil::GetScaleMultiplier(), which costs performance when
computing Decimal16Value division.
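
A rough sketch of the general idea, under the assumption that "specializing"
means replacing a generic power-of-ten computation with a per-type lookup
table. It is not the code in decimal-util.h: the names ScaleMultiplier,
GenericScaleMultiplier, kInt32Powers and kInt32UpperBound, as well as the -1
result for an out-of-range scale, are hypothetical choices for illustration.

```cpp
#include <cstdint>

// Hypothetical generic path: computes 10^scale with a loop.
template <typename T>
constexpr T GenericScaleMultiplier(int scale) {
  T result = 1;
  for (int i = 0; i < scale; ++i) result *= 10;
  return result;
}

// Primary template: falls back to the loop for any type.
template <typename T>
constexpr T ScaleMultiplier(int scale) {
  return GenericScaleMultiplier<T>(scale);
}

// Powers of ten that fit in int32_t (10^0 through 10^9).
constexpr int32_t kInt32Powers[] = {1,      10,      100,      1000,      10000,
                                    100000, 1000000, 10000000, 100000000, 1000000000};
constexpr int kInt32UpperBound = 10;  // number of table entries

// Explicit specialization for int32_t: a table lookup instead of a loop.
// Returning -1 for an out-of-range scale is an assumption of this sketch.
template <>
constexpr int32_t ScaleMultiplier<int32_t>(int scale) {
  return (scale >= 0 && scale < kInt32UpperBound) ? kInt32Powers[scale] : -1;
}
```

Calls such as ScaleMultiplier<int32_t>(3) resolve to the table lookup, while
ScaleMultiplier<int64_t>(12) still takes the generic loop in this sketch.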

Testing:
- Ran existing jobs.
- Add decimal-util-benchmark.

Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
---
M be/src/benchmarks/CMakeLists.txt
A be/src/benchmarks/decimal-util-benchmark.cc
M be/src/runtime/decimal-test.cc
M be/src/util/decimal-util.h
4 files changed, 232 insertions(+), 30 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/61/18861/6
--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 6
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 


[Impala-ASF-CR] IMPALA-11504:Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Xiang Yang (Code Review)
Hello Daniel Becker, lipeng...@sensorsdata.cn, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18861

to look at the new patch set (#5).

Change subject: IMPALA-11504:Specializing 
DecimalUtil::GetScaleMultiplier().
..

IMPALA-11504:Specializing DecimalUtil::GetScaleMultiplier().

Currently decimal-util.h does not specialize
DecimalUtil::GetScaleMultiplier(), which costs performance when
computing Decimal16Value division.

Testing:
- Ran existing jobs.
- Add decimal-util-benchmark.

Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
---
M be/src/benchmarks/CMakeLists.txt
A be/src/benchmarks/decimal-util-benchmark.cc
M be/src/runtime/decimal-test.cc
M be/src/util/decimal-util.h
4 files changed, 232 insertions(+), 30 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/61/18861/5
--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 5
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 


[Impala-ASF-CR] [IMPALA-11625] Support create/drop materialized view syntax on IMPALA

2022-09-30 Thread pengdou (Code Review)
pengdou has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/19050 )

Change subject: [IMPALA-11625] Support create/drop materialized view syntax on 
IMPALA
..

[IMPALA-11625] Support create/drop materialized view syntax on IMPALA

For infrequently modified datasets, building suitable materialized
views and querying on materialized views directly can significantly
reduce query response time.

Currently Impala treats a materialized view built by Hive as
a normal table and does not support the CREATE MATERIALIZED VIEW AS
SELECT and DROP MATERIALIZED VIEW syntax.

CREATE TABLE AS SELECT can save the intermediate results of a
query, but it is inconvenient to use and maintain; for example,
the calculation logic is not visible to the user.

This patch therefore adds support for the create/drop materialized
view syntax in Impala.

CREATE MATERIALIZED VIEW syntax:
CREATE MATERIALIZED VIEW [IF NOT EXISTS] [db_name.]table_name
  [PARTITIONED BY (col_name[, ...])]
  [SORT BY ([column [, column ...]])]
  [COMMENT 'table_comment']
  [ROW FORMAT row_format]
  [WITH SERDEPROPERTIES ('key1'='value1', 'key2'='value2', ...)]
  [STORED AS cmvas_file_format]
  LOCATION 'hdfs_path'
  [CACHED IN 'pool_name' [WITH REPLICATION = integer] | UNCACHED]
  [TBLPROPERTIES ('key1'='value1', 'key2'='value2', ...)]
AS
  select_statement

cmvas_file_format:
    PARQUET
  | TEXTFILE

DROP MATERIALIZED VIEW syntax:
DROP MATERIALIZED VIEW [IF EXISTS] [db_name.]table_name

Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
---
M be/src/service/client-request-state.cc
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
A fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java
M fe/src/main/java/org/apache/impala/analysis/LimitElement.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/View.java
M fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/planner/PlannerContext.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M testdata/datasets/functional/functional_schema_template.sql
M testdata/datasets/functional/schema_constraints.csv
A testdata/workloads/functional-query/queries/QueryTest/create-materialized-view-as-select.test
M tests/metadata/test_ddl.py
23 files changed, 815 insertions(+), 41 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/50/19050/6
--
To view, visit http://gerrit.cloudera.org:8080/19050
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
Gerrit-Change-Number: 19050
Gerrit-PatchSet: 6
Gerrit-Owner: pengdou 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-11504:Specializing DecimalUtil::GetScaleMultiplier().

2022-09-30 Thread Xiang Yang (Code Review)
Xiang Yang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18861 )

Change subject: IMPALA-11504:Specializing 
DecimalUtil::GetScaleMultiplier().
..


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18861/4/be/src/util/decimal-util.h
File be/src/util/decimal-util.h:

http://gerrit.cloudera.org:8080/#/c/18861/4/be/src/util/decimal-util.h@37
PS4, Line 37:   // The scale upper bound for GetScaleMultiplier()
:   static constexpr int INT32_SCALE_UPPER_BOUND = ColumnType::MAX_DECIMAL4_PRECISION + 1;
:   // The scale upper bound for GetScaleMultiplier()
:   static constexpr int INT64_SCALE_UPPER_BOUND = ColumnType::MAX_DECIMAL8_PRECISION + 1;
:   // The scale upper bound for GetScaleMultiplier()
:   static constexpr int INT128_SCALE_UPPER_BOUND = ColumnType::MAX_PRECISION + 1;
:   // The scale upper bound for GetScaleMultiplier()
:   static constexpr int INT256_SCALE_UPPER_BOUND = 77;
> I don't think these should be public class-level constants, they should be
These constants are referenced in decimal-test.cc and 
decimal-util-benchmark.cc. If we don't make them public, we would have to 
redefine constants with the same values in both of those files. Do you have 
a better solution?



--
To view, visit http://gerrit.cloudera.org:8080/18861
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I969e2977d51313e738f72c8246db003ae43a3782
Gerrit-Change-Number: 18861
Gerrit-PatchSet: 4
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Xiang Yang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 09:50:49 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11606: add 'untracked memory' metric.

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19029 )

Change subject: IMPALA-11606: add 'untracked memory' metric.
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8644/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/19029
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib16e00109d732f759c96c7a967eb1cc32124a03f
Gerrit-Change-Number: 19029
Gerrit-PatchSet: 4
Gerrit-Owner: Xiang Yang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Jian Zhang 
Gerrit-Reviewer: Xianqing He 
Gerrit-Comment-Date: Fri, 30 Sep 2022 08:49:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3119: DDL support for bucketed tables

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19055 )

Change subject: IMPALA-3119: DDL support for bucketed tables
..


Patch Set 5: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8643/


--
To view, visit http://gerrit.cloudera.org:8080/19055
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I919b4d4139bc3a7784fa6fdb6f064e25666d548e
Gerrit-Change-Number: 19055
Gerrit-PatchSet: 5
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 08:37:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] [IMPALA-11625] Support create/drop materialized view syntax on IMPALA

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19050 )

Change subject: [IMPALA-11625] Support create/drop materialized view syntax on 
IMPALA
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11497/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19050
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
Gerrit-Change-Number: 19050
Gerrit-PatchSet: 5
Gerrit-Owner: pengdou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 07:56:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3119: DDL support for bucketed tables

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19055 )

Change subject: IMPALA-3119: DDL support for bucketed tables
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11496/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/19055
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I919b4d4139bc3a7784fa6fdb6f064e25666d548e
Gerrit-Change-Number: 19055
Gerrit-PatchSet: 7
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 30 Sep 2022 07:54:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11612: fix ORDER BY expression rewrite bug

2022-09-30 Thread Anonymous Coward (Code Review)
jhkc...@qq.com has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19049 )

Change subject: IMPALA-11612: fix ORDER BY expression rewrite bug
..


Patch Set 3:

> Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8642/

Is the build server disk space insufficient?


--
To view, visit http://gerrit.cloudera.org:8080/19049
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I438c61c647e3f99665c94ea15e3a9d98b9ff9912
Gerrit-Change-Number: 19049
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 07:47:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] [IMPALA-11625] Support create/drop materialized view syntax on IMPALA

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19050 )

Change subject: [IMPALA-11625] Support create/drop materialized view syntax on 
IMPALA
..


Patch Set 5:

(30 comments)

http://gerrit.cloudera.org:8080/#/c/19050/5/be/src/service/client-request-state.cc
File be/src/service/client-request-state.cc:

http://gerrit.cloudera.org:8080/#/c/19050/5/be/src/service/client-request-state.cc@708
PS5, Line 708:   // portion of the operation). The exception is if the user specified IF NOT EXISTS and the table already
line too long (106 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/be/src/service/client-request-state.cc@1116
PS5, Line 1116: // When the coordinator is not available for CTAS/CMvAS that requires a coordinator, check
line too long (94 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java@51
PS5, Line 51:   private final static EnumSet SUPPORTED_INSERT_MATERIALIZED_VIEW_FORMATS =
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java@68
PS5, Line 68:   public CreateMaterializedViewAsSelectStmt(CmvasParams params, List planHints) {
line too long (91 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java@104
PS5, Line 104: if (!SUPPORTED_INSERT_MATERIALIZED_VIEW_FORMATS.contains(createStmt_.getFileFormat())) {
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java@122
PS5, Line 122:   throw new AnalysisException("Query statement in CreateMaterializedViewAsSelectStmt " +
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java@129
PS5, Line 129:   throw new AnalysisException("Query statement in CreateMaterializedViewAsSelectStmt " +
line too long (92 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java
File fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java:

http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java@142
PS5, Line 142: // DROP MATERIALIZED VIEW IF EXISTS 'table' succeeds, similarly to Hive, but unlike postgres.
line too long (101 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java@145
PS5, Line 145: "DROP MATERIALIZED VIEW not allowed on a none-materialized view: %s.%s", dbName_, getTbl()));
line too long (105 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java@151
PS5, Line 151: "DROP TABLE/VIEW not allowed on a materialized view: %s.%s", dbName_, getTbl()));
line too long (93 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
File fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java:

http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java@477
PS5, Line 477: return getCreateTableSql(dbName, tableName, tableComment, columnsSql, partitionColumnsSql,
line too long (94 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java@478
PS5, Line 478: primaryKeysSql, foreignKeysSql, kuduPartitionByParams, sortProperties, tblProperties,
line too long (93 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java
File fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java:

http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java@80
PS5, Line 80: && !singleNodePlan.hasLimit() || analysisResult.isCreateMaterializedViewAsSelectStmt()) {
line too long (97 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@2671
PS5, Line 2671:   params.getTable_name().table_name, "Load for DROP TABLE/VIEW/MATERIALIZED VIEW ", null);
line too long (98 > 90)


http://gerrit.cloudera.org:8080/#/c/19050/5/fe/src/main/java/org/apache/impala/servi

[Impala-ASF-CR] IMPALA-3119: DDL support for bucketed tables

2022-09-30 Thread Baike Xia (Code Review)
Baike Xia has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/19055 )

Change subject: IMPALA-3119: DDL support for bucketed tables
..

IMPALA-3119: DDL support for bucketed tables

Add syntax support for creating bucketed tables.
The syntax is as follows:
CREATE [EXTERNAL] TABLE [IF NOT EXISTS] [db_name.]table_name(
   col_name data_type
   [constraint_specification]
   [COMMENT 'col_comment']
   [, ...]
 )
 [PARTITIONED BY (col_name data_type [COMMENT 'col_comment'], ...)]
 [BUCKETED BY HASH([column [, column ...]])|RANDOM INTO 24 BUCKETS]
 [SORT BY ([column [, column ...]])]
 [COMMENT 'table_comment']
 [ROW FORMAT row_format]
 [WITH SERDEPROPERTIES ('key1'='value1', 'key2'='value2', ...)]
 [STORED AS file_format]
 [LOCATION 'hdfs_path']
 [CACHED IN 'pool_name' [WITH REPLICATION = integer] | UNCACHED]
 [TBLPROPERTIES ('key1'='value1', 'key2'='value2', ...)]

Instructions:
1. Hive's CLUSTERED BY is not supported, because the keyword is already
  used by a hint;
2. The bucket partitioning algorithm can be HASH, RANDOM, or KUDU_HASH.
  The default value is HASH;
3. INTO 24 BUCKETS specifies the number of buckets; the default value
  is 16;
4. CREATE statements for bucketed tables do not support Kudu and
  Iceberg tables. For a Kudu table, however,
  a hash partition is equivalent to a bucket,
  and the optimization rule applies to join queries;
5. In the current version, ALTER operations (add/drop/change/replace
  columns) on bucketed tables are not supported.

This commit is the first subtask of IMPALA-3118.
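
The commit message does not say how a row is mapped to a bucket. A common
scheme, assumed here purely for illustration, is to hash the bucketing key
and take the result modulo the bucket count. The FNV-1a hash and the names
Fnv1a, BucketFor and kDefaultNumBuckets are hypothetical and are not taken
from this patch; only the default of 16 buckets comes from item 3 above.

```cpp
#include <cstdint>

// 32-bit FNV-1a over a NUL-terminated string. Chosen only because it is
// short and deterministic; the hash actually used is not stated in the source.
constexpr uint32_t Fnv1a(const char* s) {
  uint32_t h = 2166136261u;  // FNV offset basis
  while (*s != '\0') {
    h ^= static_cast<unsigned char>(*s++);
    h *= 16777619u;  // FNV prime; unsigned overflow wraps by definition
  }
  return h;
}

// Default bucket count, as stated in item 3 of the commit message.
constexpr int kDefaultNumBuckets = 16;

// Hypothetical mapping: the bucket index is hash(key) modulo the bucket count.
constexpr int BucketFor(const char* key, int num_buckets = kDefaultNumBuckets) {
  return static_cast<int>(Fnv1a(key) % static_cast<uint32_t>(num_buckets));
}
```

Under this assumption, rows with equal key values always land in the same
bucket, which is the property a bucket-aware join would rely on.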

Change-Id: I919b4d4139bc3a7784fa6fdb6f064e25666d548e
---
M common/thrift/CatalogObjects.thrift
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeFileStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/TableDef.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/Table.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
A fe/src/main/java/org/apache/impala/util/BucketUtils.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M fe/src/test/java/org/apache/impala/analysis/ToSqlTest.java
M testdata/workloads/functional-query/queries/QueryTest/create-table.test
15 files changed, 411 insertions(+), 21 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/55/19055/7
--
To view, visit http://gerrit.cloudera.org:8080/19055
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I919b4d4139bc3a7784fa6fdb6f064e25666d548e
Gerrit-Change-Number: 19055
Gerrit-PatchSet: 7
Gerrit-Owner: Baike Xia 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] [IMPALA-11625] Support create/drop materialized view syntax on IMPALA

2022-09-30 Thread pengdou (Code Review)
pengdou has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/19050 )


Change subject: [IMPALA-11625] Support create/drop materialized view syntax on 
IMPALA
..

[IMPALA-11625] Support create/drop materialized view syntax on IMPALA

For infrequently modified datasets, building suitable materialized
views and querying on materialized views directly can significantly
reduce query response time.

Currently Impala treats a materialized view built by Hive as
a normal table and does not support the CREATE MATERIALIZED VIEW AS
SELECT and DROP MATERIALIZED VIEW syntax.

CREATE TABLE AS SELECT can save the intermediate results of a
query, but it is inconvenient to use and maintain; for example,
the calculation logic is not visible to the user.

This patch therefore adds support for the create/drop materialized
view syntax in Impala.

CREATE MATERIALIZED VIEW syntax:
CREATE MATERIALIZED VIEW [IF NOT EXISTS] [db_name.]table_name
  [PARTITIONED BY (col_name[, ...])]
  [SORT BY ([column [, column ...]])]
  [COMMENT 'table_comment']
  [ROW FORMAT row_format]
  [WITH SERDEPROPERTIES ('key1'='value1', 'key2'='value2', ...)]
  [STORED AS cmvas_file_format]
  LOCATION 'hdfs_path'
  [CACHED IN 'pool_name' [WITH REPLICATION = integer] | UNCACHED]
  [TBLPROPERTIES ('key1'='value1', 'key2'='value2', ...)]
AS
  select_statement

cmvas_file_format:
PARQUET
  | TEXTFILE

DROP MATERIALIZED VIEW syntax:
DROP MATERIALIZED VIEW [IF EXISTS] [db_name.]table_name

Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
---
M be/src/service/client-request-state.cc
M common/thrift/JniCatalog.thrift
M fe/src/main/cup/sql-parser.cup
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
A fe/src/main/java/org/apache/impala/analysis/CreateMaterializedViewAsSelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/DropTableOrViewStmt.java
M fe/src/main/java/org/apache/impala/analysis/LimitElement.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/View.java
M fe/src/main/java/org/apache/impala/planner/DistributedPlanner.java
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/planner/PlannerContext.java
M fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
M fe/src/main/java/org/apache/impala/service/Frontend.java
M fe/src/main/jflex/sql-scanner.flex
M fe/src/test/java/org/apache/impala/analysis/AnalyzeDDLTest.java
M fe/src/test/java/org/apache/impala/analysis/ParserTest.java
M testdata/datasets/functional/functional_schema_template.sql
M testdata/datasets/functional/schema_constraints.csv
A testdata/workloads/functional-query/queries/QueryTest/create-materialized-view-as-select.test
M tests/metadata/test_ddl.py
23 files changed, 793 insertions(+), 37 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/50/19050/5
--
To view, visit http://gerrit.cloudera.org:8080/19050
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I77fdd34bf04a8994a215170747249356cd40622b
Gerrit-Change-Number: 19050
Gerrit-PatchSet: 5
Gerrit-Owner: pengdou 


[Impala-ASF-CR] IMPALA-11624: Bump Impyla dependency to 0.18.0

2022-09-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18456 )

Change subject: IMPALA-11624: Bump Impyla dependency to 0.18.0
..


Patch Set 6:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18456/6/bin/impala-config.sh
File bin/impala-config.sh:

http://gerrit.cloudera.org:8080/#/c/18456/6/bin/impala-config.sh@179
PS6, Line 179: shell/ext-py
> Hi Csaba. Just wondering, what will happen with shell/ext-py/thrift-0.14.2
Oops, I need to update that to - for some reason I thought the copied eggs in 
ext-py were removed



--
To view, visit http://gerrit.cloudera.org:8080/18456
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7265558b0e07959c606cba73cd251c3edfcb3ed5
Gerrit-Change-Number: 18456
Gerrit-PatchSet: 6
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 30 Sep 2022 07:25:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11612: fix ORDER BY expression rewrite bug

2022-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/19049 )

Change subject: IMPALA-11612: fix ORDER BY expression rewrite bug
..


Patch Set 3: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8642/


--
To view, visit http://gerrit.cloudera.org:8080/19049
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I438c61c647e3f99665c94ea15e3a9d98b9ff9912
Gerrit-Change-Number: 19049
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Fri, 30 Sep 2022 07:23:03 +
Gerrit-HasComments: No