[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18875
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Sun, 21 Aug 2022 04:49:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] Test review

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has abandoned this change. ( 
http://gerrit.cloudera.org:8080/18860 )

Change subject: Test review
..


Abandoned
--
To view, visit http://gerrit.cloudera.org:8080/18860

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: Id1c1742716c4922f125b20d719928e9a7b94ecc8
Gerrit-Change-Number: 18860
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18875

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Sun, 21 Aug 2022 02:50:06 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11418: The only select constant statement need not to spool results

2022-08-20 Thread Riza Suminto (Code Review)
Riza Suminto has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18711 )

Change subject: IMPALA-11418: The only select constant statement need not to 
spool results
..


Patch Set 4:

(3 comments)

Thank you for working on this! The change makes sense to me.
I only have a few nits.

http://gerrit.cloudera.org:8080/#/c/18711/4/fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
File fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java:

http://gerrit.cloudera.org:8080/#/c/18711/4/fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java@474
PS4, Line 474: The statement only select constant need not to spool query 
results.
nit: A statement that only selects constant does not need to spool query 
results.


http://gerrit.cloudera.org:8080/#/c/18711/4/fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java@480
PS4, Line 480:   clientRequest.query_options.setSpool_query_results(false);
Can you add LOG.trace here? Like what we have in:
https://github.com/apache/impala/blob/f1420ae51b2e00ad093be64a0d2135ca36aa6514/fe/src/main/java/org/apache/impala/planner/PlanRootSink.java#L101-L105


http://gerrit.cloudera.org:8080/#/c/18711/4/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
File fe/src/main/java/org/apache/impala/analysis/SelectStmt.java:

http://gerrit.cloudera.org:8080/#/c/18711/4/fe/src/main/java/org/apache/impala/analysis/SelectStmt.java@1705
PS4, Line 1705:* Check if the select statement only select constant
Can you add two example queries to this method's documentation, one for each case?
- Select constant
- Select constant from InlineView



--
To view, visit http://gerrit.cloudera.org:8080/18711

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icd4d73c21106048df68a270cf03d4abd56bd3aac
Gerrit-Change-Number: 18711
Gerrit-PatchSet: 4
Gerrit-Owner: Xianqing He 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Comment-Date: Sun, 21 Aug 2022 02:42:50 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..

IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

This adds some calls to df and du to track disk space
usage throughout the builds. This also cleans up the
Impala dev environment before creating the m2 archive.
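As a rough illustration of the kind of tracking described above, here is a
minimal Python sketch. It is not the code added to
build-all-flag-combinations.sh (which is a shell script calling df and du);
the function names are hypothetical and only mimic what those tools report.

```python
import os
import shutil


def filesystem_usage(path="."):
    """Rough analogue of `df`: bytes total/used/free on the filesystem holding path."""
    usage = shutil.disk_usage(path)
    return {"total": usage.total, "used": usage.used, "free": usage.free}


def directory_size(path):
    """Rough analogue of `du`: sum of the sizes of regular files under path."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            file_path = os.path.join(root, name)
            # Skip symlinks so a link to a large file is not counted twice.
            if not os.path.islink(file_path):
                total += os.path.getsize(file_path)
    return total


def report(label, path="."):
    """Print one line of disk usage, e.g. between two build configurations."""
    fs = filesystem_usage(path)
    print(f"[{label}] free={fs['free']} bytes, tree={directory_size(path)} bytes")
```

Calling report("after debug build") at a few points in a long job would show
where the space goes.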

Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Reviewed-on: http://gerrit.cloudera.org:8080/18810
Reviewed-by: Impala Public Jenkins 
Tested-by: Joe McDonnell 
---
M bin/jenkins/build-all-flag-combinations.sh
1 file changed, 17 insertions(+), 0 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved
  Joe McDonnell: Verified

--
To view, visit http://gerrit.cloudera.org:8080/18810

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 4
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has removed a vote on this change.

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/18810

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 3: Verified+1

This only impacts the all-build-options-ub1604 job, and it passed, so going 
ahead with this.


--
To view, visit http://gerrit.cloudera.org:8080/18810

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Sat, 20 Aug 2022 22:21:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1:

Not related, retrying


--
To view, visit http://gerrit.cloudera.org:8080/18875

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Sat, 20 Aug 2022 21:56:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8479/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18875

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Sat, 20 Aug 2022 21:56:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11514: Workaround s3 connection timeout issues

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18872 )

Change subject: IMPALA-11514: Workaround s3 connection timeout issues
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18872

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08d03eb653fdcb6955340519b0cf5ba97b10d590
Gerrit-Change-Number: 18872
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Sat, 20 Aug 2022 09:42:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8478/


--
To view, visit http://gerrit.cloudera.org:8080/18875

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Sat, 20 Aug 2022 09:41:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 3:

The failure is due to IMPALA-11352. I think we can merge this regardless of it.


--
To view, visit http://gerrit.cloudera.org:8080/18810

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 11:45:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Kurt Deschler (Code Review)
Kurt Deschler has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 11: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/18798

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 11
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 18 Aug 2022 23:04:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num_values is 0

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 5: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18854

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 5
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 07:31:18 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11429: Set table owner after creating an Iceberg table

2022-08-20 Thread Tamas Mate (Code Review)
Tamas Mate has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18837 )

Change subject: IMPALA-11429: Set table owner after creating an Iceberg table
..


Patch Set 5:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/18837/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/18837/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3626
PS5, Line 3626: it has
  : // hardcoded that the
nit: this part was fuzzy for me, maybe something like:
"however, the table owner is hardcoded to be the user running the Iceberg 
process"


http://gerrit.cloudera.org:8080/#/c/18837/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3633
PS5, Line 3633: throw new ImpalaRuntimeException("Failed to set table 
owner after creating " +
  : "Iceberg table but the table has been created 
successfully", e);
With Ranger enabled and a failing alter operation, this table would be 
inaccessible to the user, right?


http://gerrit.cloudera.org:8080/#/c/18837/5/tests/query_test/test_iceberg.py
File tests/query_test/test_iceberg.py:

http://gerrit.cloudera.org:8080/#/c/18837/5/tests/query_test/test_iceberg.py@130
PS5, Line 130: pytest.skip()
Wouldn't a time.sleep(5) solve the issue after create?



--
To view, visit http://gerrit.cloudera.org:8080/18837

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5cac198a4a53be3599cb582864ee5f8c269202c0
Gerrit-Change-Number: 18837
Gerrit-PatchSet: 5
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Fri, 19 Aug 2022 13:48:34 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7942 (part 1): Add query hints for table cardinalities

2022-08-20 Thread wangsheng (Code Review)
wangsheng has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/18829 )

Change subject: IMPALA-7942 (part 1): Add query hints for table cardinalities
..

IMPALA-7942 (part 1): Add query hints for table cardinalities

Currently, we need to execute 'COMPUTE STATS' manually to compute
table stats. Stats are very useful for query planning; without
them, the query plan may be worse. To solve this problem, this
patch adds a new query hint: 'TABLE_NUM_ROWS'. The hint can be
placed after an HDFS or Kudu table in a query like this:

  * select col from t /* +TABLE_NUM_ROWS(1000) */;

If set, Impala will use this value as the number of scanned rows
when the table has no stats or has corrupt stats. The hint value
is ignored if the table stats are valid.
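
The precedence described above can be sketched as follows. This is only an
illustration in Python, not the planner code from the patch (which is Java);
the function and parameter names are hypothetical, and -1 is used to mean
"unknown" or "no hint", following the convention mentioned in the review
comments on TableRef.

```python
def effective_num_rows(stats_num_rows, hint_num_rows, stats_corrupt=False):
    """Pick the row count a scan would plan with.

    stats_num_rows: row count from table stats, or -1 if the table has no stats.
    hint_num_rows:  value of the TABLE_NUM_ROWS hint, or -1 if no hint was given.
    stats_corrupt:  True if the stats exist but are known to be corrupt.
    """
    stats_usable = stats_num_rows >= 0 and not stats_corrupt
    if stats_usable:
        # Valid stats always win; the hint is ignored.
        return stats_num_rows
    if hint_num_rows >= 0:
        # Missing or corrupt stats: fall back to the hint.
        return hint_num_rows
    # Neither stats nor hint: the cardinality stays unknown.
    return -1
```

With this rule, a hinted query on a table that already has valid stats plans
exactly as the unhinted query does.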

Testing:
- Added new fe test in 'PlannerTest'
- Added new fe test in 'AnalyzeStmtsTest' for negative cases

Change-Id: I9f0c773f4e67782a1428db64062f68afbd257af7
---
M fe/src/main/java/org/apache/impala/analysis/TableRef.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/KuduScanNode.java
M fe/src/main/java/org/apache/impala/planner/ScanNode.java
M fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java
M fe/src/test/java/org/apache/impala/analysis/AnalyzeStmtsTest.java
M fe/src/test/java/org/apache/impala/planner/PlannerTest.java
A testdata/workloads/functional-planner/queries/PlannerTest/table-cardinality-hint.test
8 files changed, 160 insertions(+), 10 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/29/18829/4
--
To view, visit http://gerrit.cloudera.org:8080/18829

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9f0c773f4e67782a1428db64062f68afbd257af7
Gerrit-Change-Number: 18829
Gerrit-PatchSet: 4
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Fucun Chu 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-9482: Support for BINARY columns

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16066 )

Change subject: IMPALA-9482: Support for BINARY columns
..

IMPALA-9482: Support for BINARY columns

This patch adds support for BINARY columns for all table formats with
the exception of Kudu.

In Hive the main difference between STRING and BINARY is that STRING is
assumed to be UTF8 encoded, while BINARY can be any byte array.
Some other differences in Hive:
- BINARY can be only cast from/to STRING
- Only a small subset of built-in STRING functions support BINARY.
- In several file formats (e.g. text) BINARY is base64 encoded.
- No NDV is calculated during COMPUTE STATISTICS.
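
To illustrate why a text format would base64 encode BINARY values, here is a
small Python sketch. It only demonstrates the encoding itself; the helper
names are hypothetical and this is not how Hive or Impala implement it.

```python
import base64


def encode_binary_for_text(value: bytes) -> str:
    """Encode arbitrary bytes as an ASCII-only field for a text file."""
    return base64.b64encode(value).decode("ascii")


def decode_binary_from_text(field: str) -> bytes:
    """Recover the original bytes from a base64 encoded field."""
    return base64.b64decode(field)


# Bytes that are not valid UTF-8 and that contain a newline and a comma,
# both of which could be mistaken for row or field delimiters in a text file.
raw = b"\x00\xff\n,binary"
field = encode_binary_for_text(raw)
```

The encoded field contains only characters from the base64 alphabet, so it
cannot collide with common delimiters, and decoding it returns the original
bytes unchanged.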

As Impala doesn't treat STRINGs as UTF8, BINARY and STRING become nearly
identical, especially from the backend's perspective. For this reason,
BINARY is implemented a bit differently compared to other types:
while the frontend treats STRING and BINARY as two separate types, most
of the backend uses PrimitiveType::TYPE_STRING for BINARY too, e.g.
in SlotDesc. Only the following parts of backend need to differentiate
between STRING and BINARY:
- table scanners
- table writers
- HS2/Beeswax service
These parts have access to column metadata, which allows adding special
handling for BINARY.

Only a very few builtins are allowed for BINARY at the moment:
- length
- min/max/count
- coalesce and similar "selector" functions
Other STRING functions can be only used by casting to STRING first.
Adding support for more of these functions is very easy, as the BINARY
type simply has to be "connected" to the already existing STRING
function's signature. Functions where the result depends on utf8_mode
need to ensure that with BINARY it always works as if utf8_mode=0 (for
example length() is mapped to bytes(), as length() counts UTF-8
characters if utf8_mode=1).
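
The difference between counting characters and counting bytes can be shown
with a short Python sketch. The two helpers below are hypothetical stand-ins
for the behaviour described above (character count with utf8_mode=1, byte
count for BINARY); they are not Impala's implementation.

```python
def char_length(value: str) -> int:
    """Number of Unicode characters, like a UTF-8 aware length()."""
    return len(value)


def byte_length(value: str) -> int:
    """Number of bytes in the UTF-8 encoding, like bytes()."""
    return len(value.encode("utf-8"))


# "h\u00e9llo" is "hello" with an accented e, which takes two bytes in UTF-8.
sample = "h\u00e9llo"
```

For the sample string the character count is 5 while the byte count is 6,
which is why a BINARY value must always be measured in bytes.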

All kinds of UDFs (native, Hive legacy, Hive generic) support BINARY,
though in case of legacy Hive UDFs it is only supported if the argument
and return types are set explicitly to ensure backward compatibility.
See IMPALA-11340 for details.

The original plan was to behave as close to Hive as possible, but I
realized that Hive has more relaxed casting rules than Impala, which
led to STRING<->BINARY casts being necessary in more cases in Impala.
This was needed to disallow passing a BINARY to functions that expect
a STRING argument. An example for the difference is that in
INSERT ... VALUES () string literals need to be explicitly cast to
BINARY, while this is not needed in Hive.

Testing:
- Added functional.binary_tbl for all file formats (except Kudu)
  to test scanning.
- Removed functional.unsupported_types and related tests, as now
  Impala supports all (non-complex) types that Hive does.
- Added FE/EE tests mainly based on the ones added to the DATE type

Change-Id: I36861a9ca6c2047b0d76862507c86f7f153bc582
Reviewed-on: http://gerrit.cloudera.org:8080/16066
Reviewed-by: Quanlong Huang 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/file-metadata-utils.cc
M be/src/exec/hbase/hbase-scan-node.cc
M be/src/exec/hbase/hbase-scan-node.h
M be/src/exec/hbase/hbase-table-writer.cc
M be/src/exec/hdfs-scanner-ir.cc
M be/src/exec/hdfs-scanner.cc
M be/src/exec/hdfs-text-table-writer.cc
M be/src/exec/orc/orc-metadata-utils.cc
M be/src/exec/parquet/hdfs-parquet-table-writer.cc
M be/src/exec/parquet/parquet-metadata-utils.cc
M be/src/exec/parquet/parquet-metadata-utils.h
M be/src/exec/rcfile/hdfs-rcfile-scanner.cc
M be/src/exec/text-converter.cc
M be/src/exec/text-converter.h
M be/src/exec/text-converter.inline.h
M be/src/exec/text/hdfs-text-scanner.cc
M be/src/exprs/expr-test.cc
M be/src/exprs/utility-functions-ir.cc
M be/src/exprs/utility-functions.h
M be/src/runtime/descriptors.cc
M be/src/runtime/descriptors.h
M be/src/runtime/types.cc
M be/src/runtime/types.h
M be/src/service/hs2-util.cc
M be/src/service/hs2-util.h
M be/src/service/impala-beeswax-server.cc
M be/src/service/impala-hs2-server.cc
M be/src/service/query-result-set.cc
M be/src/testutil/test-udfs.cc
M be/src/util/coding-util.cc
M be/src/util/coding-util.h
M bin/rat_exclude_files.txt
M common/function-registry/impala_functions.py
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/analysis/CastExpr.java
M fe/src/main/java/org/apache/impala/analysis/ComputeStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/InPredicate.java
M fe/src/main/java/org/apache/impala/analysis/LikePredicate.java
M fe/src/main/java/org/apache/impala/analysis/LiteralExpr.java
M fe/src/main/java/org/apache/impala/analysis/SlotRef.java
M fe/src/main/java/org/apache/impala/catalog/BuiltinsDb.java
M fe/src/main/java/org/apache/impala/catalog/ColumnStats.java
M fe/src/main/java/org/apache/impala/catalog/Function.java
M fe/src/main/java/org/apache/impala/catalog/PrimitiveType.java
M 

[Impala-ASF-CR] IMPALA-11429: Set table owner after creating an Iceberg table

2022-08-20 Thread Gergely Fürnstáhl (Code Review)
Gergely Fürnstáhl has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18837 )

Change subject: IMPALA-11429: Set table owner after creating an Iceberg table
..


Patch Set 5: Code-Review+1

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18837/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/18837/5/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@3626
PS5, Line 3626: has
nit: "is" feels a bit more natural



--
To view, visit http://gerrit.cloudera.org:8080/18837

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5cac198a4a53be3599cb582864ee5f8c269202c0
Gerrit-Change-Number: 18837
Gerrit-PatchSet: 5
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Fri, 19 Aug 2022 13:28:30 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11294: Remove dictionary runtime filter test workarounds

2022-08-20 Thread Tamas Mate (Code Review)
Tamas Mate has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18839 )

Change subject: IMPALA-11294: Remove dictionary runtime filter test workarounds
..


Patch Set 3:

Failure caused by some infra issues:
 Creating placeholder flownodes because failed loading originals.
java.io.IOException: Tried to load head FlowNodes for execution
 Owner[parallel-all-tests/10340:parallel-all-tests #10340] but FlowNode was not 
found in storage for head id:FlowNodeId 1:19
at org.jenkinsci.plugins.workflow.cps.CpsFlowExecution.initializeStorage(CpsFlowExecution.java:679)
 ...


--
To view, visit http://gerrit.cloudera.org:8080/18839

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Gerrit-Change-Number: 18839
Gerrit-PatchSet: 3
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 14:47:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11436: Change search bind authentication parameters

2022-08-20 Thread Tamas Mate (Code Review)
Tamas Mate has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/18819 )

Change subject: IMPALA-11436: Change search bind authentication parameters
..

IMPALA-11436: Change search bind authentication parameters

Impala's search bind authentication intends to mimic Spring's behaviour.
However, the login username and user dn parameters were swapped for
group searches compared to Spring. This change intends to align these
parameters.

For user search, Spring uses {0} to replace the login username.
Meanwhile, during group search {0} is used to replace the login user dn
and {1} is used to replace the login username.
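
The placeholder convention above can be sketched in Python as follows. The
filter templates and helper names are hypothetical examples, not Impala's or
Spring's code, and a real implementation would also need to escape the
substituted values for LDAP.

```python
import re

# Matches positional placeholders such as {0} and {1}.
_PLACEHOLDER = re.compile(r"\{(\d+)\}")


def fill_placeholders(template, *values):
    """Replace {N} with values[N] in a single pass; unknown indexes are left as-is."""
    def substitute(match):
        index = int(match.group(1))
        return values[index] if index < len(values) else match.group(0)
    return _PLACEHOLDER.sub(substitute, template)


def user_search_filter(template, username):
    """User search: {0} is the login username."""
    return fill_placeholders(template, username)


def group_search_filter(template, user_dn, username):
    """Group search: {0} is the login user dn, {1} is the login username."""
    return fill_placeholders(template, user_dn, username)
```

For example, group_search_filter("(&(member={0})(memberUid={1}))", dn, "alice")
puts the user dn into the member clause and the plain username into the
memberUid clause. Substituting in a single pass means a value that itself
contains "{1}" is not expanded a second time.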

Testing:
 - Ran LdapSearchBindImpalaShellTest frontend tests

Change-Id: I9808566a348f7c6200b0571fbc05e67f720f2075
Reviewed-on: http://gerrit.cloudera.org:8080/18819
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/util/ldap-search-bind.cc
M docs/topics/impala_ldap.xml
M fe/src/test/java/org/apache/impala/customcluster/LdapSearchBindImpalaShellTest.java
3 files changed, 30 insertions(+), 23 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/18819

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I9808566a348f7c6200b0571fbc05e67f720f2075
Gerrit-Change-Number: 18819
Gerrit-PatchSet: 6
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-7942 (part 1): Add query hints for table cardinalities

2022-08-20 Thread wangsheng (Code Review)
wangsheng has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18829 )

Change subject: IMPALA-7942 (part 1): Add query hints for table cardinalities
..


Patch Set 4:

(13 comments)

Hi Quanlong, thanks for the advice. I think you are right: using the hint 
value to replace the original table stats may cause inconsistency when using 
EXPLAIN. So I modified the code so that the table hint is only valid when the 
table has no stats or has corrupt stats. One problem remains: I use 
'functional.alltypes' for an HDFS table with stats, 
'functional_parquet.alltypes' for an HDFS table without stats, and 
'functional_kudu.alltypes' for a Kudu table with stats.
But I could not figure out a way to test a Kudu table without stats.

http://gerrit.cloudera.org:8080/#/c/18829/3//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18829/3//COMMIT_MSG@10
PS3, Line 10: query planning.
> nit: generation? or "query planning", "query optimization"
Done


http://gerrit.cloudera.org:8080/#/c/18829/3//COMMIT_MSG@19
PS3, Line 19: l not
: valid if table stat
> nit: regardless the existense of the stats.
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java
File fe/src/main/java/org/apache/impala/analysis/TableRef.java:

http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@173
PS3, Line 173:   // Value of query hint 'TABLE_NUM_ROWS' on this table. Used in 
constructing ScanNode if
 :   // the table does not have stats, or has correct stats. -1 
indicates no hint. Currently,
 :   // this hint is valid for hd
> nit: might be better to reword to
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@510
PS3, Line 510:
> nit: isTableHintSupported
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@514
PS3, Line 514: estTable() != null &&
> nit: reword to
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@518
PS3, Line 518: for (PlanHint hint: tableHints_) {
> Does this mean we support such hints for Kudu tables now? I think the SCHED
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@555
PS3, Line 555: analyzer.setHasPlanHints();
> nit: can we remove this comment? It seems no need to explain the following
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@556
PS3, Line 556: Long.parseLo
> nit: can use Long.parseLong() directly, which is used internally in Long.va
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/analysis/TableRef.java@564
PS3, Line 564: Returns whether the table supports hint. Currently,
> nit: reword to
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1465
PS3, Line 1465:   cardinality_ = extrapolatedNumRows_;
> Should we overwrite this as well if the hint exists?
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1542
PS3, Line 1542:* partitions with corrupt stats.
> Could you please mention the hint in this comment?
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1573
PS3, Line 1573: // by each of the partitions, as the row count for the 
table.
> I thought we only use the hint when missing stats. This always overwrites t
Done


http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/ScanNode.java
File fe/src/main/java/org/apache/impala/planner/ScanNode.java:

http://gerrit.cloudera.org:8080/#/c/18829/3/fe/src/main/java/org/apache/impala/planner/ScanNode.java@81
PS3, Line 81:   // Refer to the comment of 'TableRef.tableNumRowsHint_'
:   protected long tableNumRowsHint_ = -1;
:
:   public ScanNode(PlanNodeId id, TupleDes
> nit: maybe we can just refer to the comment of TableRef.tableNumRowsHint_
Done



--
To view, visit http://gerrit.cloudera.org:8080/18829

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9f0c773f4e67782a1428db64062f68afbd257af7
Gerrit-Change-Number: 18829
Gerrit-PatchSet: 4
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Fucun Chu 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Fri, 19 Aug 2022 13:06:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11436: Change search bind authentication parameters

2022-08-20 Thread Tamas Mate (Code Review)
Tamas Mate has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/18819 )

Change subject: IMPALA-11436: Change search bind authentication parameters
..

IMPALA-11436: Change search bind authentication parameters

Impala's search bind authentication is intended to mimic Spring's behaviour.
However, the login username and user dn parameters were swapped for
group searches compared to Spring. This change aligns these
parameters with Spring.

For user search, Spring uses {0} to replace the login username.
Meanwhile, during group search {0} is used to replace the login user dn
and {1} is used to replace the login username.
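
A minimal sketch of the placeholder convention described above, not the
actual Impala or Spring code. The helper name fill_filter, the filter
templates, and the user values are all hypothetical; a real implementation
would also need to escape LDAP special characters in the substituted values.

```python
def fill_filter(template, *values):
    """Replace {0}, {1}, ... in a filter template with positional values."""
    result = template
    for index, value in enumerate(values):
        result = result.replace("{%d}" % index, value)
    return result


# Hypothetical inputs, for illustration only.
username = "alice"
user_dn = "uid=alice,ou=people,dc=example,dc=com"

# User search: {0} is the login username.
user_filter = fill_filter("(uid={0})", username)

# Group search: {0} is the login user dn, {1} is the login username.
group_filter = fill_filter("(&(member={0})(memberUid={1}))", user_dn, username)
```

With the swapped ordering, the group filter would have received the username
where the dn is expected, which is the mismatch this change addresses.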

Testing:
 - Ran LdapSearchBindImpalaShellTest frontend tests

Change-Id: I9808566a348f7c6200b0571fbc05e67f720f2075
---
M be/src/util/ldap-search-bind.cc
M docs/topics/impala_ldap.xml
M 
fe/src/test/java/org/apache/impala/customcluster/LdapSearchBindImpalaShellTest.java
3 files changed, 30 insertions(+), 23 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/19/18819/5
--
To view, visit http://gerrit.cloudera.org:8080/18819
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9808566a348f7c6200b0571fbc05e67f720f2075
Gerrit-Change-Number: 18819
Gerrit-PatchSet: 5
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-9482: Support for BINARY columns

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16066 )

Change subject: IMPALA-9482: Support for BINARY columns
..


Patch Set 26: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16066
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I36861a9ca6c2047b0d76862507c86f7f153bc582
Gerrit-Change-Number: 16066
Gerrit-PatchSet: 26
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 13:55:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 11:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18798/11/be/src/runtime/row-batch.h
File be/src/runtime/row-batch.h:

http://gerrit.cloudera.org:8080/#/c/18798/11/be/src/runtime/row-batch.h@122
PS11, Line 122: }
add DCHECK(mem_allocator_ == locking_free_pool);



-- 
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 11
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 19 Aug 2022 15:15:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11424: Support pushdown non-equi join predicate from OUTER/INNER JOIN to SCANNODE

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18731 )

Change subject: IMPALA-11424: Support pushdown non-equi join predicate from 
OUTER/INNER JOIN to SCANNODE
..


Patch Set 6:

(23 comments)

Thanks for your contribution, Baike! This is an important improvement. I still 
need some time to finish my first round of review. Left some comments first.

http://gerrit.cloudera.org:8080/#/c/18731/6//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18731/6//COMMIT_MSG@9
PS6, Line 9: In order to reduce the amount of data read and transmitted, the 
non-equivalent condition of Join can be pushed to SCAN_NODE.
nit: each line of the commit message should have 72 or fewer characters. The 
commit title is ok.


http://gerrit.cloudera.org:8080/#/c/18731/6//COMMIT_MSG@16
PS6, Line 16:
Please introduce the new query option, ENABLE_NONE_EQUAL_PREDICATE_PUSH_DOWN, 
in the commit message.


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java
File fe/src/main/java/org/apache/impala/planner/HashJoinNode.java:

http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@142
PS6, Line 142: pushdownNonEquiConjunct(analyzer);
Should we move these before computeStats() at line 136 to have better 
cardinality?


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@463
PS6, Line 463:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@507
PS6, Line 507:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@580
PS6, Line 580:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@592
PS6, Line 592:   if (!(predicate.getChild(0) instanceof LiteralExpr
             :       && !(predicate.getChild(0) instanceof NullLiteral))
             :     && !(predicate.getChild(1) instanceof LiteralExpr
             :       && !(predicate.getChild(1) instanceof NullLiteral))) {
This is not that readable. We can simplify it to

  if (!Expr.IS_NON_NULL_LITERAL.apply(predicate.getChild(0))
      && !Expr.IS_NON_NULL_LITERAL.apply(predicate.getChild(1))) {
    continue;
  }


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@631
PS6, Line 631: groupOtherJoinConjunctsAccordingToSlotRef
This function is similar to the above one. Can we refactor them into one?


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@632
PS6, Line 632:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@675
PS6, Line 675:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@731
PS6, Line 731:
nit: 4 spaces indent


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@746
PS6, Line 746:   if (child instanceof LiteralExpr) {
             :     LiteralExpr currentValue = (LiteralExpr) child;
             :     if (minValue == null) {
             :       minValue = currentValue;
             :     } else {
             :       if (minValue.compareTo(currentValue) > 0) {
             :         minValue = currentValue;
             :       }
             :     }
             :   }
This is a common pattern in the code. We can extract it into a method to 
remove some duplication.

EDIT: the code structure in getMaxLiteralFromPredicates() looks better. It'd be 
nice if we can refactor them into one method.


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@775
PS6, Line 775: < 0
Shouldn't this be "> 0" ?


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@784
PS6, Line 784: i = 1
Could you explain why we don't need "i = 0" ?


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@785
PS6, Line 785: !(predicate.getChild(i) instanceof LiteralExpr)
             :   || (predicate.getChild(i) instanceof NullLiteral)
This can be simplified to

 !Expr.IS_NON_NULL_LITERAL.apply(predicate.getChild(i))


http://gerrit.cloudera.org:8080/#/c/18731/6/fe/src/main/java/org/apache/impala/planner/HashJoinNode.java@796
PS6, Line 796: minValue = literalValues.get(0);
Shouldn't we update 'minValue' instead of replacing it? There could be more 
than one IN-list, e.g. "x in [1, 2, 3] and x in [0, 1]". It 

[Impala-ASF-CR] IMPALA-11294: Remove dictionary runtime filter test workarounds

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18839 )

Change subject: IMPALA-11294: Remove dictionary runtime filter test workarounds
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8475/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/18839
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Gerrit-Change-Number: 18839
Gerrit-PatchSet: 4
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 14:48:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11294: Remove dictionary runtime filter test workarounds

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18839 )

Change subject: IMPALA-11294: Remove dictionary runtime filter test workarounds
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18839
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Gerrit-Change-Number: 18839
Gerrit-PatchSet: 4
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 14:48:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11469: (Addendum) Refactor BackendConfig for tests

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has abandoned this change. ( 
http://gerrit.cloudera.org:8080/18828 )

Change subject: IMPALA-11469: (Addendum) Refactor BackendConfig for tests
..


Abandoned

No strong reason to refactor this. I prefer avoiding statics where possible, 
but I think that's mostly aesthetic in this case.
--
To view, visit http://gerrit.cloudera.org:8080/18828
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: I802fcbf70808f2127197f720a7247c3d85389d6f
Gerrit-Change-Number: 18828
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Xiang Yang 


[Impala-ASF-CR] IMPALA-7942 (part 1): Add query hints for table cardinalities

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18829 )

Change subject: IMPALA-7942 (part 1): Add query hints for table cardinalities
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11191/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18829
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9f0c773f4e67782a1428db64062f68afbd257af7
Gerrit-Change-Number: 18829
Gerrit-PatchSet: 4
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Fucun Chu 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Fri, 19 Aug 2022 13:23:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10213: Add test for local vs remote scheduling

2022-08-20 Thread Kurt Deschler (Code Review)
Kurt Deschler has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18841 )

Change subject: IMPALA-10213: Add test for local vs remote scheduling
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18841/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java:

http://gerrit.cloudera.org:8080/#/c/18841/3/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1385
PS3, Line 1385:   if (fsType != FileSystemUtil.FsType.OZONE) ++numScanRangesNoDiskIds_;
Add {}



--
To view, visit http://gerrit.cloudera.org:8080/18841
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4e5606528404c3d4fd164c03dec8315345be5f6d
Gerrit-Change-Number: 18841
Gerrit-PatchSet: 3
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 19 Aug 2022 17:59:14 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..

IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when 
num_values is 0

Fix an Impalad crash in the method ParquetBoolDecoder::SkipValues when
the parameter 'num_values' is 0. The function should tolerate
'num_values' being 0.

Testing:
 - Add e2e tests

Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Reviewed-on: http://gerrit.cloudera.org:8080/18854
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/parquet/parquet-bool-decoder.cc
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-page-index-bugfix.test
M tests/query_test/test_parquet_stats.py
3 files changed, 35 insertions(+), 1 deletion(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 6
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-10214: Ozone remote file handle caching

2022-08-20 Thread Kurt Deschler (Code Review)
Kurt Deschler has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18853 )

Change subject: IMPALA-10214: Ozone remote file handle caching
..


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18853/4/tests/custom_cluster/test_hdfs_fd_caching.py
File tests/custom_cluster/test_hdfs_fd_caching.py:

http://gerrit.cloudera.org:8080/#/c/18853/4/tests/custom_cluster/test_hdfs_fd_caching.py@127
PS4, Line 127: s.connect(('10.254.254.254', 1))
Hard-coded address?



--
To view, visit http://gerrit.cloudera.org:8080/18853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9df13208999c6d3b14f4c005a91ee2a92a05bdf9
Gerrit-Change-Number: 18853
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Comment-Date: Fri, 19 Aug 2022 18:00:27 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has uploaded a new patch set (#13). ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..

IMPALA-6684: Fix untracked memory in KRPC

During serialization of a row batch header, a tuple_data_ is created
which will hold the compressed tuple data for an outbound row batch.
We would like this tuple data to be trackable as it is responsible for
a significant portion of untrackable memory from the krpc data stream
sender. By using a free pool, we are able to allocate tuple data and
compression scratch and account for it in the memory tracker of the
KrpcDataStreamSender. This solution creates a RAII class responsible
for memory allocation and changes the existing code to use a char buffer
pointed to by a char* tuple_data_ instead of the previously used
std::string tuple_data_. The thrift implementation is left unchanged and
the protobuf implementation is separated.

Testing:
 - Passed core tests.
 - Ran a single node benchmark which shows no regression.
 - Updated row-batch-serialize-test and row-batch-serialize-benchmark to
   test the row-batch serialization used by KRPC.
 - Manually collected query-profile, heap growth, and memory usage log
   showing untracked memory decreased by 1/2.
 - Add end-end unit-test to verify the new counters in runtime profile

New row-batch serialization benchmark:

serialize:
Func                            10%ile  50%ile  90%ile  10%ile  50%ile  90%ile
                                                         (rel)   (rel)   (rel)
------------------------------------------------------------------------------
ser_no_dups_baseline              8.36     8.6     8.7      1X      1X      1X
ser_no_dups                       6.73    6.85    6.93  0.804X  0.796X  0.796X
ser_no_dups_full                  5.28    5.38    5.55  0.631X  0.625X  0.637X

ser_adjacent_dups_baseline        12.9    13.2    13.4      1X      1X      1X
ser_adjacent_dups                 23.2    23.7    24.1    1.8X    1.8X    1.8X
ser_adjacent_dups_full            19.9    20.3    20.7   1.54X   1.54X   1.55X

ser_dups_baseline                 9.17    9.54    9.72      1X      1X      1X
ser_dups                          7.45    7.69    7.86  0.812X  0.806X  0.809X
ser_dups_full                     14.6      15    15.3    1.6X   1.57X   1.57X

deserialize:
Func                            10%ile  50%ile  90%ile  10%ile  50%ile  90%ile
                                                         (rel)   (rel)   (rel)
------------------------------------------------------------------------------
deser_no_dups_baseline            32.6    33.5      34      1X      1X      1X
deser_no_dups                     32.5    33.1    33.7  0.999X   0.99X  0.992X

deser_adjacent_dups_baseline      53.1      54    54.7      1X      1X      1X
deser_adjacent_dups               80.3    81.6    82.5   1.51X   1.51X   1.51X

deser_dups_baseline               52.4      54    54.7      1X      1X      1X
deser_dups                        86.8    88.4    89.7   1.66X   1.64X   1.64X

Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
---
M be/src/benchmarks/row-batch-serialize-benchmark.cc
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
M be/src/runtime/row-batch-serialize-test.cc
M be/src/runtime/row-batch.cc
M be/src/runtime/row-batch.h
A be/src/runtime/row-batch.inline.h
A testdata/workloads/tpch/queries/datastream-sender.test
A tests/query_test/test_datastream_sender.py
9 files changed, 655 insertions(+), 214 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/18798/13
--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-10213: Add test for local vs remote scheduling

2022-08-20 Thread Michael Smith (Code Review)
Hello Kurt Deschler, Joe McDonnell, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18841

to look at the new patch set (#4).

Change subject: IMPALA-10213: Add test for local vs remote scheduling
..

IMPALA-10213: Add test for local vs remote scheduling

Impala already supports locality-aware scheduling with Ozone because it
returns location data on partitions. That data doesn't include specific
storage ids in getStorageIds, so we skip a warning that will always
trigger on Ozone.

Updates Ozone to add implicit rules mapping localhost -> 127.0.0.1 for
local development. HDFS translates localhost to 127.0.0.1 for host names
in its location data, which Impala will identify as colocated with
executors in the dev environment. Ozone doesn't, and the default Impala
hostname is the machine hostname - not localhost - so without this
change all HDFS access in the minicluster is local but all Ozone access
is remote.

Adds a test to verify local vs remote assignment by using custom
clusters with hostnames that either do or don't match storage hostnames.

Change-Id: I4e5606528404c3d4fd164c03dec8315345be5f6d
---
M be/src/runtime/io/disk-io-mgr.cc
M be/src/scheduling/executor-group.cc
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M tests/common/skip.py
A tests/custom_cluster/test_scheduler_locality.py
M tests/metadata/test_stats_extrapolation.py
6 files changed, 74 insertions(+), 6 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/41/18841/4
--
To view, visit http://gerrit.cloudera.org:8080/18841
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I4e5606528404c3d4fd164c03dec8315345be5f6d
Gerrit-Change-Number: 18841
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 


[Impala-ASF-CR] IMPALA-10214: Ozone remote file handle caching

2022-08-20 Thread Michael Smith (Code Review)
Hello Kurt Deschler, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18853

to look at the new patch set (#5).

Change subject: IMPALA-10214: Ozone remote file handle caching
..

IMPALA-10214: Ozone remote file handle caching

Enables support for caching remote file handles for Ozone. Local file
handles were already cached unintentionally, similar to HDFS. Updates
file handle cache enablement to be more stringent about enabling
caching, similar to what's suggested in IMPALA-10375.

File handle caching is enabled if max_cached_file_handles is non-zero
and any of the following is true:
- HDFS file is local
- HDFS file is remote and cache_remote_file_handles is enabled
- Ozone file is local or remote and cache_ozone_file_handles is enabled
- S3 file is remote and cache_s3_file_handles is enabled
- ABFS file is remote and cache_abfs_file_handles is enabled
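
The rule above can be sketched as a single predicate. This is an
illustrative sketch only, not the actual C++ logic in disk-io-mgr.cc: the
function name, the string file-system labels, and passing flags as a dict
are assumptions; only the flag names come from the commit message.

```python
def file_handle_caching_enabled(fs_type, is_local, flags):
    """Return True if a file handle for this file would be cached.

    fs_type: one of "hdfs", "ozone", "s3", "abfs" (hypothetical labels).
    is_local: whether the file is local to the executor.
    flags: dict of startup flag values, keyed by the names in the commit message.
    """
    # Caching is off entirely when the cache has zero capacity.
    if flags.get("max_cached_file_handles", 0) == 0:
        return False
    if fs_type == "hdfs":
        # Local HDFS files are always cached; remote ones need the flag.
        return is_local or bool(flags.get("cache_remote_file_handles", False))
    if fs_type == "ozone":
        # Ozone uses one flag for both local and remote files.
        return bool(flags.get("cache_ozone_file_handles", False))
    if fs_type == "s3":
        return (not is_local) and bool(flags.get("cache_s3_file_handles", False))
    if fs_type == "abfs":
        return (not is_local) and bool(flags.get("cache_abfs_file_handles", False))
    return False
```

For example, with max_cached_file_handles set to a non-zero value and
cache_ozone_file_handles disabled, a local Ozone file is not cached, whereas
a local HDFS file still is.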

Enables testing Ozone in test_hdfs_fd_caching, and adds a test that
remote caching can be disabled using individual flags.

Change-Id: I9df13208999c6d3b14f4c005a91ee2a92a05bdf9
---
M be/src/runtime/io/disk-io-mgr.cc
M be/src/runtime/io/handle-cache.inline.h
M be/src/runtime/io/scan-range.cc
M tests/custom_cluster/test_hdfs_fd_caching.py
4 files changed, 60 insertions(+), 21 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/53/18853/5
--
To view, visit http://gerrit.cloudera.org:8080/18853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9df13208999c6d3b14f4c005a91ee2a92a05bdf9
Gerrit-Change-Number: 18853
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc
File be/src/runtime/krpc-data-stream-sender.cc:

http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc@1095
PS13, Line 1095: if (outbound_rb_mem_pool_.get() != nullptr)
possibly change to if (UNLIKELY(outbound_rb_mem_pool_.get() != nullptr))

same comment for line 1095


http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc@1103
PS13, Line 1103: delete outbound_rb_free_pool_;
check if outbound_rb_free_pool_ != nullptr



--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 19 Aug 2022 22:01:47 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11294: Remove dictionary runtime filter test workarounds

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/18839 )

Change subject: IMPALA-11294: Remove dictionary runtime filter test workarounds
..

IMPALA-11294: Remove dictionary runtime filter test workarounds

This commit removes a workaround in the dictionary runtime filter tests
that was necessary till IMPALA-11047 got merged.

Testing
 - Executed e2e test locally.

Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Reviewed-on: http://gerrit.cloudera.org:8080/18839
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M tests/query_test/test_runtime_filters.py
1 file changed, 0 insertions(+), 7 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/18839
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Gerrit-Change-Number: 18839
Gerrit-PatchSet: 5
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-10214: Ozone remote file handle caching

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18853 )

Change subject: IMPALA-10214: Ozone remote file handle caching
..


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18853/4/tests/custom_cluster/test_hdfs_fd_caching.py
File tests/custom_cluster/test_hdfs_fd_caching.py:

http://gerrit.cloudera.org:8080/#/c/18853/4/tests/custom_cluster/test_hdfs_fd_caching.py@127
PS4, Line 127: s.connect(('10.254.254.254', 1))
> Hard-coded address?
Ah, the timeout means it doesn't matter if it resolves (we kind of don't want 
it to). I should've left a comment.
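
For context, a sketch of the idiom this line appears to belong to, assuming
the test uses the common UDP-connect trick to discover the local address
(the surrounding test code is not shown in this thread, so the function name
and the fallback value here are hypothetical):

```python
import socket


def guess_local_ip(probe_addr=("10.254.254.254", 1)):
    """Return the local IP the OS would use to reach probe_addr.

    connect() on a UDP socket sends no packets; it only makes the kernel
    pick a route and a source address, so the probe address does not need
    to be reachable.
    """
    s = socket.socket(socket.AF_INET, socket.SOCK_DGRAM)
    try:
        s.settimeout(0)  # never block, whatever happens
        s.connect(probe_addr)
        return s.getsockname()[0]
    except OSError:
        # No usable route (e.g. no network at all): fall back to loopback.
        return "127.0.0.1"
    finally:
        s.close()


local_ip = guess_local_ip()
```

Because nothing is actually sent, the hard-coded address only serves as a
routing hint, which matches the explanation given in the reply.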



--
To view, visit http://gerrit.cloudera.org:8080/18853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9df13208999c6d3b14f4c005a91ee2a92a05bdf9
Gerrit-Change-Number: 18853
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 19 Aug 2022 21:44:18 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11294: Remove dictionary runtime filter test workarounds

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18839 )

Change subject: IMPALA-11294: Remove dictionary runtime filter test workarounds
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18839
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9697f5a1a04160dd17553603b00e15d63b49e2bf
Gerrit-Change-Number: 18839
Gerrit-PatchSet: 4
Gerrit-Owner: Tamas Mate 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Fri, 19 Aug 2022 19:35:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11192/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 19 Aug 2022 20:30:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8476/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Fri, 19 Aug 2022 20:49:20 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11514: Workaround s3 connection timeout issues

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18872 )


Change subject: IMPALA-11514: Workaround s3 connection timeout issues
..

IMPALA-11514: Workaround s3 connection timeout issues

When running on s3, dataload is failing with errors
like "Timeout waiting for connection from pool". The
underlying issue is a subtle issue in the async draining
codepath (HADOOP-18410). As a temporary workaround, this
adds fs.s3a.input.async.drain.threshold=512G to core-site.xml.
This disables the async drain codepath.

Testing:
 - An s3 job passed with this setting

Change-Id: I08d03eb653fdcb6955340519b0cf5ba97b10d590
---
M testdata/cluster/node_templates/common/etc/hadoop/conf/core-site.xml.py
1 file changed, 3 insertions(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/72/18872/1
--
To view, visit http://gerrit.cloudera.org:8080/18872
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I08d03eb653fdcb6955340519b0cf5ba97b10d590
Gerrit-Change-Number: 18872
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18875 )


Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..

IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

Machines that don't have en_US.UTF-8 installed see
issues when running ExprTest.Utf8MaskTest.
This currently impacts the Docker-based tests.
This installs the appropriate language packs
to have en_US.UTF-8 installed.
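
One way to check whether a machine already has the locale, sketched here with
Python's standard locale module. The has_locale() helper is a hypothetical
name for this example and is not part of bootstrap_system.sh.

```python
import locale


def has_locale(name):
    """Return True if the C library can switch LC_CTYPE to `name`."""
    saved = locale.setlocale(locale.LC_CTYPE)  # query the current setting
    try:
        locale.setlocale(locale.LC_CTYPE, name)
        return True
    except locale.Error:
        # The locale is not installed (or the name is not recognized).
        return False
    finally:
        locale.setlocale(locale.LC_CTYPE, saved)  # restore the original


if __name__ == "__main__":
    print("en_US.UTF-8 available:", has_locale("en_US.UTF-8"))
```

On a machine without the language pack, the call is expected to report False
for en_US.UTF-8, which matches the situation described above.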

Testing:
 - Ran docker-based tests and verified that
   ExprTest.Utf8MaskTest passes.

Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
---
M bin/bootstrap_system.sh
1 file changed, 3 insertions(+), 2 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/75/18875/1
--
To view, visit http://gerrit.cloudera.org:8080/18875
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 


[Impala-ASF-CR] IMPALA-10214: Ozone remote file handle caching

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18853 )

Change subject: IMPALA-10214: Ozone remote file handle caching
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11193/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18853
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9df13208999c6d3b14f4c005a91ee2a92a05bdf9
Gerrit-Change-Number: 18853
Gerrit-PatchSet: 5
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 19 Aug 2022 22:06:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10213: Add test for local vs remote scheduling

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18841 )

Change subject: IMPALA-10213: Add test for local vs remote scheduling
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11194/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18841
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I4e5606528404c3d4fd164c03dec8315345be5f6d
Gerrit-Change-Number: 18841
Gerrit-PatchSet: 4
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Fri, 19 Aug 2022 22:12:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11514: Workaround s3 connection timeout issues

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18872 )

Change subject: IMPALA-11514: Workaround s3 connection timeout issues
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11195/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18872
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08d03eb653fdcb6955340519b0cf5ba97b10d590
Gerrit-Change-Number: 18872
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Fri, 19 Aug 2022 23:58:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc
File be/src/runtime/krpc-data-stream-sender.cc:

http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc@1095
PS13, Line 1095: if (outbound_rb_mem_pool_.get() != nullptr)
> possibly change to if(UNLIKELY(outbound_rb_mem_pool.get() != nullptr)
ok


http://gerrit.cloudera.org:8080/#/c/18798/13/be/src/runtime/krpc-data-stream-sender.cc@1103
PS13, Line 1103: delete outbound_rb_free_pool_;
> check if outbound_rb_free_pool_ != nullptr
that will be safe



--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Sat, 20 Aug 2022 02:39:49 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13:

There are 4 end-to-end test failures caused by "memory limit exceeded" errors on 
large rows. This is expected behavior: we now allocate more memory from the 
trackable buffer pool, which especially affects tests with large rows.
To fix these failures, increase mem_limit for these test cases.


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Sat, 20 Aug 2022 02:38:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11196/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18875
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Sat, 20 Aug 2022 01:04:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 13: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8476/


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 13
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Sat, 20 Aug 2022 01:31:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11514: Workaround s3 connection timeout issues

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18872 )

Change subject: IMPALA-11514: Workaround s3 connection timeout issues
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8477/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18872
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I08d03eb653fdcb6955340519b0cf5ba97b10d590
Gerrit-Change-Number: 18872
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Sat, 20 Aug 2022 04:55:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11492: Add langpacks-en (centos) and language-pack-en (Ubuntu)

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18875 )

Change subject: IMPALA-11492: Add langpacks-en (centos) and language-pack-en 
(Ubuntu)
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8478/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18875
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1b8696190e4713bda787e773d48943b5dfc6335e
Gerrit-Change-Number: 18875
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Sat, 20 Aug 2022 04:55:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 4:

(1 comment)

> Patch Set 3:
>
> (1 comment)
>
> lgtm, one comment about code cleanness

Hi all, I have slightly revised patch set 3 according to Csaba's 
suggestion. Please let me know if there are any other comments.

Thank you very much for the help!

http://gerrit.cloudera.org:8080/#/c/18850/3/fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
File fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java:

http://gerrit.cloudera.org:8080/#/c/18850/3/fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java@203
PS3, Line 203:   // to determine whether the correct audit events are collected.
 :   auditHandler.getAuthzEvents().clear();
 : } else {
 :   // We send audit log entries to the Ranger server only if authorization failed or
 :   // analysis succeeded.
> The two conditions are the opposites of each other - can you convert this t
Thanks Csaba!

I will change the above to an if-else block in the next patch.
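
A minimal sketch of the if-else shape discussed here, written in Python rather
than the Java of the actual patch. The function and parameter names are
hypothetical; only the condition ("authorization failed or analysis succeeded")
is taken from the quoted comment.

```python
def should_send_audit_events(authz_failed, analysis_succeeded):
    """Condition quoted in the review: send only if authorization failed
    or analysis succeeded."""
    return authz_failed or analysis_succeeded


def handle_audit_events(events, authz_failed, analysis_succeeded):
    """Return the events to forward, or clear them if none should be sent."""
    if should_send_audit_events(authz_failed, analysis_succeeded):
        return list(events)
    else:
        # The only remaining case: authorization passed but analysis failed,
        # e.g. an authorized query against a non-existing table.
        events.clear()
        return []
```

Because the two branches are exact opposites, a single if-else covers all four
combinations of the two flags.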



--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 17 Aug 2022 16:35:15 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 4: Code-Review+2

Thanks for the changes!


--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 17 Aug 2022 16:45:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] [IMPALA-11500] Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: [IMPALA-11500] Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 1
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 16 Aug 2022 16:24:47 +
Gerrit-HasComments: No


[Impala-ASF-CR] Test review

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has removed a vote on this change.

Change subject: Test review
..


Removed Verified-1 by Impala Public Jenkins 
--
To view, visit http://gerrit.cloudera.org:8080/18860
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: deleteVote
Gerrit-Change-Id: Id1c1742716c4922f125b20d719928e9a7b94ecc8
Gerrit-Change-Number: 18860
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11172/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 17 Aug 2022 16:54:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] WIP: IMPALA-11484: Create SCAN plan for Iceberg V2 position delete tables

2022-08-20 Thread Zoltan Borok-Nagy (Code Review)
Hello Tamas Mate, Gabor Kaszab, lipeng...@sensorsdata.cn, Gergely Fürnstáhl, 
Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18847

to look at the new patch set (#2).

Change subject: WIP: IMPALA-11484: Create SCAN plan for Iceberg V2 position 
delete tables
..

WIP: IMPALA-11484: Create SCAN plan for Iceberg V2 position delete tables

This patch adds support for reading Iceberg V2 tables that use position
deletes. Equality deletes are still not supported. Position delete
files store the file path and file position of the deleted rows.

When an Iceberg table has position delete files we need to do an
ANTI JOIN between data files and delete files. From the data files
we need to query the virtual columns INPUT__FILE__NAME and
FILE__POSITION, while from the delete files we need the data columns
'file_path' and 'pos'. The latter data columns are added as 'hidden
columns' to Iceberg tables. 'Hidden column' is a new concept introduced
by this patch.

This patch introduces a new class 'IcebergScanPlanner', which is
responsible for planning Iceberg table scans. It creates
the aforementioned ANTI JOIN. Also, if there are data files without
corresponding delete files, we can have a separate SCAN node whose
results are UNIONed with the rows coming from the ANTI JOIN:

          UNION
         /     \
  SCAN data   ANTI JOIN
              /       \
       SCAN data   SCAN deletes

Predicate pushdown and time travel logic is transferred from
IcebergScanNode to IcebergScanPlanner.
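
The plan shape above can be illustrated with a small in-memory sketch. This is
not the planner code from the patch: the function name, the tuple layout
(file_path, pos, payload), and the sample file names are assumptions made for
the example.

```python
def scan_with_position_deletes(data_rows, delete_rows):
    """Emulate UNION(SCAN data, ANTI JOIN(SCAN data, SCAN deletes)).

    data_rows:   list of (file_path, pos, payload) tuples
    delete_rows: list of (file_path, pos) tuples from position delete files
    """
    deleted = set(delete_rows)
    files_with_deletes = {path for path, _ in deleted}

    # Left branch: data files without delete files need only a plain scan.
    plain = [r for r in data_rows if r[0] not in files_with_deletes]

    # Right branch: anti join on (file_path, pos); keep rows with no match.
    survivors = [
        r for r in data_rows
        if r[0] in files_with_deletes and (r[0], r[1]) not in deleted
    ]

    # UNION of the two branches.
    return plain + survivors


if __name__ == "__main__":
    data = [("a.parquet", 0, "x"), ("a.parquet", 1, "y"), ("b.parquet", 0, "z")]
    deletes = [("a.parquet", 1)]
    print(scan_with_position_deletes(data, deletes))
```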

TODO:
 * better cardinality estimates
 * add tests

Change-Id: I672cfee18d8e131772d90378d5b12ad4d0f7dd48
---
M common/thrift/CatalogObjects.thrift
M fe/src/main/java/org/apache/impala/analysis/AlterTableSetTblProperties.java
M fe/src/main/java/org/apache/impala/analysis/SelectStmt.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/Column.java
M fe/src/main/java/org/apache/impala/catalog/FeCatalogUtils.java
M fe/src/main/java/org/apache/impala/catalog/FeFsTable.java
M fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalTable.java
M fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java
M fe/src/main/java/org/apache/impala/planner/IcebergScanNode.java
A fe/src/main/java/org/apache/impala/planner/IcebergScanPlanner.java
M fe/src/main/java/org/apache/impala/planner/SingleNodePlanner.java
M fe/src/main/java/org/apache/impala/planner/UnionNode.java
M fe/src/main/java/org/apache/impala/util/IcebergUtil.java
M testdata/data/iceberg_test/hadoop_catalog/ice/iceberg_v2_delete_positional/data/00191-4-6e780302-527b-4911-8c6e-88d416adac57-1.parquet
18 files changed, 850 insertions(+), 537 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/47/18847/2
--
To view, visit http://gerrit.cloudera.org:8080/18847
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I672cfee18d8e131772d90378d5b12ad4d0f7dd48
Gerrit-Change-Number: 18847
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[native-toolchain-CR] IMPALA-9999: Build the toolchain with GCC 10

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18806 )

Change subject: IMPALA-9999: Build the toolchain with GCC 10
..


Patch Set 3: Code-Review+1


--
To view, visit http://gerrit.cloudera.org:8080/18806
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: native-toolchain
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2fbfcdcb497ad60772bf0a7837e4043ae75bcfaa
Gerrit-Change-Number: 18806
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Wed, 17 Aug 2022 15:45:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] WIP: IMPALA-11484: Create SCAN plan for Iceberg V2 position delete tables

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18847 )

Change subject: WIP: IMPALA-11484: Create SCAN plan for Iceberg V2 position 
delete tables
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11171/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18847
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I672cfee18d8e131772d90378d5b12ad4d0f7dd48
Gerrit-Change-Number: 18847
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Gergely Fürnstáhl 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Wed, 17 Aug 2022 14:32:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..

IMPALA-6684: Fix untracked memory in KRPC

During serialization of a row batch header, a tuple_data_ is created
which will hold the compressed tuple data for an outbound row batch.
We would like this tuple data to be trackable as it is responsible for
a significant portion of untrackable memory from the krpc data stream
sender. By using free pool, we are able to allocate tuple data and
compression scratch and account for it in the memory tracker of the
KrpcDataStreamSender. This solution creates a RAII class responsible
for memory allocation and changes the existing code to use a char buffer
pointed to by a char* tuple_data_ instead of the previously used
std::string tuple_data_. The thrift implementation is left unchanged and
the protobuf implementation is separated.
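
The idea of tying a buffer's lifetime to memory accounting can be sketched as
follows. Python has no RAII, so a context manager stands in as the nearest
analogue. The MemTracker class and tracked_buffer() helper below are
hypothetical and do not reflect Impala's actual C++ classes or their APIs.

```python
from contextlib import contextmanager


class MemTracker:
    """Toy tracker that records current and peak consumption in bytes."""

    def __init__(self):
        self.consumption = 0
        self.peak = 0

    def consume(self, num_bytes):
        self.consumption += num_bytes
        self.peak = max(self.peak, self.consumption)

    def release(self, num_bytes):
        self.consumption -= num_bytes


@contextmanager
def tracked_buffer(tracker, size):
    """Allocate a buffer and account for it until the block exits."""
    tracker.consume(size)
    buf = bytearray(size)
    try:
        yield buf
    finally:
        # Runs on normal exit and on exceptions, like a destructor would.
        tracker.release(size)
```

A caller would wrap serialization in "with tracked_buffer(tracker, n) as buf:"
so that the bytes are counted for exactly as long as the buffer is in use.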

Testing:
 - Passed core tests.
 - Ran a single node benchmark which shows no regression.
 - Updated row-batch-serialize-test and row-batch-serialize-benchmark to
   test the row-batch serialization used by KRPC.
 - Manually collected query-profile, heap growth, and memory usage log
   showing untracked memory decreased by 1/2.
 - Added an end-to-end test to verify the new counters in the runtime profile.

Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
---
M be/src/benchmarks/row-batch-serialize-benchmark.cc
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
M be/src/runtime/row-batch-serialize-test.cc
M be/src/runtime/row-batch.cc
M be/src/runtime/row-batch.h
A be/src/runtime/row-batch.inline.h
A testdata/workloads/functional-query/queries/datastream-sender.test
A testdata/workloads/tpch/queries/datastream-sender.test
A tests/query_test/test_datastream_sender.py
10 files changed, 657 insertions(+), 173 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/18798/8
--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 8
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 17 Aug 2022 21:51:52 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded a new patch set (#4). ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..

IMPALA-11494: Don't always produce Ranger audit log for authorized query

Before this patch, when Impala could not resolve a given table
'.' during the query analysis, Impala would still
attempt to register 2 privilege requests. One was for the table
'' under the database '' and the other was for the
table '' under the database 'default'. The first one should be
registered since Impala had to determine whether such an access should
be allowed (even though in fact the table did not exist), whereas the
second one was incorrect in that 'default.' definitely was not
'.' in general.

Furthermore, Impala always sent audit log entries to the Ranger server
for an authorized query against non-existing table(s).

The 2 facts described above resulted in Impala producing Ranger audit
log entries for the tables '.' and
'default.' when a requesting user granted sufficient privileges
on the databases of '' and 'default' submitted a query against
a non-existing table '.'. None of the audit log
entries should be generated because i) the privilege request for
'.' was allowed and '.' did not
exist, and ii) 'default.' did not correspond to a table.

This patch fixes the 2 issues mentioned above so that Impala will not
generate any Ranger audit log entry for an authorized query against a
non-existing table.

Testing:
 - Added a frontend test case to verify no Ranger audit log entry will
   be produced for an authorized query against a non-existing table.

Change-Id: I701652e457d3118f43249e83be933713b17ce48f
---
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/authorization/AuthorizationChecker.java
M fe/src/main/java/org/apache/impala/authorization/BaseAuthorizationChecker.java
M fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
M fe/src/test/java/org/apache/impala/authorization/AuthorizationTestBase.java
M fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
7 files changed, 75 insertions(+), 21 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/50/18850/4
--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8457/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 17 Aug 2022 17:03:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] [IMPALA-11500] Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: [IMPALA-11500] Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 1:

(2 comments)

Thanks for working on this!

http://gerrit.cloudera.org:8080/#/c/18854/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18854/1//COMMIT_MSG@7
PS1, Line 7: [IMPALA-11500]
nit: you shouldn't use brackets, first line should be:

IMPALA-11500:


http://gerrit.cloudera.org:8080/#/c/18854/1/be/src/exec/parquet/parquet-column-readers.cc
File be/src/exec/parquet/parquet-column-readers.cc:

http://gerrit.cloudera.org:8080/#/c/18854/1/be/src/exec/parquet/parquet-column-readers.cc@396
PS1, Line 396:   if (num_values == 0) return true;
Maybe the solution can be made a bit more future-proof if we make the bool 
decoder's SkipValues() function tolerate 0 values. So if we start using 
SkipValues() at other places we won't run into this crash again.

Actually I think it's already working fine in case of 0, we just need to change 
the initial DCHECK_GT to DCHECK_GE(num_values, 0);
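
To illustrate the suggestion, here is a toy decoder in Python whose
skip_values() accepts 0. It is only a sketch under simplifying assumptions:
the real ParquetBoolDecoder is C++ and its encodings are not modeled here,
and the class and method names below are hypothetical.

```python
class BoolDecoder:
    """Toy decoder over an in-memory list of booleans."""

    def __init__(self, values):
        self._values = list(values)
        self._pos = 0

    def skip_values(self, num_values):
        """Skip num_values entries; return False if too few remain."""
        # Analogous to DCHECK_GE(num_values, 0): zero is a valid no-op.
        assert num_values >= 0
        if num_values > len(self._values) - self._pos:
            return False
        self._pos += num_values
        return True

    def next(self):
        """Return the next value and advance."""
        value = self._values[self._pos]
        self._pos += 1
        return value
```

With the check relaxed to "greater than or equal", callers no longer need a
special case such as returning early when num_values is 0.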



--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 1
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Wed, 17 Aug 2022 11:32:38 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..

IMPALA-11494: Don't always produce Ranger audit log for authorized query

Before this patch, when Impala could not resolve a given table
'.' during the query analysis, Impala would still
attempt to register 2 privilege requests. One was for the table
'' under the database '' and the other was for the
table '' under the database 'default'. The first one should be
registered since Impala had to determine whether such an access should
be allowed (even though in fact the table did not exist), whereas the
second one was incorrect in that 'default.' definitely was not
'.' in general.

Furthermore, Impala always sent audit log entries to the Ranger server
for an authorized query against non-existing table(s).

The 2 facts described above resulted in Impala producing Ranger audit
log entries for the tables '.' and
'default.' when a requesting user granted sufficient privileges
on the databases of '' and 'default' submitted a query against
a non-existing table '.'. None of the audit log
entries should be generated because i) the privilege request for
'.' was allowed and '.' did not
exist, and ii) 'default.' did not correspond to a table.

This patch fixes the 2 issues mentioned above so that Impala will not
generate any Ranger audit log entry for an authorized query against a
non-existing table.

Testing:
 - Added a frontend test case to verify no Ranger audit log entry will
   be produced for an authorized query against a non-existing table.

Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Reviewed-on: http://gerrit.cloudera.org:8080/18850
Reviewed-by: Csaba Ringhofer 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/java/org/apache/impala/analysis/AnalysisContext.java
M fe/src/main/java/org/apache/impala/analysis/Analyzer.java
M fe/src/main/java/org/apache/impala/authorization/AuthorizationChecker.java
M fe/src/main/java/org/apache/impala/authorization/BaseAuthorizationChecker.java
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
M fe/src/test/java/org/apache/impala/authorization/AuthorizationTestBase.java
M 
fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
7 files changed, 75 insertions(+), 21 deletions(-)

Approvals:
  Csaba Ringhofer: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 5
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR](asf-site) Update 4.1.0 release notes with more items

2022-08-20 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18845 )

Change subject: Update 4.1.0 release notes with more items
..


Patch Set 1: Code-Review+2

Thanks for adding these items! I checked the links and all looks good!


--
To view, visit http://gerrit.cloudera.org:8080/18845
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: asf-site
Gerrit-MessageType: comment
Gerrit-Change-Id: I44b85afa19806ef0c654eba267f40278d421868c
Gerrit-Change-Number: 18845
Gerrit-PatchSet: 1
Gerrit-Owner: Quanlong Huang 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Wed, 17 Aug 2022 16:53:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9999: Switch to GCC 10.4

2022-08-20 Thread Joe McDonnell (Code Review)
Hello Andrew Sherman, Abhishek Rawat, Michael Smith, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18134

to look at the new patch set (#11).

Change subject: IMPALA-: Switch to GCC 10.4
..

IMPALA-: Switch to GCC 10.4

This upgrades GCC and libstdc++ to version 10.4. This
required patching or upgrading several dependencies
so they could compile with GCC 10. The toolchain
companion change has details on what items needed
to be upgraded and why.

The toolchain companion change switches GCC to build
with toolchain binutils rather than host binutils. This
means that the python virtualenv initialization needs
to include binutils on the path.

This disables two warnings introduced in the new GCC
versions (Wclass-memaccess and Winit-list-lifetime).
These two warnings occur in our code and also in
dependencies like LLVM and rapidjson. These are not
critical warnings, so they can be addressed
independently and reenabled later.

Binary sizes increase, particularly when including
debug symbols:
                         |   GCC 7.5   |  GCC 10.4
impalad RELEASE stripped |  83204768   |  88702824
impalad RELEASE          | 707278904   | 971711456
impalad DEBUG stripped   | 106677672   |  97391944
impalad DEBUG            | 725864760   | 867647512

Testing:
 - Multiple test jobs (core, release exhaustive, ASAN)
 - Performance testing for TPC-H and TPC-DS shows
   a modest improvement (2-4%).
 - Code compiles without warnings on debug and release

Change-Id: Ibe6857b822925226d39fd4d6413457ef6bbaabec
---
M be/CMakeLists.txt
M be/src/codegen/llvm-codegen-test.cc
M be/src/runtime/string-value.inline.h
M be/src/udf/udf.cc
M be/src/util/parquet-reader.cc
M bin/impala-config.sh
M fe/pom.xml
M infra/python/bootstrap_virtualenv.py
8 files changed, 54 insertions(+), 18 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/34/18134/11
--
To view, visit http://gerrit.cloudera.org:8080/18134
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ibe6857b822925226d39fd4d6413457ef6bbaabec
Gerrit-Change-Number: 18134
Gerrit-PatchSet: 11
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 


[Impala-ASF-CR] Test review

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18860 )

Change subject: Test review
..


Patch Set 1: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/8458/


--
To view, visit http://gerrit.cloudera.org:8080/18860
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1c1742716c4922f125b20d719928e9a7b94ecc8
Gerrit-Change-Number: 18860
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Aug 2022 23:45:51 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 7:

(5 comments)

> Patch Set 7:
>
> (4 comments)

http://gerrit.cloudera.org:8080/#/c/18798/7/be/src/benchmarks/row-batch-serialize-benchmark.cc
File be/src/benchmarks/row-batch-serialize-benchmark.cc:

http://gerrit.cloudera.org:8080/#/c/18798/7/be/src/benchmarks/row-batch-serialize-benchmark.cc@38
PS7, Line 38:
: // Benchmark to measure how quickly we can serialize and deserialize row batches. More
: // specifically, this benchmark was developed to measure the overhead of deduplication.
: // The benchmarks are divided into serialization and deserialization benchmarks.
: // The serialization benchmarks test different serialization methods (the new default of
: // adjacent deduplication vs. the baseline of no deduplication) on row batches with
: // different patterns of duplication: no_dups and adjacent_dups.
: // For all benchmarks we use (int, string) tuples to exercise both variable-length and
: // fixed-length slot handling. The small tuples with few slots emphasizes per-tuple
: // dedup performance rather than per-slot serialization/deserialization performance.
: //
: // serialize:                Function   Rate (iters/ms)    Comparison
: // ------------------------------------------------------------------
: //               ser_no_dups_baseline             17.43            1X
: //                        ser_no_dups             17.33       0.9944X
: //                   ser_no_dups_full              14.1       0.8092X
: //
: //         ser_adjacent_dups_baseline             26.65            1X
: //                  ser_adjacent_dups             63.98          2.4X
: //             ser_adjacent_dups_full             55.88        2.096X
: //
: //                  ser_dups_baseline             19.26            1X
: //                           ser_dups             19.55        1.015X
: //                      ser_dups_full              32.4        1.682X
: //
: // deserialize:              Function   Rate (iters/ms)    Comparison
: // ------------------------------------------------------------------
: //             deser_no_dups_baseline             64.94            1X
: //                      deser_no_dups             69.24        1.066X
: //
: //       deser_adjacent_dups_baseline               112            1X
: //                deser_adjacent_dups             207.4        1.852X
: //
: //                deser_dups_baseline             114.8            1X
: //                         deser_dups             208.5        1.817X
: //
: // Earlier results with LossyHashTable
: // serialize:                Function   Rate (iters/ms)    Comparison
: // ------------------------------------------------------------------
: //                  ser_no_dups_lossy             15.93       0.9139X
: //            ser_adjacent_dups_lossy             58.21        2.184X
: //                     ser_dups_lossy             50.46         2.62X
: //
: // Earlier results with boost::unordered_map
: // serialize:                Function   Rate (iters/ms)    Comparison
: // ------------------------------------------------------------------
: //                   ser_no_dups_full              8.73       0.5582X
: //
: //             ser_adjacent_dups_full              38.7        1.634X
: //
: //                      ser_dups_full              27.5         1.54X
Should this be removed and updated with the current benchmark scores?
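
For readers unfamiliar with the term used in the quoted comment, here is a generic sketch of what "adjacent deduplication" means, assuming a plain vector of values rather than Impala's row-batch format. AdjacentDedup is a hypothetical helper, not a function from the patch: an element equal to its immediate predecessor reuses the predecessor's stored payload, while non-adjacent repeats are stored again.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical illustration only; not Impala's serialization code.
// Writes the deduplicated payloads to 'out' and returns, for each input
// element, the index of its payload in 'out'.
template <typename T>
std::vector<std::size_t> AdjacentDedup(const std::vector<T>& in, std::vector<T>* out) {
  assert(out != nullptr);
  out->clear();
  std::vector<std::size_t> offsets;
  offsets.reserve(in.size());
  for (std::size_t i = 0; i < in.size(); ++i) {
    // Only the immediately preceding element is compared, so the check is
    // O(1) per element and needs no hash table.
    if (i == 0 || !(in[i] == in[i - 1])) out->push_back(in[i]);
    offsets.push_back(out->size() - 1);
  }
  return offsets;
}
```

For the input {1, 1, 2, 1} this stores {1, 2, 1}: the second 1 is shared with the first, but the final 1 is stored again because its neighbor differs.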


http://gerrit.cloudera.org:8080/#/c/18798/7/be/src/runtime/row-batch.inline.h
File be/src/runtime/row-batch.inline.h:

http://gerrit.cloudera.org:8080/#/c/18798/7/be/src/runtime/row-batch.inline.h@22
PS7, Line 22:
> extra line
Done


http://gerrit.cloudera.org:8080/#/c/18798/7/be/src/runtime/row-batch.inline.h@34
PS7, Line 34: for_compression
> These three lines check for_compression, you can change code as:
Done


http://gerrit.cloudera.org:8080/#/c/18798/7/testdata/workloads/tpch/queries/datastream-sender.test
File testdata/workloads/tpch/queries/datastream-sender.test:


[Impala-ASF-CR] Test review

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18860 )

Change subject: Test review
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8458/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18860
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Id1c1742716c4922f125b20d719928e9a7b94ecc8
Gerrit-Change-Number: 18860
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Wed, 17 Aug 2022 23:45:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Anonymous Coward (Code Review)
lipeng...@sensorsdata.cn has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 2:

(2 comments)

http://gerrit.cloudera.org:8080/#/c/18854/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/18854/1//COMMIT_MSG@7
PS1, Line 7: IMPALA-11500: F
> nit: you shouldn't use brackets, first line should be:
That is my mistake! I was subconsciously using the style of the company I work for.


http://gerrit.cloudera.org:8080/#/c/18854/1/be/src/exec/parquet/parquet-column-readers.cc
File be/src/exec/parquet/parquet-column-readers.cc:

http://gerrit.cloudera.org:8080/#/c/18854/1/be/src/exec/parquet/parquet-column-readers.cc@396
PS1, Line 396:   if (bool_decoder_) {
> Maybe the solution can be made a bit more future-proof if we make the bool
Done



--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 2
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Thu, 18 Aug 2022 02:58:21 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11207: Use hadoop-cloud-storage for Cloud dependencies

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18817 )

Change subject: IMPALA-11207: Use hadoop-cloud-storage for Cloud dependencies
..


Patch Set 2:

I don't think hadoop-cos is available in the versions of hadoop-cloud-storage 
we pull in yet, so I plan to leave it for now. This seems ready to merge.


--
To view, visit http://gerrit.cloudera.org:8080/18817
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3a1631289f990513823c2b17eb9241cc1b5a7ffd
Gerrit-Change-Number: 18817
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Laszlo Gaal 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Mon, 15 Aug 2022 16:06:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Anonymous Coward (Code Review)
lipeng...@sensorsdata.cn has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..

IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when 
num_values is 0

Fix an Impalad crash in the method ParquetBoolDecoder::SkipValues when
the parameter 'num_values' is 0. The function should tolerate
'num_values' being 0.

Testing:
 - Add e2e tests

Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
---
M be/src/exec/parquet/parquet-bool-decoder.cc
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-page-index-bugfix.test
M tests/query_test/test_parquet_stats.py
3 files changed, 35 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/54/18854/3
--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num values is 0

2022-08-20 Thread Anonymous Coward (Code Review)
lipeng...@sensorsdata.cn has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..

IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when 
num_values is 0

Fix an Impalad crash in the method ParquetBoolDecoder::SkipValues when
the parameter 'num_values' is 0. When 'num_values' is 0,
ParquetBoolDecoder::SkipValues now returns true immediately without
running any other logic.

Testing:
 - Add e2e tests

Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
---
M be/src/exec/parquet/parquet-bool-decoder.cc
A 
testdata/workloads/functional-query/queries/QueryTest/parquet-page-index-bugfix.test
M tests/query_test/test_parquet_stats.py
3 files changed, 35 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/54/18854/2
--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 2
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-10800: Tidy up the be/src/exec directory

2022-08-20 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18815 )

Change subject: IMPALA-10800: Tidy up the be/src/exec directory
..


Patch Set 3: Code-Review+2

Thanks for working on this change!


--
To view, visit http://gerrit.cloudera.org:8080/18815
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie936c400ea8b112073bba892497ab8a1498c418d
Gerrit-Change-Number: 18815
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 15 Aug 2022 10:55:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9482: Support for BINARY columns

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16066 )

Change subject: IMPALA-9482: Support for BINARY columns
..


Patch Set 22:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/16066/14/be/src/runtime/descriptors.h
File be/src/runtime/descriptors.h:

http://gerrit.cloudera.org:8080/#/c/16066/14/be/src/runtime/descriptors.h@256
PS14, Line 256: return col_descs_[slot_desc->col_path().back()];
> Change the analyzer to disallow complex types in select list if they have b
Thanks for digging into this!

Sorry, my initial confusion was that "col_descs_" holds the top-level columns
of the table, but the last item in SchemaPath is not always the top-level
column index. Usually the first item of SchemaPath is the top-level column
index, and the following items are the indexes inside the nested type. E.g. the 6th
column in table complextypestbl is

 nested_struct  struct<
   a: int,
   b: array,
   ...
 >

If the query selects "nested_struct.a" in the SelectList, the corresponding 
SchemaPath is [5, 0]. Here [5] is the SchemaPath of "nested_struct". But we are 
using 0 (the last item) here as the index of col_descs_. So I hope we can add a 
test of selecting the binary column directly inside a struct top level 
column.

Maybe I've missed something. Just explaining my confusion.
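
The indexing concern can be sketched in isolation, assuming a simplified model in which a schema path is just a vector of indexes. SchemaPath as defined here and TopLevelColumn are hypothetical stand-ins, not Impala's types, and the column names in the usage note are placeholders:

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical simplification: the first index selects a top-level column,
// the remaining indexes descend into nested types.
using SchemaPath = std::vector<int>;

// Returns the name of the top-level column that 'path' belongs to.
const std::string& TopLevelColumn(const std::vector<std::string>& top_level_cols,
                                  const SchemaPath& path) {
  assert(!path.empty());
  // front() is the top-level column index. back() is the index inside the
  // innermost nested type and only coincides with front() for paths of
  // length 1.
  return top_level_cols.at(static_cast<std::size_t>(path.front()));
}
```

With six top-level columns and the path [5, 0], front() selects the sixth column, whereas indexing the same list with back() would select the first column, which is a different column.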


http://gerrit.cloudera.org:8080/#/c/16066/14/fe/src/main/java/org/apache/impala/analysis/LikePredicate.java
File fe/src/main/java/org/apache/impala/analysis/LikePredicate.java:

http://gerrit.cloudera.org:8080/#/c/16066/14/fe/src/main/java/org/apache/impala/analysis/LikePredicate.java@116
PS14, Line 116:   private static boolean isLikeableType(Type type) {
> :D poor other types - tbh I don't remember whether the naming was intention
haha


http://gerrit.cloudera.org:8080/#/c/16066/22/testdata/bin/generate-schema-statements.py
File testdata/bin/generate-schema-statements.py:

http://gerrit.cloudera.org:8080/#/c/16066/22/testdata/bin/generate-schema-statements.py@222
PS22, Line 222: 'BINARY': 'bytes'
nit: it'd be nice to add a trailing comma so future changes don't need to touch 
this line.


http://gerrit.cloudera.org:8080/#/c/16066/22/testdata/datasets/functional/functional_schema_template.sql
File testdata/datasets/functional/functional_schema_template.sql:

http://gerrit.cloudera.org:8080/#/c/16066/22/testdata/datasets/functional/functional_schema_template.sql@3532
PS22, Line 3532: binary_in_complex_types
Can we add some data to this table and add some e2e tests? e.g.

 select binary_member_col.b from binary_in_complex_types;
 select a.item from binary_in_complex_types t, t.binary_item_col;
 select m.key from binary_in_complex_types t, t.binary_key_col;
 select m.value from binary_in_complex_types t, t.binary_value_col;



--
To view, visit http://gerrit.cloudera.org:8080/16066
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I36861a9ca6c2047b0d76862507c86f7f153bc582
Gerrit-Change-Number: 16066
Gerrit-PatchSet: 22
Gerrit-Owner: Csaba Ringhofer 
Gerrit-Reviewer: Attila Jeges 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Steve Carlin 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Thu, 18 Aug 2022 09:43:26 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9999: Switch to GCC 10.4

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18134 )

Change subject: IMPALA-: Switch to GCC 10.4
..


Patch Set 11:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18134/10/be/src/runtime/string-value.inline.h
File be/src/runtime/string-value.inline.h:

http://gerrit.cloudera.org:8080/#/c/18134/10/be/src/runtime/string-value.inline.h@41
PS10, Line 41:   // GCC gives a warning about overflowing the size argument of 
memcmp, because
> The int type is coming from StringValue/StringVal's len field. These are us
I like using len <= 0 to cover it.



--
To view, visit http://gerrit.cloudera.org:8080/18134
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibe6857b822925226d39fd4d6413457ef6bbaabec
Gerrit-Change-Number: 18134
Gerrit-PatchSet: 11
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 18 Aug 2022 05:26:51 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9999: Switch to GCC 10.4

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18134 )

Change subject: IMPALA-: Switch to GCC 10.4
..


Patch Set 10:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18134/10/be/src/runtime/string-value.inline.h
File be/src/runtime/string-value.inline.h:

http://gerrit.cloudera.org:8080/#/c/18134/10/be/src/runtime/string-value.inline.h@41
PS10, Line 41:   // GCC gives a warning about overflowing the size argument of 
memcmp, because
> This isn't something we want to update to use an unsigned int?
The int type is coming from StringValue/StringVal's len field. These are used 
all over the place, and StringVal is part of our UDF interface. My general read 
is that switching to unsigned would be a lot of work that doesn't get us a lot.

I'm open to ideas. Another alternative is to have:
const int result = (len <= 0) ? 0 : memcmp(s1, s2, len)
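
A self-contained sketch of that alternative, assuming a free function named CompareBytes (a hypothetical name, not the one in string-value.inline.h). The guard keeps a zero or negative signed length from ever being converted to a huge size_t:

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

// Hypothetical helper illustrating the "len <= 0" guard.
// Compares the first 'len' bytes of s1 and s2; a zero or negative length
// means there is nothing to compare, so the result is 0 (equal).
inline int CompareBytes(const char* s1, const char* s2, int len) {
  assert(s1 != nullptr && s2 != nullptr);
  return (len <= 0) ? 0 : std::memcmp(s1, s2, static_cast<std::size_t>(len));
}
```

Because memcmp is only reached with a strictly positive length, the compiler can see that the size argument cannot overflow.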



--
To view, visit http://gerrit.cloudera.org:8080/18134
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibe6857b822925226d39fd4d6413457ef6bbaabec
Gerrit-Change-Number: 18134
Gerrit-PatchSet: 10
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:49:27 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 8:

Added benchmark results to commit message for patch 9


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 8
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:49:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11472: Reduce test dimensions for TestClientSsl

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18843 )

Change subject: IMPALA-11472: Reduce test dimensions for TestClientSsl
..


Patch Set 1: Code-Review+1

Jenkins error is in an unrelated test case.


--
To view, visit http://gerrit.cloudera.org:8080/18843
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d4a4792a37cba2231de8bfa2279ba029a05
Gerrit-Change-Number: 18843
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Mon, 15 Aug 2022 15:54:22 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Joe McDonnell (Code Review)
Hello Quanlong Huang, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/18810

to look at the new patch set (#2).

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..

IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

This adds some calls to df and du to track disk space
usage throughout the builds. This also cleans up the
Impala dev environment before creating the m2 archive.

Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
---
M bin/jenkins/build-all-flag-combinations.sh
1 file changed, 17 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/10/18810/2
--
To view, visit http://gerrit.cloudera.org:8080/18810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-11494: Don't always produce Ranger audit log for authorized query

2022-08-20 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18850 )

Change subject: IMPALA-11494: Don't always produce Ranger audit log for 
authorized query
..


Patch Set 1:

Hi all, please let me know if you have any comment on the patch. Thank you very 
much for the help!


--
To view, visit http://gerrit.cloudera.org:8080/18850
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I701652e457d3118f43249e83be933713b17ce48f
Gerrit-Change-Number: 18850
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Mon, 15 Aug 2022 16:39:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11472: Reduce test dimensions for TestClientSsl

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18843 )

Change subject: IMPALA-11472: Reduce test dimensions for TestClientSsl
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8447/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18843
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d4a4792a37cba2231de8bfa2279ba029a05
Gerrit-Change-Number: 18843
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Mon, 15 Aug 2022 16:46:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..


Patch Set 9:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11178/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 9
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Thu, 18 Aug 2022 04:14:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11498: Change port range of TEZ's web UI server

2022-08-20 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18851 )


Change subject: IMPALA-11498: Change port range of TEZ's web UI server
..

IMPALA-11498: Change port range of TEZ's web UI server

After TEZ-4347, by default TEZ would attempt to start a web UI server
before opening a session. The default port range for the server
specified in TezConfiguration.java (in the TEZ repository) is
"5-50050", which does not seem to be a good choice in Impala's
testing environment in that there are always some other client programs
holding those ports when TEZ attempts to start its web UI server. As a
result, TEZ could not bind a port in the port range to start its web UI
server, so the TEZ session was not created, which in turn failed some
end-to-end tests relying on TEZ, e.g., test_json_file_unsupported.

This patch explicitly sets the port range to "32000-32100" to avoid port
collision based on the parameter of 'ip_local_port_range', which is
"32768-60999" on the machine where Impala's tests are run.

Testing:
 - Verified in 2 Jenkins runs of exhaustive RELEASE build that we do not
   see test_json_file_unsupported fail again after this patch.
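
The port-range reasoning in this commit message can be checked with a small sketch. PortRange and Overlaps are hypothetical names, and the two ranges used in the usage note are the ones quoted in the message:

```cpp
#include <cassert>

// Hypothetical helper type: an inclusive range of TCP ports.
struct PortRange {
  int first;
  int last;
};

// Two inclusive ranges overlap when each one starts no later than the
// other one ends.
constexpr bool Overlaps(PortRange a, PortRange b) {
  return a.first <= b.last && b.first <= a.last;
}
```

Overlaps({32000, 32100}, {32768, 60999}) is false, so a server bound inside "32000-32100" cannot collide with a client socket that the kernel assigned from the ephemeral range "32768-60999".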

Change-Id: I3d067953f943a762ad50b0561570e96f0dc39791
---
M fe/src/test/resources/hive-site.xml.py
1 file changed, 1 insertion(+), 0 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/51/18851/1
--
To view, visit http://gerrit.cloudera.org:8080/18851
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I3d067953f943a762ad50b0561570e96f0dc39791
Gerrit-Change-Number: 18851
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 


[Impala-ASF-CR] IMPALA-6684: Fix untracked memory in KRPC

2022-08-20 Thread Omid Shahidi (Code Review)
Omid Shahidi has uploaded a new patch set (#9). ( 
http://gerrit.cloudera.org:8080/18798 )

Change subject: IMPALA-6684: Fix untracked memory in KRPC
..

IMPALA-6684: Fix untracked memory in KRPC

During serialization of a row batch header, a tuple_data_ is created
which holds the compressed tuple data for an outbound row batch.
We would like this tuple data to be trackable, as it is responsible for
a significant portion of the untracked memory from the KRPC data stream
sender. By using a free pool, we are able to allocate the tuple data and
compression scratch and account for them in the memory tracker of the
KrpcDataStreamSender. This solution creates a RAII class responsible
for memory allocation and changes the existing code to use a char buffer
pointed to by a char* tuple_data_ instead of the previously used
std::string tuple_data_. The thrift implementation is left unchanged and
the protobuf implementation is separated.
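
The idea of a scoped allocation that is accounted for in a memory tracker
can be sketched as follows. This is only a Python analogy of the RAII
pattern, not the C++ code in the patch; MemTracker and TrackedBuffer are
hypothetical names that do not correspond to Impala's actual classes.

```python
class MemTracker:
    """Hypothetical stand-in for a memory tracker: counts bytes in use."""

    def __init__(self):
        self.consumption = 0
        self.peak = 0

    def consume(self, num_bytes):
        self.consumption += num_bytes
        self.peak = max(self.peak, self.consumption)

    def release(self, num_bytes):
        self.consumption -= num_bytes


class TrackedBuffer:
    """Owns a byte buffer and accounts for it in a tracker while in scope."""

    def __init__(self, tracker, size):
        self._tracker = tracker
        self._size = size
        self.data = None

    def __enter__(self):
        # Account for the memory first, then allocate it.
        self._tracker.consume(self._size)
        self.data = bytearray(self._size)
        return self

    def __exit__(self, exc_type, exc, tb):
        # Drop the buffer and release the accounting, even on error.
        self.data = None
        self._tracker.release(self._size)
        return False


tracker = MemTracker()
with TrackedBuffer(tracker, 4096) as buf:
    buf.data[:5] = b"tuple"
    inside = tracker.consumption
print(inside, tracker.consumption, tracker.peak)  # 4096 0 4096
```

Tying the release to scope exit means the tracker cannot drift from the
real allocation when an error path returns early.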

Testing:
 - Passed core tests.
 - Ran a single node benchmark which shows no regression.
 - Updated row-batch-serialize-test and row-batch-serialize-benchmark to
   test the row-batch serialization used by KRPC.
 - Manually collected query-profile, heap growth, and memory usage log
   showing untracked memory decreased by 1/2.
 - Added an end-to-end test to verify the new counters in the runtime
   profile.

serialize:
Func                            10%ile  50%ile  90%ile  10%ile  50%ile  90%ile
                                                         (rel)   (rel)   (rel)
------------------------------------------------------------------------------
ser_no_dups_baseline              8.36     8.6     8.7      1X      1X      1X
ser_no_dups                       6.73    6.85    6.93  0.804X  0.796X  0.796X
ser_no_dups_full                  5.28    5.38    5.55  0.631X  0.625X  0.637X

ser_adjacent_dups_baseline        12.9    13.2    13.4      1X      1X      1X
ser_adjacent_dups                 23.2    23.7    24.1    1.8X    1.8X    1.8X
ser_adjacent_dups_full            19.9    20.3    20.7   1.54X   1.54X   1.55X

ser_dups_baseline                 9.17    9.54    9.72      1X      1X      1X
ser_dups                          7.45    7.69    7.86  0.812X  0.806X  0.809X
ser_dups_full                     14.6      15    15.3    1.6X   1.57X   1.57X

deserialize:
Func                            10%ile  50%ile  90%ile  10%ile  50%ile  90%ile
                                                         (rel)   (rel)   (rel)
------------------------------------------------------------------------------
deser_no_dups_baseline            32.6    33.5      34      1X      1X      1X
deser_no_dups                     32.5    33.1    33.7  0.999X   0.99X  0.992X

deser_adjacent_dups_baseline      53.1      54    54.7      1X      1X      1X
deser_adjacent_dups               80.3    81.6    82.5   1.51X   1.51X   1.51X

deser_dups_baseline               52.4      54    54.7      1X      1X      1X
deser_dups                        86.8    88.4    89.7   1.66X   1.64X   1.64X

Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
---
M be/src/benchmarks/row-batch-serialize-benchmark.cc
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
M be/src/runtime/row-batch-serialize-test.cc
M be/src/runtime/row-batch.cc
M be/src/runtime/row-batch.h
A be/src/runtime/row-batch.inline.h
A testdata/workloads/functional-query/queries/datastream-sender.test
A testdata/workloads/tpch/queries/datastream-sender.test
A tests/query_test/test_datastream_sender.py
10 files changed, 657 insertions(+), 173 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/98/18798/9
--
To view, visit http://gerrit.cloudera.org:8080/18798
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2ba2b907ce4f275a7a1fb8cf75453c7003eb4b82
Gerrit-Change-Number: 18798
Gerrit-PatchSet: 9
Gerrit-Owner: Omid Shahidi 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Kurt Deschler 
Gerrit-Reviewer: Omid Shahidi 
Gerrit-Reviewer: Riza Suminto 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num_values is 0

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11174/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:25:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18810/1/bin/jenkins/build-all-flag-combinations.sh
File bin/jenkins/build-all-flag-combinations.sh:

http://gerrit.cloudera.org:8080/#/c/18810/1/bin/jenkins/build-all-flag-combinations.sh@166
PS1, Line 166:   ./bin/clean.sh
> This won't remove toolchain. Should we remove it as well?
The bulk of the space usage is in the be/build directory, so I updated this to 
remove only that. We could remove the toolchain, but removing the be/build 
directory gives us plenty of headroom for the disk usage.



--
To view, visit http://gerrit.cloudera.org:8080/18810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:39:46 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num_values is 0

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8459/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 3
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:43:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11498: Change port range of TEZ's web UI server

2022-08-20 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/18851 )

Change subject: IMPALA-11498: Change port range of TEZ's web UI server
..

IMPALA-11498: Change port range of TEZ's web UI server

After TEZ-4347, TEZ by default attempts to start a web UI server
before opening a session. The default port range for the server,
specified in TezConfiguration.java (in the TEZ repository), is
"5-50050", which is a poor fit for Impala's testing environment
because there are always other client programs holding those ports
when TEZ attempts to start its web UI server. As a result, TEZ could
not bind a port in that range to start its web UI server, so the TEZ
session was not created, which in turn failed some end-to-end tests
relying on TEZ, e.g., test_json_file_unsupported.

This patch explicitly sets the port range to "32000-32100" to avoid
port collisions. The range was chosen to lie outside
'ip_local_port_range', which is "32768-60999" on the machine where
Impala's tests are run.

Testing:
 - Verified in 2 Jenkins runs of exhaustive RELEASE build that we do not
   see test_json_file_unsupported fail again after this patch.

Change-Id: I3d067953f943a762ad50b0561570e96f0dc39791
---
M fe/src/test/resources/hive-site.xml.py
1 file changed, 1 insertion(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/51/18851/2
--
To view, visit http://gerrit.cloudera.org:8080/18851
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I3d067953f943a762ad50b0561570e96f0dc39791
Gerrit-Change-Number: 18851
Gerrit-PatchSet: 2
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: John Sherman 
Gerrit-Reviewer: Riza Suminto 


[Impala-ASF-CR] IMPALA-11469: (Addendum) Initialize BackendConfig

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18840 )

Change subject: IMPALA-11469: (Addendum) Initialize BackendConfig
..


Patch Set 1: Code-Review+2

Thanks for fixing the test issue! I prefer this one to the other patch 
(https://gerrit.cloudera.org/c/18828/) since the FileSystemUtil changes in it 
are not related to the failure.

For more details about how BackendConfig is initialized in 
FrontendTestBase/AbstractFrontendTest, see the stacktrace here: 
https://gerrit.cloudera.org/c/18828/3/fe/src/test/java/org/apache/impala/catalog/FileMetadataLoaderTest.java#43


--
To view, visit http://gerrit.cloudera.org:8080/18840
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Iaefea5ac982e42a8b0f3b693d16a8db10cc7de31
Gerrit-Change-Number: 18840
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 06:53:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11500: Fix Impalad crashed in ParquetBoolDecoder::SkipValues when num_values is 0

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18854 )

Change subject: IMPALA-11500: Fix Impalad crashed in 
ParquetBoolDecoder::SkipValues when num_values is 0
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11173/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18854
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8c4c5a4dff9e9e75913c7b524b4ae70967febb37
Gerrit-Change-Number: 18854
Gerrit-PatchSet: 2
Gerrit-Owner: Anonymous Coward 
Gerrit-Reviewer: Anonymous Coward 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:18:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11281: Load table metadata for ResetMetadataStmt

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18561 )

Change subject: IMPALA-11281: Load table metadata for ResetMetadataStmt
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8449/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/18561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0c90b413974223886661697f11844d99a68fdebf
Gerrit-Change-Number: 18561
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Tue, 16 Aug 2022 12:37:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11281: Load table metadata for ResetMetadataStmt

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18561 )

Change subject: IMPALA-11281: Load table metadata for ResetMetadataStmt
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/18561
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I0c90b413974223886661697f11844d99a68fdebf
Gerrit-Change-Number: 18561
Gerrit-PatchSet: 4
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Tue, 16 Aug 2022 12:37:38 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/8460/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/18810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 06:03:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9442: (Addendum) Fix Ozone jar name for older Ozone

2022-08-20 Thread Michael Smith (Code Review)
Michael Smith has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/18852 )


Change subject: IMPALA-9442: (Addendum) Fix Ozone jar name for older Ozone
..

IMPALA-9442: (Addendum) Fix Ozone jar name for older Ozone

Ozone 1.2 changed the name of its hadoop adapter jar. The logic to link
against the jar in pom.xml was correctly updated, but the jar name used
in HADOOP_CLASSPATH assumed Ozone 1.2. This patch updates the jar in
HADOOP_CLASSPATH to use the correct jar for the detected Ozone version.

Change-Id: I7e401075268ebd81af8b8cac72504f0d1a4e59f1
---
M bin/impala-config.sh
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/52/18852/1
--
To view, visit http://gerrit.cloudera.org:8080/18852
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I7e401075268ebd81af8b8cac72504f0d1a4e59f1
Gerrit-Change-Number: 18852
Gerrit-PatchSet: 1
Gerrit-Owner: Michael Smith 


[Impala-ASF-CR] IMPALA-9999: Switch to GCC 10.4

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18134 )

Change subject: IMPALA-9999: Switch to GCC 10.4
..


Patch Set 11:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11177/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18134
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibe6857b822925226d39fd4d6413457ef6bbaabec
Gerrit-Change-Number: 18134
Gerrit-PatchSet: 11
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Michael Smith 
Gerrit-Comment-Date: Thu, 18 Aug 2022 04:02:11 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Quanlong Huang (Code Review)
Quanlong Huang has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 2: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/18810/1/bin/jenkins/build-all-flag-combinations.sh
File bin/jenkins/build-all-flag-combinations.sh:

http://gerrit.cloudera.org:8080/#/c/18810/1/bin/jenkins/build-all-flag-combinations.sh@166
PS1, Line 166: if [[ "$GENERATE_M2_ARCHIVE" == true ]]; then
> The bulk of the space usage is in the be/build directory, so I updated this
Ack



--
To view, visit http://gerrit.cloudera.org:8080/18810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 06:03:04 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-11471: Track disk usage for build-all-flag-combinations.sh

2022-08-20 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/18810 )

Change subject: IMPALA-11471: Track disk usage for 
build-all-flag-combinations.sh
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/11176/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/18810
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8ab31d8d7096b49d8404edf7521d46f23155526f
Gerrit-Change-Number: 18810
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Thu, 18 Aug 2022 03:59:29 +
Gerrit-HasComments: No

