[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7330/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 3
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 01 Oct 2020 03:35:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6494/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 4
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 01 Oct 2020 03:22:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 4: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 4
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 01 Oct 2020 03:22:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7329/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 2
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 01 Oct 2020 03:22:10 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 3
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Thu, 01 Oct 2020 03:22:16 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread fifteencai (Code Review)
fifteencai has uploaded a new patch set (#3). ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..

IMPALA-10193: Limit the memory usage for the whole test cluster

This patch introduces a new approach of limiting the memory usage
for both mini-cluster and CDH cluster.

Without this limit, clusters are prone to getting killed when running
in docker containers with a lower mem limit than host's memory size.
i.e. The mini-cluster may running in a
container with 32GB limitted by CGROUPS, while the host machine has
128GB. Under this circumstance, if the container is started with
'-privileged' command argument, both mini and CDH clusters compute
their mem_limit according to 128GB rather than 32GB. They will be
killed when attempting to apply for extra resource.

Currently, the mem-limit estimating algorithms for Impalad and Node
Manager are different:

for Impalad:  mem_limit = 0.7 * sys_mem / cluster_size (default is 3)

for Node Manager:
1. Leave aside 24GB, then fit the left into threasholds below.
2. The bare limit is 4GB and maximum limit 48GB

In headge of over-consumption, we

- Added a new environment variable IMPALA_CLUSTER_MAX_MEM_GB
- Modified the algorithm in 'bin/start-impala-cluster.py', making it
  taking IMPALA_CLUSTER_MAX_MEM_GB rather than sys_mem into account.
- Modified the logic in
 'testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py'
  Similarly, making IMPALA_CLUSTER_MAX_MEM_GB substitutes for sys_mem .

Testing: this patch worked in a 32GB docker container running on a 128GB
 host machine. All 1188 unit tests get passed.

Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
---
M bin/impala-config.sh
M bin/start-impala-cluster.py
M testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py
3 files changed, 9 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/16522/3
--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 3
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread fifteencai (Code Review)
fifteencai has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..

IMPALA-10193: Limit the memory usage for the whole test cluster

This patch introduces a new approach of limiting the memory usage
for both mini-cluster and CDH cluster. Without this limit, clusters
are prone to getting killed in containers whose mem limit are lower
than the host's memory size. i.e. The mini-cluster may running in a
container with 32GB limitted by CGROUPS, while the host machine has
128GB. Under this circumstance, if the container is started with
'-privileged' command argument, both mini and CDH clusters compute
their mem_limit according to 128GB rather than 32GB. They will be
killed when attempting to apply for extra memories.

Currently, the mem-limit estimating algorithms for Impalad and Node
Manager are different:

for Impalad:  mem_limit = 0.7 * sys_mem / cluster_size (default is 3)

for Node Manager:
1. Leave aside 24GB, then fit the left memory into threasholds.
2. The bare limit is 4GB and maximum limit 48GB

In headge of over-consumption, we

- Added a new environment variable IMPALA_CLUSTER_MAX_MEM_GB
- Modified the algorithm in 'bin/start-impala-cluster.py', making it
  taking IMPALA_CLUSTER_MAX_MEM_GB rather than sys_mem into account.
- Modified the logic in
 'testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py'
  Similarly, making IMPALA_CLUSTER_MAX_MEM_GB substitutes for sys_mem .

Testing: this patch worked in a 32GB docker container running on a 128GB
 host machine. All 1188 unit tests get passed.

Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
---
M bin/impala-config.sh
M bin/start-impala-cluster.py
M testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py
3 files changed, 10 insertions(+), 3 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/16522/2
--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 2
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test:

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test@278
PS3, Line 278: select * from
 :   functional.alltypestiny a
 : where a.id in (select id from functional.alltypestiny limit 5) 
limit 5
> Normally, the error becomes larger with more number of tables joined as wel
Not sure I follow. Can you file a follow up JIRA since this has already been 
merged?



--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 22:40:54 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9180 (part 1): Remove legacy ImpalaInternalService

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16291 )

Change subject: IMPALA-9180 (part 1): Remove legacy ImpalaInternalService
..

IMPALA-9180 (part 1): Remove legacy ImpalaInternalService

The legacy Thrift based Impala internal service has been deprecated
and can be removed now.

This patch removes ImpalaInternalService. All infrastructures around it
are cleaned up, except one place for flag be_port.
StatestoreSubscriber::subscriber_id consists be_port, but we cannot
change format of subscriber_id now. This remaining be_port issue will be
fixed in a succeeding patch (part 4).
TQueryCtx.coord_address is changed to TQueryCtx.coord_hostname since the
port in TQueryCtx.coord_address is set as be_port and is unused now.
Also Rename TQueryCtx.coord_krpc_address as TQueryCtx.coord_ip_address.

Testing:
 - Passed the exhaustive test.
 - Passed Quasar-L0 test.

Change-Id: I5fa83c8009590124dded4783f77ef70fa30119e6
Reviewed-on: http://gerrit.cloudera.org:8080/16291
Reviewed-by: Thomas Tauber-Marshall 
Tested-by: Impala Public Jenkins 
---
M be/generated-sources/gen-cpp/CMakeLists.txt
M be/src/benchmarks/expr-benchmark.cc
M be/src/exprs/expr-test.cc
M be/src/exprs/utility-functions-ir.cc
M be/src/rpc/impala-service-pool.cc
M be/src/rpc/impala-service-pool.h
M be/src/rpc/rpc-mgr.cc
M be/src/rpc/rpc-mgr.h
M be/src/rpc/thrift-server-test.cc
D be/src/runtime/backend-client.h
M be/src/runtime/client-cache-types.h
M be/src/runtime/coordinator-backend-state.cc
M be/src/runtime/data-stream-test.cc
M be/src/runtime/exec-env.cc
M be/src/runtime/exec-env.h
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/fragment-instance-state.h
M be/src/runtime/initial-reservations.cc
M be/src/runtime/query-exec-mgr.cc
M be/src/runtime/query-state.cc
M be/src/runtime/runtime-filter-bank.cc
M be/src/runtime/test-env.cc
M be/src/scheduling/scheduler-test-util.h
M be/src/service/CMakeLists.txt
M be/src/service/client-request-state.cc
M be/src/service/control-service.cc
D be/src/service/impala-internal-service.cc
D be/src/service/impala-internal-service.h
M be/src/service/impala-server.cc
M be/src/service/impala-server.h
M be/src/service/impalad-main.cc
M be/src/service/session-expiry-test.cc
M be/src/testutil/in-process-servers.cc
M be/src/testutil/in-process-servers.h
M be/src/util/debug-util.cc
M bin/generate_minidump_collection_testdata.py
M common/thrift/ImpalaInternalService.thrift
M tests/custom_cluster/test_blacklist.py
M tests/custom_cluster/test_process_failures.py
M tests/custom_cluster/test_query_retries.py
M tests/custom_cluster/test_restart_services.py
M tests/webserver/test_web_pages.py
42 files changed, 130 insertions(+), 351 deletions(-)

Approvals:
  Thomas Tauber-Marshall: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/16291
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I5fa83c8009590124dded4783f77ef70fa30119e6
Gerrit-Change-Number: 16291
Gerrit-PatchSet: 15
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-9180 (part 1): Remove legacy ImpalaInternalService

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16291 )

Change subject: IMPALA-9180 (part 1): Remove legacy ImpalaInternalService
..


Patch Set 14: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16291
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5fa83c8009590124dded4783f77ef70fa30119e6
Gerrit-Change-Number: 16291
Gerrit-PatchSet: 14
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 22:40:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..

IMPALA-3335: Allow single-node optimization with joins

Enable the single-node optimization for queries with joins.

Testing:
* Ran exhaustive tests
* Looped TPC-DS overnight with NUM_NODES=1 against an impalad
  mini-cluster with a single dedicated coordinator

Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Reviewed-on: http://gerrit.cloudera.org:8080/16521
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/util/MaxRowsProcessedVisitor.java
M testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
3 files changed, 21 insertions(+), 11 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 4
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 22:36:05 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7328/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 6
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 22:00:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9867: Add Support for Spilling to S3: Milestone 1

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16318 )

Change subject: IMPALA-9867: Add Support for Spilling to S3: Milestone 1
..


Patch Set 13:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/hdfs-fs-cache.cc
File be/src/runtime/hdfs-fs-cache.cc:

http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/hdfs-fs-cache.cc@103
PS13, Line 103:   if (options != nullptr && !options->empty()) {
  : for (auto option : *options) {
  :   hdfsBuilderConfSetStr(
  :   hdfs_builder, option.first.c_str(), 
option.second.c_str());
  : }
  :   }
does this actually work? were you able to confirm that non-default configs set 
this way are actually picked up by the returned hdfsFS object?

according to the comment above, hdfsBuilderSetForceNewInstance has to be called 
in order for hdfsBuilderConfSetStr to have an affect.


http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/disk-io-mgr.h
File be/src/runtime/io/disk-io-mgr.h:

http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/disk-io-mgr.h@370
PS13, Line 370: REMOTE_DFS_DISK_FILE_OPER_OFFSET,
  : REMOTE_S3_DISK_FILE_OPER_OFFSET,
you cover this a bit in the commit message, but is the reason they were added 
to de-couple HDFS/S3 scans / writes from HDFS/S3 tmp-file-mgr operations? I 
guess that benefit is isolate between operations that spill vs. operations that 
read / write data on behalf of queries?

it would be good to add some documentation here explaining why these two queues 
are necessary and when to, for example, use REMOTE_S3_DISK_FILE_OPER_OFFSET vs. 
REMOTE_S3_DISK_OFFSET


http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/disk-io-mgr.cc
File be/src/runtime/io/disk-io-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/disk-io-mgr.cc@30
PS13, Line 30: #include "runtime/tmp-file-mgr-internal.h"
Why are we including the tmp-file-mgr here? I see a lot of references to the 
tmp-file-mgr, but ideally I would think the disk-io-mgr and tmp-file-mgr would 
remain separate systems. I see a lot of code referring to tmp_files in this 
class as well, which I concerning.

I would think that the disk-io-mgr should just provide an interface to the 
tmp-file-mgr.


http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/disk-io-mgr.cc@241
PS13, Line 241: goto end;
I think we generally try to avoid 'goto' statements because they make the 
control flow harder to follow. in this case the 'goto end' is only used once so 
I don't think it is necessary.


http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/request-ranges.h
File be/src/runtime/io/request-ranges.h:

http://gerrit.cloudera.org:8080/#/c/16318/13/be/src/runtime/io/request-ranges.h@113
PS13, Line 113: READ,
  : WRITE,
  : FETCH,
  : UPLOAD,
not sure I understand the difference between all of these. fetch seems like it 
would be the same thing as read, and upload seems like the same thing as write.



-- 
To view, visit http://gerrit.cloudera.org:8080/16318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I419b1d5dbbfe35334d9f964c4b65e553579fdc89
Gerrit-Change-Number: 16318
Gerrit-PatchSet: 13
Gerrit-Owner: Yida Wu 
Gerrit-Reviewer: Abhishek Rawat 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Yida Wu 
Gerrit-Comment-Date: Wed, 30 Sep 2020 21:55:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..

IMPALA-9180 (part 2): Refactor executor_list_ map of ExecuterBlacklist

In current ExecuterBlacklist class, we've keyed maps on a
TNetworkAddress of a backend. To simply the logic for the class,
changes it to key off of the UniqueIdPB backend-id, eg. refactor
'executor_list_' to no longer be a map list>
and instead makes it a map.
Also fixes a minor bug with the calculation of elapsed time when
a backend that was on probation is re-blacklisted.

Testing:
 - Passed test_blacklist.py and test_query_retries.py.
 - Passed exhaustive tests.

Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
---
M be/src/scheduling/executor-blacklist.cc
M be/src/scheduling/executor-blacklist.h
2 files changed, 68 insertions(+), 103 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/06/16506/6
--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 6
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 5:

(4 comments)

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h
File be/src/scheduling/executor-blacklist.h:

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h@136
PS5, Line 136: executor
> nit. executor->'executor entry'
done


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h@137
PS5, Line 137: blacklisted
 :   /// and ones that are on probation
> nit: either blacklisted or on probation.
done


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc
File be/src/scheduling/executor-blacklist.cc:

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc@51
PS5, Line 51:   int64_t elapsed = MonotonicMillis() - entry.blacklist_time_ms;
:   entry.state = State::BLACKLISTED;
:   entry.blacklist_time_ms = MonotonicMillis();
:   entry.cause = cause;
> Should MonotonicMillis() be called only once? That is, in a single time poi
done


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc@164
PS5, Line 164: "on probation")
> Should we also check on NOT_BLACKLISTED here?
No, don't need.



--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 21:36:09 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9930 (part 2): Introduce new admission control rpc service

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16412 )

Change subject: IMPALA-9930 (part 2): Introduce new admission control rpc 
service
..


Patch Set 5:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.cc
File be/src/scheduling/remote-admission-control-client.cc:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.cc@86
PS5, Line 86: KUDU_RETURN_IF_ERROR(
: proxy->AdmitQuery(req, &resp, &rpc_controller), 
"AdmitQuery rpc failed");
: Status admit_status(resp.status());
: RETURN_IF_ERROR(admit_status);
> is it necessary to hold the lock while making the RPC to Admit the query? t
nvm, seems there is a separate client per query.



--
To view, visit http://gerrit.cloudera.org:8080/16412
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I594fc593a27b24b6952e381a9bc1a9a5c6b757ae
Gerrit-Change-Number: 16412
Gerrit-PatchSet: 5
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Wed, 30 Sep 2020 21:14:00 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 30 Sep 2020 21:01:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..

IMPALA-10143: TestAcid.test_full_acid_original_files is flaky

One of the test queries depended on the values of the row__id.rowid
field. In rare cases the files are written differently by Hive which
leads to different 'rowid' values for the rows.

I took out the test for the 'rowid' values in that particular query.

We have other tests for the 'rowid' field on static data files (from
'testdata/data'), therefore we still have coverage for that and they
shouldn't be flaky.

Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Reviewed-on: http://gerrit.cloudera.org:8080/16523
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M 
testdata/workloads/functional-query/queries/QueryTest/full-acid-original-file.test
1 file changed, 7 insertions(+), 7 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 21: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6490/


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 21
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 19:42:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10192: Filter out redundant AuthzAuditEvent's for column masking

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16524 )

Change subject: IMPALA-10192: Filter out redundant AuthzAuditEvent's for column 
masking
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7327/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I1dbf65874003523b5176680e42f26fa2114c229b
Gerrit-Change-Number: 16524
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Comment-Date: Wed, 30 Sep 2020 19:41:46 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9930 (part 2): Introduce new admission control rpc service

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16412 )

Change subject: IMPALA-9930 (part 2): Introduce new admission control rpc 
service
..


Patch Set 5:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h
File be/src/scheduling/admission-control-service.h:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h@78
PS5, Line 78: UniqueIdPB query_id;
: UniqueIdPB coord_id;
> can these both be const ref?
nvm, I guess they need to be copied since they come from the AdmitQueryRequestPB



--
To view, visit http://gerrit.cloudera.org:8080/16412
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I594fc593a27b24b6952e381a9bc1a9a5c6b757ae
Gerrit-Change-Number: 16412
Gerrit-PatchSet: 5
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Wed, 30 Sep 2020 19:34:36 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9930 (part 2): Introduce new admission control rpc service

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16412 )

Change subject: IMPALA-9930 (part 2): Introduce new admission control rpc 
service
..


Patch Set 5:

(22 comments)

still going through it all, but my comments so far.

http://gerrit.cloudera.org:8080/#/c/16412/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/16412/5//COMMIT_MSG@12
PS5, Line 12: This patch adds some simple configuration flags that make it 
possible
not sure I understand the plan here. this patch adds flags is 
--is_admission_controller to exec-env, which if set will expose the 
AdmissionControllerService. what about this will change exactly in IMPALA-9975?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h
File be/src/scheduling/admission-control-service.h:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h@76
PS5, Line 76: AdmitQueryRequstPB
nit: typo


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h@78
PS5, Line 78: UniqueIdPB query_id;
: UniqueIdPB coord_id;
can these both be const ref?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h@105
PS5, Line 105: RuntimeProfile* summary_profile;
might be worth mentioning that this is passed to 
AdmissionController::SubmitForAdmission, which is responsible for updating it.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.h@118
PS5, Line 118: fo
nit: typo


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc
File be/src/scheduling/admission-control-service.cc:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@85
PS5, Line 85: Adqmission
nit: typo


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@142
PS5, Line 142:   VLOG(1) << "GetQueryStatus " << req->query_id();
might be too verbose to log.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@155
PS5, Line 155: 100
why wait at all? won't waiting tie up one of the RPC threads? I think the 
client already waits 100 ms.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@159
PS5, Line 159:   DCHECK(query_info->admit_status.ok());
is this expected? do want to DCHECK if the admit_status is not Status::OK()? if 
AC rejects the query due to cancellation, won't it returned a Status::CANCELLED?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@197
PS5, Line 197: lock_guard l(query_info->lock);
why does the lock need to be acquired here?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@209
PS5, Line 209:   VLOG(1) << "ReleaseQueryBackends: query_id=" << 
req->query_id();
might be too verbose to log


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@219
PS5, Line 219: lock_guard l(query_info->lock);
why does the lock need to be acquired here?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-control-service.cc@249
PS5, Line 249: {query_info->query_id, query_info->coord_id, 
query_info->query_exec_request,
 : query_info->query_options, 
query_info->summary_profile,
 : query_info->blacklisted_executor_addresses},
might make the code easier to read if this just created a AdmissionRequest and 
then passed it in.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-controller.h
File be/src/scheduling/admission-controller.h:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-controller.h@316
PS5, Line 316: (usually they
 :   /// are owned by the ClientRequestState).
might need to revise this now.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/admission-controller.h@345
PS5, Line 345:   Status WaitOnQueued(const UniqueIdPB& query_id,
not clear to me what the returned Status is suppose to indicate.


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.h
File be/src/scheduling/remote-admission-control-client.h:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.h@53
PS5, Line 53:   std::mutex lock_;
what is the lock used to protect?


http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.cc
File be/src/scheduling/remote-admission-control-client.cc:

http://gerrit.cloudera.org:8080/#/c/16412/5/be/src/scheduling/remote-admission-control-client.cc@63
PS5, Line 63:   KrpcSerializer serializer;
:   int sidecar_idx1;
:   RETURN_IF_ERROR(
:   serializer

[Impala-ASF-CR] IMPALA-10192: Filter out redundant AuthzAuditEvent's for column masking

2020-09-30 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16524


Change subject: IMPALA-10192: Filter out redundant AuthzAuditEvent's for column 
masking
..

IMPALA-10192: Filter out redundant AuthzAuditEvent's for column masking

We found that Ranger would generate an AuthzAuditEvent as long as
there exists a column masking policy corresponding to the column
even though the policy does not apply to the requesting user. This
resulted in an IllegalStateException if a user "A" submits a SELECT
query against a table that has a column specified in a column masking
policy when the policy does not apply to "A", i.e., the field of
'Select User' for this policy in the Ranger web UI does not contain "A".
For such an AuthzAuditEvent, its field of 'accessType' will not be one
of the supported mask types since its corresponding
accessResult.isMaskEnabled() would evaluates to false, indicating that
there is no matching column masking policy associated with the user "A"
and thus the AuthzAuditEvent will not be post-processed by Impala in
RangerAuthorizationCheker#createColumnMask(). But since we did not
filter out such an AuthzAuditEvent when it was generated and returned
from RangerBasePlugin#evalDataMaskPolicies(), we failed the check that
requires every AuthzAuditEvent be column masking-related in
RangerAuthorizationContext#stashAuditEvents().

To address this issue, in this patch we filter out such an
AuthzAuditEvent after each call to
RangerBasePlugin#evalDataMaskPolicies() so that no redundant
AuthzAuditEvent is generated. Furthermore, we also revise all the checks
for the generated AuthzAuditEvent's due to the evaluation of column
masking policies so that a failed check would not fail the query but
only result in an entry in the log file.

Testing:
 - Added a new column masking policy associated with a non-matching user
   in RangerAuditLogTest#testAuditsForColumnMasking() to verify that
   the redundant AuthzAuditEvent is removed.
 - Verified that the patch passes the exhaustive tests in the DEBUG
   build.

Change-Id: I1dbf65874003523b5176680e42f26fa2114c229b
---
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationChecker.java
M 
fe/src/main/java/org/apache/impala/authorization/ranger/RangerAuthorizationContext.java
M 
fe/src/test/java/org/apache/impala/authorization/ranger/RangerAuditLogTest.java
3 files changed, 53 insertions(+), 12 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/24/16524/1
--
To view, visit http://gerrit.cloudera.org:8080/16524
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I1dbf65874003523b5176680e42f26fa2114c229b
Gerrit-Change-Number: 16524
Gerrit-PatchSet: 1
Gerrit-Owner: Fang-Yu Rao 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Quanlong Huang 


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 5:

(4 comments)

Sorry I was not able to finish the review before Thomas' +2.

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h
File be/src/scheduling/executor-blacklist.h:

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h@136
PS5, Line 136: executor
nit. executor->'executor entry'


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.h@137
PS5, Line 137: blacklisted
 :   /// and ones that are on probation
nit: either blacklisted or on probation.


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc
File be/src/scheduling/executor-blacklist.cc:

http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc@51
PS5, Line 51:   int64_t elapsed = MonotonicMillis() - entry.blacklist_time_ms;
:   entry.state = State::BLACKLISTED;
:   entry.blacklist_time_ms = MonotonicMillis();
:   entry.cause = cause;
Should MonotonicMillis() be called only once? That is, in a single time point, 
we update the entry. I think it would make things clear logically.


http://gerrit.cloudera.org:8080/#/c/16506/5/be/src/scheduling/executor-blacklist.cc@164
PS5, Line 164: "on probation")
Should we also check on NOT_BLACKLISTED here?



--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:51:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test:

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test@278
PS3, Line 278: select * from
 :   functional.alltypestiny a
 : where a.id in (select id from functional.alltypestiny limit 5) 
limit 5
> Right. The concern was whether the estimate is good enough for the joins. W
Normally, the error becomes larger with more number of tables joined as well as 
the complexity of the join predicates.



--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:33:06 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 1:

(3 comments)

Thanks for looking at this, it should help a lot with docker development. I 
have a couple of requests to change the behaviour but I think this is a 
starting point.

http://gerrit.cloudera.org:8080/#/c/16522/1/bin/impala-config.sh
File bin/impala-config.sh:

http://gerrit.cloudera.org:8080/#/c/16522/1/bin/impala-config.sh@115
PS1, Line 115: # Maximum memory available for mini-cluster and CDH cluster
I think we should call this IMPALA_CLUSTER_MAX_MEM_GB


http://gerrit.cloudera.org:8080/#/c/16522/1/bin/impala-config.sh@116
PS1, Line 116: export MAX_MEM_GB=28
We don't want to change the default here. Can you make the default the empty 
string, then fall back to auto-detecting when it's not set.


http://gerrit.cloudera.org:8080/#/c/16522/1/bin/start-impala-cluster.py
File bin/start-impala-cluster.py:

http://gerrit.cloudera.org:8080/#/c/16522/1/bin/start-impala-cluster.py@574
PS1, Line 574: print("mm " + str(mem_limit))
Delete this print or replace with LOG.debug.



--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 1
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:30:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9180 (part 1): Remove legacy ImpalaInternalService

2020-09-30 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16291 )

Change subject: IMPALA-9180 (part 1): Remove legacy ImpalaInternalService
..


Patch Set 14: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16291
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5fa83c8009590124dded4783f77ef70fa30119e6
Gerrit-Change-Number: 16291
Gerrit-PatchSet: 14
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:21:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 5: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:22:31 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10007: Impala development environment does not support Ubuntu 20.04

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16241 )

Change subject: IMPALA-10007: Impala development environment does not support 
Ubuntu 20.04
..


Patch Set 8:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7326/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16241
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7db302b4f1d57ec9aa2100d7589d5e814db75947
Gerrit-Change-Number: 16241
Gerrit-PatchSet: 8
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:20:48 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 5:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7325/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:20:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test:

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test@278
PS3, Line 278: select * from
 :   functional.alltypestiny a
 : where a.id in (select id from functional.alltypestiny limit 5) 
limit 5
> would the row count estimate change after applying the small-query optimiza
Right. The concern was whether the estimate is good enough for the joins. With 
the change, any such estimation error would be exposed.



--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:15:26 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test:

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test@278
PS3, Line 278: select * from
 :   functional.alltypestiny a
 : where a.id in (select id from functional.alltypestiny limit 5) 
limit 5
> May need to add other join queries to make sure the row count is estimated
would the row count estimate change after applying the small-query 
optimization? my understanding is that this optimization is applied after 
single node planning which would be responsible for estimating row counts, 
right?



--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 18:09:05 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10007: Impala development environment does not support Ubuntu 20.04

2020-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has uploaded a new patch set (#8). ( 
http://gerrit.cloudera.org:8080/16241 )

Change subject: IMPALA-10007: Impala development environment does not support 
Ubuntu 20.04
..

IMPALA-10007: Impala development environment does not support
Ubuntu 20.04

This is a minor amendment to a previously merged change with
ChangeId I4f592f60881fd8f34e2bf393a76f5a921505010a, to address
additional review comments. In particular, the original commit
referred to Ubuntu 20.4 whereas it should have used Ubuntu 20.04.

Change-Id: I7db302b4f1d57ec9aa2100d7589d5e814db75947
---
M bin/bootstrap_system.sh
M bin/bootstrap_toolchain.py
2 files changed, 3 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/41/16241/8
--
To view, visit http://gerrit.cloudera.org:8080/16241
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7db302b4f1d57ec9aa2100d7589d5e814db75947
Gerrit-Change-Number: 16241
Gerrit-PatchSet: 8
Gerrit-Owner: Qifan Chen 
Gerrit-Reviewer: Aman Sinha 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Wenzhe Zhou (Code Review)
Wenzhe Zhou has uploaded a new patch set (#5). ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..

IMPALA-9180 (part 2): Refactor executor_list_ map of ExecuterBlacklist

In current ExecuterBlacklist class, we've keyed maps on a
TNetworkAddress of a backend. To simply the logic for the class,
changes it to key off of the UniqueIdPB backend-id, eg. refactor
'executor_list_' to no longer be a map list>
and instead makes it a map.
Also fixes a minor bug with the calculation of elapsed time when
a backend that was on probation is re-blacklisted.

Testing:
 - Passed test_blacklist.py and test_query_retries.py.
 - Passed exhaustive tests.

Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
---
M be/src/scheduling/executor-blacklist.cc
M be/src/scheduling/executor-blacklist.h
2 files changed, 67 insertions(+), 103 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/06/16506/5
--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 5
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 


[Impala-ASF-CR] IMPALA-9711: incrementally update aggregate profile

2020-09-30 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15931 )

Change subject: IMPALA-9711: incrementally update aggregate profile
..


Patch Set 10:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15931/10/be/src/runtime/coordinator-backend-state.cc
File be/src/runtime/coordinator-backend-state.cc:

http://gerrit.cloudera.org:8080/#/c/15931/10/be/src/runtime/coordinator-backend-state.cc@539
PS10, Line 539: if (instance_stats.exec_stats_up_to_date_) continue; // 
Already applied.
I'm thinking through this check and wondering if it interacts with 
finalize/done_.

For V1, if this is true, then finalize = true or done_ = true, because that is 
the only time we call UpdateExecStats(). That sounds right to me, because if we 
called UpdateExecStats() with either of those set, then the completion time is 
set.

For V2, we call UpdateExecStats() even when done_=false and finalize=false, so 
exec_stats_up_to_date_ can be true even though we haven't set the completion 
time. I think for done_ that is ok, because to transition from done_=false to 
done_=true involves a new status update. For finalize, there may not be a new 
status update, so I think we might not update the completion time correctly.



--
To view, visit http://gerrit.cloudera.org:8080/15931
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib03e79a40a33d8e74464640ae5f95a1467a6713a
Gerrit-Change-Number: 15931
Gerrit-PatchSet: 10
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:37:43 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9180 (part 2): Refactor executor list map of ExecuterBlacklist

2020-09-30 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16506 )

Change subject: IMPALA-9180 (part 2): Refactor executor_list_ map of 
ExecuterBlacklist
..


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16506/4/be/src/scheduling/executor-blacklist.h
File be/src/scheduling/executor-blacklist.h:

http://gerrit.cloudera.org:8080/#/c/16506/4/be/src/scheduling/executor-blacklist.h@139
PS4, Line 139: std::hash
nit: I think this is unnecessary



--
To view, visit http://gerrit.cloudera.org:8080/16506
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib1ae082d0e080088756af91b5b770752ca8b3aa1
Gerrit-Change-Number: 16506
Gerrit-PatchSet: 4
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:28:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Qifan Chen (Code Review)
Qifan Chen has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

(1 comment)

Looks good!

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
File 
testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test:

http://gerrit.cloudera.org:8080/#/c/16521/3/testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test@278
PS3, Line 278: select * from
 :   functional.alltypestiny a
 : where a.id in (select id from functional.alltypestiny limit 5) 
limit 5
May need to add other join queries to make sure the row count is estimated 
correctly.

1. cross joins
2. left/right joins



--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Qifan Chen 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:28:07 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6493/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:14:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 3
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:14:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9180 (part 1): Remove legacy ImpalaInternalService

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16291 )

Change subject: IMPALA-9180 (part 1): Remove legacy ImpalaInternalService
..


Patch Set 14:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6492/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16291
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I5fa83c8009590124dded4783f77ef70fa30119e6
Gerrit-Change-Number: 16291
Gerrit-PatchSet: 14
Gerrit-Owner: Wenzhe Zhou 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Reviewer: Wenzhe Zhou 
Gerrit-Comment-Date: Wed, 30 Sep 2020 17:13:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 16:31:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6491/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 30 Sep 2020 15:35:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 30 Sep 2020 15:35:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 1
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 30 Sep 2020 15:30:44 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16521 )

Change subject: IMPALA-3335: Allow single-node optimization with joins
..


Patch Set 2:

> The code change looks good, i think looping some queries with joins
 > and num_nodes=1 would be enough for me to get to a +2

Looped TPC-DS overnight with NUM_NODES=1 against an impalad mini-cluster with a 
single dedicated coordinator


--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 15:19:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3335: Allow single-node optimization with joins

2020-09-30 Thread Sahil Takiar (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16521

to look at the new patch set (#2).

Change subject: IMPALA-3335: Allow single-node optimization with joins
..

IMPALA-3335: Allow single-node optimization with joins

Enable the single-node optimization for queries with joins.

Testing:
* Ran exhaustive tests
* Looped TPC-DS overnight with NUM_NODES=1 against an impalad
  mini-cluster with a single dedicated coordinator

Change-Id: I6b189271630214960ed482cb2b552fba9f246770
---
M fe/src/main/java/org/apache/impala/planner/Planner.java
M fe/src/main/java/org/apache/impala/util/MaxRowsProcessedVisitor.java
M testdata/workloads/functional-planner/queries/PlannerTest/small-query-opt.test
3 files changed, 21 insertions(+), 11 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/21/16521/2
--
To view, visit http://gerrit.cloudera.org:8080/16521
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I6b189271630214960ed482cb2b552fba9f246770
Gerrit-Change-Number: 16521
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-10184: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, wangsheng, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16512

to look at the new patch set (#5).

Change subject: IMPALA-10184: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..

IMPALA-10184: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

A SHOW CREATE TABLE output didn't contain the PARTITION BY SPEC section
for partitioned Iceberg tables. This patch addresses this shortcoming.

Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
---
M common/thrift/CatalogObjects.thrift
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeFileStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionSpec.java
M fe/src/main/java/org/apache/impala/analysis/ShowStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalIcebergTable.java
M testdata/workloads/functional-query/queries/QueryTest/show-create-table.test
11 files changed, 173 insertions(+), 32 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/12/16512/5
--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 5
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16523 )

Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7324/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 1
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Wed, 30 Sep 2020 14:31:04 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 21: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 21
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 14:10:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 20:

Failed again with IMPALA-10143 so I uploaded a fix for that issue: 
https://gerrit.cloudera.org/#/c/16523/


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 20
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 14:10:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 21:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6490/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 21
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 14:10:57 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10143: TestAcid.test full acid original files is flaky

2020-09-30 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16523


Change subject: IMPALA-10143: TestAcid.test_full_acid_original_files is flaky
..

IMPALA-10143: TestAcid.test_full_acid_original_files is flaky

One of the test queries depended on the values of the row__id.rowid
field. In rare cases the files are written differently by Hive which
leads to different 'rowid' values for the rows.

I took out the test for the 'rowid' values in that particular query.

We have other tests for the 'rowid' field on static data files (from
'testdata/data'), therefore we still have coverage for that and they
shouldn't be flaky.

Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
---
M 
testdata/workloads/functional-query/queries/QueryTest/full-acid-original-file.test
1 file changed, 7 insertions(+), 7 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/23/16523/1
--
To view, visit http://gerrit.cloudera.org:8080/16523
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I3d36bd23b8d3cc257bad9a83a4462f20e073d437
Gerrit-Change-Number: 16523
Gerrit-PatchSet: 1
Gerrit-Owner: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-10062: TestCompressedNonText.test insensitivity to extension can fail due to wrong filename

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16518 )

Change subject: IMPALA-10062: 
TestCompressedNonText.test_insensitivity_to_extension can fail due to wrong 
filename
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/16518
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I265faf8d2e7f4251b18264052eededbeb2296f57
Gerrit-Change-Number: 16518
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 13:57:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10062: TestCompressedNonText.test insensitivity to extension can fail due to wrong filename

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/16518 )

Change subject: IMPALA-10062: 
TestCompressedNonText.test_insensitivity_to_extension can fail due to wrong 
filename
..

IMPALA-10062: TestCompressedNonText.test_insensitivity_to_extension can fail 
due to wrong filename

Some tests in test_compressed_formats.py hard-coded the filename
of the tables. They used "00_0" for filename. The number after the
underscore is the "attempt id" which can be non-zero if there were
failed attempts during file writing.

I modified the test to do a filesystem listing to retrieve the
filename.

Testing
* I manually renamed one of my files to 00_1 and re-run the test.

Change-Id: I265faf8d2e7f4251b18264052eededbeb2296f57
Reviewed-on: http://gerrit.cloudera.org:8080/16518
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M tests/query_test/test_compressed_formats.py
1 file changed, 6 insertions(+), 6 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/16518
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I265faf8d2e7f4251b18264052eededbeb2296f57
Gerrit-Change-Number: 16518
Gerrit-PatchSet: 3
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 20: Verified-1

Build failed: https://jenkins.impala.io/job/gerrit-verify-dryrun/6488/


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 20
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 13:45:50 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16512 )

Change subject: IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7323/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 4
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 12:14:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16512 )

Change subject: IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7322/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 3
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 12:11:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Gabor Kaszab (Code Review)
Gabor Kaszab has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16512 )

Change subject: IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..


Patch Set 4:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/16512/1/fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java
File fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java:

http://gerrit.cloudera.org:8080/#/c/16512/1/fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java@79
PS1, Line 79:   public String toSql(ToSqlOptions options) {
: StringBuilder builder = new StringBuilder();
: builder.append(origFieldName_+ " " + fieldType_.toString());
: return builder.toString();
:   }
:
:   public TIcebergPartitionField toThrift() {
: TIcebergPartitionField result = new TIcebergPartitionField();
: result.setField_id(fieldId_);
:
> I mean can we add a new field such as 'columnName_' to record source column
Thanks for the explanation! What you're saying totally makes sense and I think 
it should be part of this patch. I made the related changes to introduce a new 
field for the original field name and to populate it from Iceberg.



--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 4
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 11:53:21 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, wangsheng, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16512

to look at the new patch set (#4).

Change subject: IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..

IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

A SHOW CREATE TABLE output didn't contain the PARTITION BY SPEC section
for partitioned Iceberg tables. This patch addresses this shortcoming.

Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
---
M common/thrift/CatalogObjects.thrift
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeFileStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionSpec.java
M fe/src/main/java/org/apache/impala/analysis/ShowStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalIcebergTable.java
M testdata/workloads/functional-query/queries/QueryTest/show-create-table.test
11 files changed, 173 insertions(+), 32 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/12/16512/4
--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 4
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

2020-09-30 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, wangsheng, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/16512

to look at the new patch set (#3).

Change subject: IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for 
Iceberg Tables
..

IMPALA-10187: Add PARTITON BY SPEC to SHOW CREATE TABLE for Iceberg Tables

A SHOW CREATE TABLE output didn't contain the PARTITION BY SPEC section
for partitioned Iceberg tables. This patch addresses this shortcoming.

Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
---
M common/thrift/CatalogObjects.thrift
M fe/src/main/java/org/apache/impala/analysis/CreateTableLikeFileStmt.java
M fe/src/main/java/org/apache/impala/analysis/CreateTableStmt.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionField.java
M fe/src/main/java/org/apache/impala/analysis/IcebergPartitionSpec.java
M fe/src/main/java/org/apache/impala/analysis/ShowStatsStmt.java
M fe/src/main/java/org/apache/impala/analysis/ToSqlUtils.java
M fe/src/main/java/org/apache/impala/catalog/FeIcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/IcebergTable.java
M fe/src/main/java/org/apache/impala/catalog/local/LocalIcebergTable.java
M testdata/workloads/functional-query/queries/QueryTest/show-create-table.test
11 files changed, 174 insertions(+), 33 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/12/16512/3
--
To view, visit http://gerrit.cloudera.org:8080/16512
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ie4c43b75057807ab513a220d348155be2487e714
Gerrit-Change-Number: 16512
Gerrit-PatchSet: 3
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 


[Impala-ASF-CR] IMPALA-10062: TestCompressedNonText.test insensitivity to extension can fail due to wrong filename

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16518 )

Change subject: IMPALA-10062: 
TestCompressedNonText.test_insensitivity_to_extension can fail due to wrong 
filename
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/16518
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I265faf8d2e7f4251b18264052eededbeb2296f57
Gerrit-Change-Number: 16518
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 08:39:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10062: TestCompressedNonText.test insensitivity to extension can fail due to wrong filename

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16518 )

Change subject: IMPALA-10062: 
TestCompressedNonText.test_insensitivity_to_extension can fail due to wrong 
filename
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6489/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16518
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I265faf8d2e7f4251b18264052eededbeb2296f57
Gerrit-Change-Number: 16518
Gerrit-PatchSet: 2
Gerrit-Owner: Zoltan Borok-Nagy 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 08:39:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 20:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/6488/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 20
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 08:21:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10164: Supporting HadoopCatalog for Iceberg table

2020-09-30 Thread Gabor Kaszab (Code Review)
Gabor Kaszab has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16446 )

Change subject: IMPALA-10164: Supporting HadoopCatalog for Iceberg table
..


Patch Set 20: Code-Review+2

Those XFAIL failures can be omitted, not those the ones that make the build 
fail. However, there is a failing test in this job:
https://jenkins.impala.io/job/ubuntu-16.04-from-scratch/12194/
I see test_full_acid_original_files failing but that seems to be flaky anyway 
so I'm sure it's unrelated to this change.
https://issues.apache.org/jira/browse/IMPALA-10143

Let me re-run the verify job.
Carry +2 from Zoltan.


--
To view, visit http://gerrit.cloudera.org:8080/16446
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ic1893c50a633ca22d4bca6726c9937b026f5d5ef
Gerrit-Change-Number: 16446
Gerrit-PatchSet: 20
Gerrit-Owner: wangsheng 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Reviewer: wangsheng 
Gerrit-Comment-Date: Wed, 30 Sep 2020 08:20:39 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/16522 )

Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/7321/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 1
Gerrit-Owner: fifteencai 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Wed, 30 Sep 2020 08:17:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-10193: Limit the memory usage for the whole test cluster

2020-09-30 Thread fifteencai (Code Review)
fifteencai has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/16522


Change subject: IMPALA-10193: Limit the memory usage for the whole test cluster
..

IMPALA-10193: Limit the memory usage for the whole test cluster

This patch introduces a new approach of limiting the memory usage
for both mini-cluster and CDH cluster. Without this limit, clusters
are prone to getting killed in containers whose mem limit are lower
than the host's memory size. i.e. The mini-cluster may running in a
container with 32GB limitted by CGROUPS, while the host machine has
128GB. Under this circumstance, if the container is started with
'-privileged' command argument, both mini and CDH clusters compute
their mem_limit according to 128GB rather than 32GB. They will be
killed when attempting to apply for extra memories.

Currently, the mem-limit estimating algorithms for Impalad and Node
Manager are different:

for Impalad:  mem_limit = 0.7 * sys_mem / cluster_size (default is 3)

for Node Manager:
1. Leave aside 24GB, then fit the left memory into threasholds.
2. The bare limit is 4GB and maximum limit 48GB

In headge of over-consumption, we

- Added a new environment variable MAX_MEM_GB in 'bin/impala-config.sh'
- Modified the algorithm in 'bin/start-impala-cluster.py', making it
  taking MAX_MEM_GB rather than sys_mem into account.
- Modified the algorightm in
 'testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py',
  Similarly, making MAX_MEM_GB substitutes for sys_mem .

Testing: this patch worked in a 32GB docker container running on a 128GB
 host machine. All 1188 unit tests get passed.

Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
---
M bin/impala-config.sh
M bin/start-impala-cluster.py
M testdata/cluster/node_templates/common/etc/hadoop/conf/yarn-site.xml.py
3 files changed, 10 insertions(+), 3 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/16522/1
--
To view, visit http://gerrit.cloudera.org:8080/16522
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I8537fd748e279d5a0e689872aeb4dbfd0c84dc93
Gerrit-Change-Number: 16522
Gerrit-PatchSet: 1
Gerrit-Owner: fifteencai