[Impala-ASF-CR] IMPALA-9398: Fix shell history duplication when cmdloop breaks

2020-03-02 Thread Tamas Mate (Code Review)
Tamas Mate has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15345


Change subject: IMPALA-9398: Fix shell history duplication when cmdloop breaks
..

IMPALA-9398: Fix shell history duplication when cmdloop breaks

This change adds a new condition to avoid re-reading the impala-shell
history when the cmdloop is broken. The loop can break due to exceptions
such as KeyboardInterrupt.

Testing:
 - The change was testad manually on my local dev env
 - Added a new EE shell test to verify the history after SIGINT

Change-Id: If4faf46134f44d91e56748642f47d448707db53c
---
M shell/impala_shell.py
M tests/shell/test_shell_interactive.py
2 files changed, 36 insertions(+), 2 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/45/15345/1
--
To view, visit http://gerrit.cloudera.org:8080/15345
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: If4faf46134f44d91e56748642f47d448707db53c
Gerrit-Change-Number: 15345
Gerrit-PatchSet: 1
Gerrit-Owner: Tamas Mate 


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by enabling live progress in interactive mode

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15219 )

Change subject: IMPALA-9384: Improve Impala shell usability by enabling 
live_progress in interactive mode
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5398/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 3
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 06:14:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..

IMPALA-9452: slightly reduce reservation for test_spilling_aggs

As far as I can tell, the query failed to spill because the
pre-agg was able to release reservation before the post-agg
needed it. Probably there is some variance because of buffering
in the exchange.

This change slightly reduces the reservation to minimise the
chance of this recurring.

Also remove a duplicated instance of this test.

Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Reviewed-on: http://gerrit.cloudera.org:8080/15339
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M testdata/workloads/functional-query/queries/QueryTest/spilling-aggs.test
1 file changed, 1 insertion(+), 24 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 06:12:23 +
Gerrit-HasComments: No


[Impala-ASF-CR] santizer and -release flags should cause buildall.sh to exit

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15341 )

Change subject: santizer and -release flags should cause buildall.sh to exit
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
Gerrit-Change-Number: 15341
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 06:02:35 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by enabling live progress in interactive mode

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15219 )

Change subject: IMPALA-9384: Improve Impala shell usability by enabling 
live_progress in interactive mode
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5397/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 2
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 05:44:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 4: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 05:37:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..

IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink
configs where they were created

Currently the exprs created in plan nodes and sink configs were being
closed in their respective exec nodes and data sink instances (that
were created using them) when the fragment instance closes. This patch
ensure that they are closed from the plan nodes and sink configs
themselves so that when these are shared among instances, they don't
prematurely close the expressions.

Testing:
Successfully passed exhaustive tests.

Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Reviewed-on: http://gerrit.cloudera.org:8080/15318
Reviewed-by: Bikramjeet Vig 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/aggregation-node-base.cc
M be/src/exec/aggregation-node-base.h
M be/src/exec/aggregator.cc
M be/src/exec/aggregator.h
M be/src/exec/analytic-eval-node.cc
M be/src/exec/analytic-eval-node.h
M be/src/exec/data-sink.cc
M be/src/exec/data-sink.h
M be/src/exec/exchange-node.cc
M be/src/exec/exchange-node.h
M be/src/exec/exec-node.cc
M be/src/exec/exec-node.h
M be/src/exec/grouping-aggregator.cc
M be/src/exec/grouping-aggregator.h
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node-base.h
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/exec/nested-loop-join-node.cc
M be/src/exec/nested-loop-join-node.h
M be/src/exec/non-grouping-aggregator.h
M be/src/exec/partial-sort-node.cc
M be/src/exec/partial-sort-node.h
M be/src/exec/partitioned-hash-join-builder.cc
M be/src/exec/partitioned-hash-join-builder.h
M be/src/exec/partitioned-hash-join-node.cc
M be/src/exec/partitioned-hash-join-node.h
M be/src/exec/scan-node.cc
M be/src/exec/sort-node.cc
M be/src/exec/sort-node.h
M be/src/exec/topn-node.cc
M be/src/exec/topn-node.h
M be/src/exec/union-node.cc
M be/src/exec/union-node.h
M be/src/exec/unnest-node.cc
M be/src/exec/unnest-node.h
M be/src/runtime/data-stream-test.cc
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/fragment-instance-state.h
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
41 files changed, 225 insertions(+), 139 deletions(-)

Approvals:
  Bikramjeet Vig: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 5
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by enabling live progress in interactive mode

2020-03-02 Thread Alice Fan (Code Review)
Hello Andrew Sherman, David Knupp, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15219

to look at the new patch set (#3).

Change subject: IMPALA-9384: Improve Impala shell usability by enabling 
live_progress in interactive mode
..

IMPALA-9384: Improve Impala shell usability by enabling live_progress in 
interactive mode

In order to improve usability, this patch would like to make Impala shell show 
query
processing status while the query is running. The patch enables shell option
live_progress by default when a user launches impala shell in the interactive 
mode.
The patch also provides a new command line flag "--disable_live_progress", 
which allows
a user to disable live_progress at runtime. In the interactive mode, a user can 
disable
live_progress by either using the proposed command line flag or setting the 
option as
False in the config file. As for in the non-interactive mode (when the -q or -f 
options
are used), live reporting is not supported. Impala-shell will disable 
live_progress if
the mode is detected.

Testing:
- Added and updated tests in test_shell_interactive.py and 
test_shell_commandline.py
- Successfully ran all shell related tests is detected.

Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
---
M shell/impala_shell.py
M shell/impala_shell_config_defaults.py
M shell/option_parser.py
M tests/shell/test_shell_commandline.py
M tests/shell/test_shell_interactive.py
5 files changed, 33 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/19/15219/3
--
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 3
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by enabling live progress in interactive mode

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15219 )

Change subject: IMPALA-9384: Improve Impala shell usability by enabling 
live_progress in interactive mode
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15219/2/tests/shell/test_shell_interactive.py
File tests/shell/test_shell_interactive.py:

http://gerrit.cloudera.org:8080/#/c/15219/2/tests/shell/test_shell_interactive.py@535
PS2, Line 535: "
flake8: E501 line too long (91 > 90 characters)



--
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 2
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 04:59:19 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by enabling live progress in interactive mode

2020-03-02 Thread Alice Fan (Code Review)
Hello Andrew Sherman, David Knupp, Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15219

to look at the new patch set (#2).

Change subject: IMPALA-9384: Improve Impala shell usability by enabling 
live_progress in interactive mode
..

IMPALA-9384: Improve Impala shell usability by enabling live_progress in 
interactive mode

In order to improve usability, this patch would like to make Impala shell show 
query
processing status by default. The patch enables shell option live_progress when 
a user
launches impala shell in the interactive mode. The patch also provides a new 
command
line flag “--disable_live_progress”, which allows a user to disable 
live_progress at
runtime. In the interactive mode, a user can disable live_progress by either 
using
the proposed command line flag or setting the option as False in the config 
file. As for
in the non-interactive mode (when the -q or -f options are used), live 
reporting is not
supported. Impala-shell will disable live_progress if the mode is detected.

Testing:
- Added and updated tests in test_shell_interactive.py and 
test_shell_commandline.py
- Successfully ran all shell related tests

Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
---
M shell/impala_shell.py
M shell/impala_shell_config_defaults.py
M shell/option_parser.py
M tests/shell/test_shell_commandline.py
M tests/shell/test_shell_interactive.py
5 files changed, 33 insertions(+), 5 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/19/15219/2
--
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 2
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5396/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 03:05:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Sahil Takiar (Code Review)
Hello Anonymous Coward (572), Tim Armstrong, Joe McDonnell, Impala Public 
Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15334

to look at the new patch set (#2).

Change subject: IMPALA-9400: Initial Ozone support
..

IMPALA-9400: Initial Ozone support

Ozone is a block-based filesystem store and is similar to HDFS in
many aspects. This patch adds initial support for reading data stored
on Ozone and writing data to Ozone. It adds Ozone to the whitelist of
Impala writable filesystems and adds the necessary Ozone client jars
to the Impala classpath.

Initially, Ozone will be treated as a remote storage service, similar to
how Impala treats S3 and ABFS. Ozone will typically be deployed on a
separate storage cluster, so all reads will be remote.

Rather than updating to a new CDP GBN with the added Ozone client jars,
I just modified the existing GBN to include the new jars in the Maven
repo.

Testing:
* Manually tested that connecting to an external Ozone cluster works and
  that Impala can create an Ozone-backed table, write to the
  table, read from the table, and drop the table
* Ran core tests with and without USE_CDP_HIVE=true

Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
---
M be/src/exec/hdfs-table-sink.cc
M be/src/runtime/io/disk-io-mgr.cc
M be/src/runtime/io/disk-io-mgr.h
M be/src/util/hdfs-util.cc
M be/src/util/hdfs-util.h
M bin/impala-config.sh
M fe/pom.xml
M fe/src/main/java/org/apache/impala/common/FileSystemUtil.java
M impala-parent/pom.xml
9 files changed, 72 insertions(+), 4 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/34/15334/2
--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
> I think this is fine to leave enabled by default. This would be an argument
Sounds good. Will follow up on the EC support separately.



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 02:20:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] santizer and -release flags should cause buildall.sh to exit

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15341 )

Change subject: santizer and -release flags should cause buildall.sh to exit
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
Gerrit-Change-Number: 15341
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:53:03 +
Gerrit-HasComments: No


[Impala-ASF-CR] santizer and -release flags should cause buildall.sh to exit

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15341 )

Change subject: santizer and -release flags should cause buildall.sh to exit
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5395/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
Gerrit-Change-Number: 15341
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:29:02 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 4:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5394/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:23:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5441/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:14:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:14:14 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:14:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 3:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5393/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:13:07 +
Gerrit-HasComments: No


[Impala-ASF-CR] santizer and -release flags should cause buildall.sh to exit

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15341 )

Change subject: santizer and -release flags should cause buildall.sh to exit
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5440/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/15341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
Gerrit-Change-Number: 15341
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 01:07:15 +
Gerrit-HasComments: No


[Impala-ASF-CR] santizer and -release flags should cause buildall.sh to exit

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15341


Change subject: santizer and -release flags should cause buildall.sh to exit
..

santizer and -release flags should cause buildall.sh to exit

Currently, if any sanitizer (or clang-tidy) flag is added to buildall.sh
and the -release flag is added, then buildall.sh will silently ignore
the -release flag. Impala does not support adding sanitizer flags to
debug/release builds. Sanitizers, release, and debug builds are all
distinct and use their own set of compile flags.

This patch changes the behavior of buildall.sh so that if -release and
any sanitizer flag is specified, the build exits with the error:
"ERROR: more than one CMake build type defined: RELEASE TSAN"

Testing:
* './buildall.sh -skiptests -noclean -tsan -release' fails (as expected)
* './buildall.sh -skiptests -noclean -tsan' passes
* './buildall.sh -notests -noclean -codecoverage -release' passes

Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
---
M buildall.sh
1 file changed, 10 insertions(+), 7 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/41/15341/1
--
To view, visit http://gerrit.cloudera.org:8080/15341
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ide0c2017d4e5abbf6fcb25c890d241bbcee8422e
Gerrit-Change-Number: 15341
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15339 )

Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5392/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:59:53 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9369: Make createInsertEvents() async.

2020-03-02 Thread Vihang Karajgaonkar (Code Review)
Vihang Karajgaonkar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15263 )

Change subject: IMPALA-9369: Make createInsertEvents() async.
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15263/2/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java
File fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java:

http://gerrit.cloudera.org:8080/#/c/15263/2/fe/src/main/java/org/apache/impala/service/CatalogOpExecutor.java@4393
PS2, Line 4393: fireInsertEventsAsync(table.getDb().getName(),
This implementation will spin up a singleThreadExecutor for each partition 
which was inserted which might be expensive. Can we collect all the insert 
events first and then call the async method so all the insert events are 
processed by same single thread?

May be something like:


CompletableFuture.runAsync(() -> {
  try (MetaStoreClient metaStoreClient = catalog_.getMetaStoreClient()) {
   for (InsertEventInfo info : insertEventInfos) {
MetaStoreUtil.fireInsertEvent(info);
   } catch (Exception e) {
 LOG.error(...);
   }}, 
Executors.newSingleThreadExecutor()).thenRun(()->fireInsertEventThread.shutdown)
})



--
To view, visit http://gerrit.cloudera.org:8080/15263
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I97802a5c03abc067fccf9e3a9d0047324626706e
Gerrit-Change-Number: 15263
Gerrit-PatchSet: 2
Gerrit-Owner: Anurag Mantripragada 
Gerrit-Reviewer: Anurag Mantripragada 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Quanlong Huang 
Gerrit-Reviewer: Vihang Karajgaonkar 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:52:01 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 4:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5438/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:40:59 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 4: Code-Review+2

rebased and carrying forward Tim and Csaba's +2


--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:40:32 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Bikramjeet Vig (Code Review)
Hello Tim Armstrong, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15318

to look at the new patch set (#4).

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..

IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink
configs where they were created

Currently the exprs created in plan nodes and sink configs were being
closed in their respective exec nodes and data sink instances (that
were created using them) when the fragment instance closes. This patch
ensure that they are closed from the plan nodes and sink configs
themselves so that when these are shared among instances, they don't
prematurely close the expressions.

Testing:
Successfully passed exhaustive tests.

Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
---
M be/src/exec/aggregation-node-base.cc
M be/src/exec/aggregation-node-base.h
M be/src/exec/aggregator.cc
M be/src/exec/aggregator.h
M be/src/exec/analytic-eval-node.cc
M be/src/exec/analytic-eval-node.h
M be/src/exec/data-sink.cc
M be/src/exec/data-sink.h
M be/src/exec/exchange-node.cc
M be/src/exec/exchange-node.h
M be/src/exec/exec-node.cc
M be/src/exec/exec-node.h
M be/src/exec/grouping-aggregator.cc
M be/src/exec/grouping-aggregator.h
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node-base.h
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/exec/nested-loop-join-node.cc
M be/src/exec/nested-loop-join-node.h
M be/src/exec/non-grouping-aggregator.h
M be/src/exec/partial-sort-node.cc
M be/src/exec/partial-sort-node.h
M be/src/exec/partitioned-hash-join-builder.cc
M be/src/exec/partitioned-hash-join-builder.h
M be/src/exec/partitioned-hash-join-node.cc
M be/src/exec/partitioned-hash-join-node.h
M be/src/exec/scan-node.cc
M be/src/exec/sort-node.cc
M be/src/exec/sort-node.h
M be/src/exec/topn-node.cc
M be/src/exec/topn-node.h
M be/src/exec/union-node.cc
M be/src/exec/union-node.h
M be/src/exec/unnest-node.cc
M be/src/exec/unnest-node.h
M be/src/runtime/data-stream-test.cc
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/fragment-instance-state.h
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
41 files changed, 225 insertions(+), 139 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/18/15318/4
--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 3:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/aggregation-node-base.cc
File be/src/exec/aggregation-node-base.cc:

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/aggregation-node-base.cc@50
PS2, Line 50: void AggregationPlanNode::Close() {
:   for (AggregatorConfig* config : aggs_) config->Close();
:   PlanNode::Close();
: }
> I don't want to hold this up, just wanted to mention that the whole Expr/Pl
I agree, that will save us the effort of adding the close methods, but will 
have to investigate more to figure out if decoupling of cache_entry_ management 
and exprs is cleaner and intuitive. will add a JIRA for this:IMPALA-9454


http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/exec-node.h
File be/src/exec/exec-node.h:

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/exec-node.h@73
PS2, Line 73:   /// Close() releases all resources that were allocated in 
Init().
> The second sentence is probably at risk of getting stale, maybe remove it?
Done. Here and in data-sink.h


http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/exec-node.cc
File be/src/exec/exec-node.cc:

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/exec-node.cc@91
PS2, Line 91:   for (auto& child : children_) {
> Consider using ranged for, just for consistency?
Done



--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:30:34 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Bikramjeet Vig (Code Review)
Hello Tim Armstrong, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15318

to look at the new patch set (#3).

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..

IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink
configs where they were created

Currently the exprs created in plan nodes and sink configs were being
closed in their respective exec nodes and data sink instances (that
were created using them) when the fragment instance closes. This patch
ensure that they are closed from the plan nodes and sink configs
themselves so that when these are shared among instances, they don't
prematurely close the expressions.

Testing:
Successfully passed exhaustive tests.

Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
---
M be/src/exec/aggregation-node-base.cc
M be/src/exec/aggregation-node-base.h
M be/src/exec/aggregator.cc
M be/src/exec/aggregator.h
M be/src/exec/analytic-eval-node.cc
M be/src/exec/analytic-eval-node.h
M be/src/exec/data-sink.cc
M be/src/exec/data-sink.h
M be/src/exec/exchange-node.cc
M be/src/exec/exchange-node.h
M be/src/exec/exec-node.cc
M be/src/exec/exec-node.h
M be/src/exec/grouping-aggregator.cc
M be/src/exec/grouping-aggregator.h
M be/src/exec/hdfs-scan-node-base.cc
M be/src/exec/hdfs-scan-node-base.h
M be/src/exec/hdfs-table-sink.cc
M be/src/exec/hdfs-table-sink.h
M be/src/exec/nested-loop-join-node.cc
M be/src/exec/nested-loop-join-node.h
M be/src/exec/non-grouping-aggregator.h
M be/src/exec/partial-sort-node.cc
M be/src/exec/partial-sort-node.h
M be/src/exec/partitioned-hash-join-builder.cc
M be/src/exec/partitioned-hash-join-builder.h
M be/src/exec/partitioned-hash-join-node.cc
M be/src/exec/partitioned-hash-join-node.h
M be/src/exec/scan-node.cc
M be/src/exec/sort-node.cc
M be/src/exec/sort-node.h
M be/src/exec/topn-node.cc
M be/src/exec/topn-node.h
M be/src/exec/union-node.cc
M be/src/exec/union-node.h
M be/src/exec/unnest-node.cc
M be/src/exec/unnest-node.h
M be/src/runtime/data-stream-test.cc
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/fragment-instance-state.h
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
41 files changed, 224 insertions(+), 141 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/18/15318/3
--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 3: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:26:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..

IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node
and from data sinks to their config classes

This patch moves the code responsible for codegening from partitioned
hash join's exec node to plan node and from all data sinks to their
respective config classes.

Testing:
Manually verified that codegen works for all modified exec nodes and
data sinks.
Ran all exhaustive tests successfully.

Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Reviewed-on: http://gerrit.cloudera.org:8080/15322
Reviewed-by: Impala Public Jenkins 
Tested-by: Impala Public Jenkins 
---
M be/src/exec/blocking-join-node.cc
M be/src/exec/blocking-join-node.h
M be/src/exec/data-sink.cc
M be/src/exec/data-sink.h
M be/src/exec/filter-context.cc
M be/src/exec/filter-context.h
M be/src/exec/grouping-aggregator.cc
M be/src/exec/nested-loop-join-node.cc
M be/src/exec/partitioned-hash-join-builder.cc
M be/src/exec/partitioned-hash-join-builder.h
M be/src/exec/partitioned-hash-join-node.cc
M be/src/exec/partitioned-hash-join-node.h
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
M be/src/runtime/runtime-state.h
16 files changed, 281 insertions(+), 190 deletions(-)

Approvals:
  Impala Public Jenkins: Looks good to me, approved; Verified

--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 4
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Tue, 03 Mar 2020 00:22:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Joe McDonnell (Code Review)
Joe McDonnell has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..

IMPALA-9444: Fix URL for postgresql jar download

The current URL uses http://central.maven.org, which has been
decommissioned as part of the transition to HTTPS. See:
https://central.sonatype.org/articles/2019/Jul/15/central-http-deprecation-update/
https://central.sonatype.org/articles/2020/Jan/15/501-https-required-error/
This switches the URL to use https://repo.maven.apache.org/.

Testing:
 - Removed postgresql jar, ran bin/create-test-configuration.sh,
   verified that it downloaded the jar.

Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Reviewed-on: http://gerrit.cloudera.org:8080/15337
Reviewed-by: Joe McDonnell 
Tested-by: Impala Public Jenkins 
---
M bin/create-test-configuration.sh
1 file changed, 1 insertion(+), 1 deletion(-)

Approvals:
  Joe McDonnell: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9452: slightly reduce reservation for test spilling aggs

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15339


Change subject: IMPALA-9452: slightly reduce reservation for test_spilling_aggs
..

IMPALA-9452: slightly reduce reservation for test_spilling_aggs

As far as I can tell, the query failed to spill because the
pre-agg was able to release reservation before the post-agg
needed it. Probably there is some variance because of buffering
in the exchange.

This change slightly reduces the reservation to minimise the
chance of this recurring.

Also remove a duplicated instance of this test.

Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
---
M testdata/workloads/functional-query/queries/QueryTest/spilling-aggs.test
1 file changed, 1 insertion(+), 24 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/39/15339/1
--
To view, visit http://gerrit.cloudera.org:8080/15339
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: Ifb8376e2e12d3f73d6c0e27c697be4fc86f9c755
Gerrit-Change-Number: 15339
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-8690: Add LIRS cache eviction algorithm

2020-03-02 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15306 )

Change subject: IMPALA-8690: Add LIRS cache eviction algorithm
..


Patch Set 9:

(9 comments)

http://gerrit.cloudera.org:8080/#/c/15306/9//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15306/9//COMMIT_MSG@17
PS9, Line 17: the
the startup flag


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/cache.cc
File be/src/util/cache/cache.cc:

http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/cache.cc@87
PS9, Line 87:   case Cache::EvictionPolicy::FIFO:
nit: formatting


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc
File be/src/util/cache/lirs-cache.cc:

http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@42
PS9, Line 42: DEFINE_double_hidden(lirs_tombstone_multiple, 2.00,
I don't see anywhere you validate that this value is reasonable. Might be good 
to do a quick check somewhere and log a warning or something to help avoid 
people accidentally mis-configuring this and getting bad perf as a result.


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@89
PS9, Line 89: // oldest PROTECTED entry is demoted to UNPROTECTED.
Might be nice to also include the semantics of UNINIT in this paragraph.


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@93
PS9, Line 93: last entry
I think some of the comments around the ordering of the lists get a little 
confusing - if I understand correctly, by 'last entry' here you mean the oldest 
entry, which is actually at the front of the queue. Similarly, later in this 
paragraph you refer to the elements being removed as being 'at the end' but 
TrimRecencyQueue refers to them as being at the 'front of the queue'. Would be 
good to settle on a small number of terms, eg. oldest/front and newest/back, 
and use them consistently everywhere.

This is a little nit-pick-ier, but it might also be nice to be consistent about 
whether they're referred to as 'list' or 'queue'


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@100
PS9, Line 100: cache
list


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@241
PS9, Line 241: DCHECK_LT(cur_state.ref_count, 
std::numeric_limits::max());
Is there anything that guarantees this won't be the case, i.e. as written why 
wouldn't it be valid to call Lookup() a bunch of times on the same element and 
violate this?

Admittedly it seems unlikely this would be violated in a real workload, but we 
may still need to handle it.


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@242
PS9, Line 242: ++
nit: its impala convention to always prefer this at the beginning, i.e. 
'++cur_state.ref_count'


http://gerrit.cloudera.org:8080/#/c/15306/9/be/src/util/cache/lirs-cache.cc@466
PS9, Line 466: MoveToQueueBack
MoveToRecencyQueueBack



--
To view, visit http://gerrit.cloudera.org:8080/15306
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I670fa4b2b7c93998130dc4e8b2546bb93e9a84f8
Gerrit-Change-Number: 15306
Gerrit-PatchSet: 9
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Thomas Tauber-Marshall 
Gerrit-Comment-Date: Mon, 02 Mar 2020 23:28:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15335 )

Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..

IMPALA-9439: Make --scale a mandatory option in single_node_perf_run.py

This makes the --scale option mandatory when running
./bin/single_node_perf_run.py. If the option is not set, the script
attempts to run the workloads against the database
'[workload-name]None_[file-format]', which is typically not what the
user wants.

Makes some minor documentation improvements to the script.

Testing:
* Confirmed that running without the --scale option set causes the
  script to error out with a help message

Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Reviewed-on: http://gerrit.cloudera.org:8080/15335
Reviewed-by: Tim Armstrong 
Tested-by: Impala Public Jenkins 
---
M bin/single_node_perf_run.py
1 file changed, 13 insertions(+), 2 deletions(-)

Approvals:
  Tim Armstrong: Looks good to me, approved
  Impala Public Jenkins: Verified

--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 2
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15335 )

Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..


Patch Set 1: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 22:35:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-3343: Make impala-shell compatible with python 3.

2020-03-02 Thread David Knupp (Code Review)
David Knupp has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15132 )

Change subject: IMPALA-3343: Make impala-shell compatible with python 3.
..


Patch Set 17:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15132/15/tests/shell/test_shell_commandline.py
File tests/shell/test_shell_commandline.py:

http://gerrit.cloudera.org:8080/#/c/15132/15/tests/shell/test_shell_commandline.py@485
PS15, Line 485:   if SHELL_IS_PYTHON_2:
> I got confused by the scenario, but I checked this out and played around an
Still digging into this. The proposed change of cname.decode('utf8') fails 
under python 3 because with python 3:

  'str' object has no attribute 'decode'

In the meantime, with that change, the test against the built tarball still 
fails:

E   Query: select '?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?, ?,?, ?, 
?, ?, ?, ?,?, ?, ?, ?, ?, ?, ?, ?, ?\t'
  E   Query submitted at: 2020-03-02 14:06:17 (Coordinator: 
http://dknupp-desktop:25000)
  E   Query progress can be monitored at: 
http://dknupp-desktop:25000/query_plan?query_id=c649f690ad99d702:8571f574
  E   Unknown Exception : 'ascii' codec can't encode character u'\u0430' in 
position 107: ordinal not in range(128)
  E   Traceback (most recent call last):
  E File 
"/home/dknupp/Impala/shell/build/impala-shell-3.4.0-SNAPSHOT/impala_shell.py", 
line 1934, in 
  E   impala_shell_main()
  E File 
"/home/dknupp/Impala/shell/build/impala-shell-3.4.0-SNAPSHOT/impala_shell.py", 
line 1888, in impala_shell_main
  E   if execute_queries_non_interactive_mode(options, query_options):
  E File 
"/home/dknupp/Impala/shell/build/impala-shell-3.4.0-SNAPSHOT/impala_shell.py", 
line 1697, in execute_queries_non_interactive_mode
  E   shell.execute_query_list(queries))
  E File 
"/home/dknupp/Impala/shell/build/impala-shell-3.4.0-SNAPSHOT/impala_shell.py", 
line 1531, in execute_query_list
  E   print('Could not execute command: %s' % q, file=sys.stderr)
  E   UnicodeEncodeError: 'ascii' codec can't encode character u'\u0410' in 
position 35: ordinal not in range(128)



--
To view, visit http://gerrit.cloudera.org:8080/15132
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ibb75e162bac0faeae3e12106c15da39cbfb8b462
Gerrit-Change-Number: 15132
Gerrit-PatchSet: 17
Gerrit-Owner: David Knupp 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 22:09:55 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5391/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 21:59:36 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 2:

Tested manually, skipping CI to unblock builds faster.


--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 21:31:41 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has submitted this change and it was merged. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..

IMPALA-9445: skip test_local_tz_conversion for hbase

This skips the test for filesystems where we don't start
HBase. It's a bit unfortunate that we have to disable
all of this test, so this is a stopgap to get builds
green.

Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Reviewed-on: http://gerrit.cloudera.org:8080/15338
Reviewed-by: Csaba Ringhofer 
Tested-by: Tim Armstrong 
---
M tests/custom_cluster/test_local_tz_conversion.py
1 file changed, 4 insertions(+), 1 deletion(-)

Approvals:
  Csaba Ringhofer: Looks good to me, approved
  Tim Armstrong: Verified

--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: merged
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 3
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15338/2//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15338/2//COMMIT_MSG@10
PS2, Line 10: bit unfortunate
I think that it doesn't add extra coverage to run this on S3, so it's ok to 
leave it this way. I added this test only to avoid the very specific bug of 
depending on flag use_local_tz_for_unix_timestamp_conversions when reading 
timestamps from different fileformats, as this bug could not be detected by 
tests that run with default flags.



--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 21:31:32 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 2: Verified+1


--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 21:31:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 2: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 02 Mar 2020 21:23:28 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Tim Armstrong (Code Review)
Hello Joe McDonnell, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15338

to look at the new patch set (#2).

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..

IMPALA-9445: skip test_local_tz_conversion for hbase

This skips the test for filesystems where we don't start
HBase. It's a bit unfortunate that we have to disable
all of this test, so this is a stopgap to get builds
green.

Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
---
M tests/custom_cluster/test_local_tz_conversion.py
1 file changed, 4 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/38/15338/2
--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 2
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5389/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 20:15:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5390/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 02 Mar 2020 20:11:49 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9424: Add six to shell/ext-py

2020-03-02 Thread Thomas Tauber-Marshall (Code Review)
Thomas Tauber-Marshall has abandoned this change. ( 
http://gerrit.cloudera.org:8080/15294 )

Change subject: IMPALA-9424: Add six to shell/ext-py
..


Abandoned
--
To view, visit http://gerrit.cloudera.org:8080/15294
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: abandon
Gerrit-Change-Id: If5567a28884b75643929f64f0ca913fc7ce42e5d
Gerrit-Change-Number: 15294
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5388/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:44:13 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15338 )

Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15338/1/tests/custom_cluster/test_local_tz_conversion.py
File tests/custom_cluster/test_local_tz_conversion.py:

http://gerrit.cloudera.org:8080/#/c/15338/1/tests/custom_cluster/test_local_tz_conversion.py@22
PS1, Line 22: from tests.common.skip import SkipIfBuildType, SkipIfABFS, 
SkipIfADLS, SkipIfS3
flake8: F401 'tests.common.skip.SkipIfBuildType' imported but unused



--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:38:21 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9445: skip test local tz conversion for hbase

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15338


Change subject: IMPALA-9445: skip test_local_tz_conversion for hbase
..

IMPALA-9445: skip test_local_tz_conversion for hbase

This skips the test for filesystems where we don't start
HBase. It's a bit unfortunate that we have to disable
all of this test, so this is a stopgap to get builds
green.

Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
---
M tests/custom_cluster/test_local_tz_conversion.py
1 file changed, 4 insertions(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/38/15338/1
--
To view, visit http://gerrit.cloudera.org:8080/15338
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I9fb603ebe0a5820d2eed0b6ec5dca2980fd5521d
Gerrit-Change-Number: 15338
Gerrit-PatchSet: 1
Gerrit-Owner: Tim Armstrong 


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 2: Code-Review+2

carrying over Tim's +2


--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:32:56 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 3:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5437/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:33:30 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 3: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:33:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Bikramjeet Vig (Code Review)
Bikramjeet Vig has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15322 )

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..


Patch Set 2:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15322/1/be/src/exec/partitioned-hash-join-builder.cc
File be/src/exec/partitioned-hash-join-builder.cc:

http://gerrit.cloudera.org:8080/#/c/15322/1/be/src/exec/partitioned-hash-join-builder.cc@1162
PS1, Line 1162:   codegen, true, *hash_table_config_, _build_row_fn));
> line too long (92 > 90)
Done



--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:32:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node and from data sinks to their config classes

2020-03-02 Thread Bikramjeet Vig (Code Review)
Hello Daniel Becker, Tim Armstrong, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15322

to look at the new patch set (#2).

Change subject: IMPALA-4080 [part 6]: Move codegen from join's exec node to 
plan node and from data sinks to their config classes
..

IMPALA-4080 [part 6]: Move codegen from join's exec node to plan node
and from data sinks to their config classes

This patch moves the code responsible for codegening from partitioned
hash join's exec node to plan node and from all data sinks to their
respective config classes.

Testing:
Manually verified that codegen works for all modified exec nodes and
data sinks.
Ran all exhaustive tests successfully.

Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
---
M be/src/exec/blocking-join-node.cc
M be/src/exec/blocking-join-node.h
M be/src/exec/data-sink.cc
M be/src/exec/data-sink.h
M be/src/exec/filter-context.cc
M be/src/exec/filter-context.h
M be/src/exec/grouping-aggregator.cc
M be/src/exec/nested-loop-join-node.cc
M be/src/exec/partitioned-hash-join-builder.cc
M be/src/exec/partitioned-hash-join-builder.h
M be/src/exec/partitioned-hash-join-node.cc
M be/src/exec/partitioned-hash-join-node.h
M be/src/runtime/fragment-instance-state.cc
M be/src/runtime/krpc-data-stream-sender.cc
M be/src/runtime/krpc-data-stream-sender.h
M be/src/runtime/runtime-state.h
16 files changed, 281 insertions(+), 190 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/22/15322/2
--
To view, visit http://gerrit.cloudera.org:8080/15322
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I8fe54b4040d6e3c2fc224f7cd6c8c0bfe813707a
Gerrit-Change-Number: 15322
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..


Patch Set 2:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5436/ 
DRY_RUN=true


--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:28:00 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Joe McDonnell (Code Review)
Joe McDonnell has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..


Patch Set 2: Code-Review+2

(1 comment)

Carry +2

http://gerrit.cloudera.org:8080/#/c/15337/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15337/1//COMMIT_MSG@11
PS1, Line 11: 
https://central.sonatype.org/articles/2019/Jul/15/central-http-deprecation-update/
> Add https://central.sonatype.org/articles/2020/Jan/15/501-https-required-er
Added some context



--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:27:39 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Joe McDonnell (Code Review)
Hello Tim Armstrong, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15337

to look at the new patch set (#2).

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..

IMPALA-9444: Fix URL for postgresql jar download

The current URL uses http://central.maven.org, which has been
decommissioned as part of the transition to HTTPS. See:
https://central.sonatype.org/articles/2019/Jul/15/central-http-deprecation-update/
https://central.sonatype.org/articles/2020/Jan/15/501-https-required-error/
This switches the URL to use https://repo.maven.apache.org/.

Testing:
 - Removed postgresql jar, ran bin/create-test-configuration.sh,
   verified that it downloaded the jar.

Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
---
M bin/create-test-configuration.sh
1 file changed, 1 insertion(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/37/15337/2
--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 2
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15337 )

Change subject: IMPALA-9444: Fix URL for postgresql jar download
..


Patch Set 1: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15337/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15337/1//COMMIT_MSG@11
PS1, Line 11:
Add https://central.sonatype.org/articles/2020/Jan/15/501-https-required-error/ 
for context, maybe?



--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:20:01 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
> Do you mean add the flag ALLOW_OZONE_FILES and set it to false by default,
I think this is fine to leave enabled by default. This would be an argument for 
enabling erasure encoding.



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 19:15:48 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9444: Fix URL for postgresql jar download

2020-03-02 Thread Joe McDonnell (Code Review)
Joe McDonnell has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15337


Change subject: IMPALA-9444: Fix URL for postgresql jar download
..

IMPALA-9444: Fix URL for postgresql jar download

The current URL uses central.maven.org, which has been decommissioned.
This switches the URL to use https://repo.maven.apache.org/.

Testing:
 - Removed postgresql jar, ran bin/create-test-configuration.sh,
   verified that it downloaded the jar.

Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
---
M bin/create-test-configuration.sh
1 file changed, 1 insertion(+), 1 deletion(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/37/15337/1
--
To view, visit http://gerrit.cloudera.org:8080/15337
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I7ee9a1ce77bc3f8c6b3f728633cafe4eb37e669d
Gerrit-Change-Number: 15337
Gerrit-PatchSet: 1
Gerrit-Owner: Joe McDonnell 


[Impala-ASF-CR] IMPALA-9425 (part 1): Introduce uuids for impalads

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15321 )

Change subject: IMPALA-9425 (part 1): Introduce uuids for impalads
..


Patch Set 2:

(5 comments)

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-http-handler.cc
File be/src/service/impala-http-handler.cc:

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-http-handler.cc@498
PS2, Line 498: membership_snapshot
nit: DCHECK this is not null?


http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-http-handler.cc@503
PS2, Line 503: backend_id
Tim already mentioned it, but might be good to expose the UUID here; it looks 
like its possible for there to be two separate entries in the table, with the 
same address, so adding the uuid would help differentiate duplicate addresses


http://gerrit.cloudera.org:8080/#/c/15321/2/common/thrift/StatestoreService.thrift
File common/thrift/StatestoreService.thrift:

http://gerrit.cloudera.org:8080/#/c/15321/2/common/thrift/StatestoreService.thrift@62
PS2, Line 62:   1: required Types.BackendId backend_id;
nit: add docs


http://gerrit.cloudera.org:8080/#/c/15321/2/tests/common/impala_test_suite.py
File tests/common/impala_test_suite.py:

http://gerrit.cloudera.org:8080/#/c/15321/2/tests/common/impala_test_suite.py@1080
PS2, Line 1080:   def wait_for_state(self, handle, expected_state, timeout, 
client=None):
  : """Waits for the given 'query_handle' to reach the 
'expected_state'. If it does not
  : reach the given state within 'timeout' seconds, the method 
throws an AssertionError.
  : """
  : self.wait_for_any_state(handle, [expected_state], timeout, 
client)
  :
  :   def wait_for_any_state(self, handle, expected_states, 
timeout, client=None):
  : """Waits for the given 'query_handle' to reach one of 
'expected_states'. If it does
  : not reach one of the given states within 'timeout' seconds, 
the method throws an
  : AssertionError. Returns the final state.
update docs for 'client' param


http://gerrit.cloudera.org:8080/#/c/15321/2/tests/custom_cluster/test_restart_services.py
File tests/custom_cluster/test_restart_services.py:

http://gerrit.cloudera.org:8080/#/c/15321/2/tests/custom_cluster/test_restart_services.py@103
PS2, Line 103:   # Debug action to delay statestore updates to give the 
restarted impalad time to
 :   # register itself before a membership topic update is 
generated.
 :   self._start_impala_cluster(
 :   
["--state_store_args=--debug_actions=DO_SUBSCRIBER_UPDATE:JITTER@1"],
 :   num_coordinators=1, cluster_size=3)
 :   assert len(self.cluster.impalads) == 3
nit: use CustomClusterTestSuite.with_args?



--
To view, visit http://gerrit.cloudera.org:8080/15321
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icf8067349ed6b765f6fed830b7140f60738e9061
Gerrit-Change-Number: 15321
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 18:53:01 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9425 (part 1): Introduce uuids for impalads

2020-03-02 Thread Fang-Yu Rao (Code Review)
Fang-Yu Rao has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15321 )

Change subject: IMPALA-9425 (part 1): Introduce uuids for impalads
..


Patch Set 2:

(4 comments)

Hi Thomas, I have read this patch but do not have any major comment. I raised 
those questions mostly because I am not familiar with this part of our BE 
codebase and would like to check if my understanding is correct. Thank you very 
much!

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/scheduling/cluster-membership-mgr.cc
File be/src/scheduling/cluster-membership-mgr.cc:

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/scheduling/cluster-membership-mgr.cc@135
PS2, Line 135:   bool ss_is_recovering = statestore_subscriber_ != nullptr
It is not directly related to this patch. But is it ok for us to move this 
following statement to the place where we have known there is at least one 
condition in the following if-statement at Line 148 that evaluates to false? I 
just like to know whether or not my understanding is correct. Thanks!


http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/scheduling/cluster-membership-mgr.cc@347
PS2, Line 347:   // Send notifications to all callbacks registered to receive 
updates.
Is it true that the callbacks referred to here correspond to the input argument 
to the function cluster_membership_mgr_->RegisterUpdateCallbackFn() at Line 548 
in exec-env.cc? I'd just like to check if my understanding is correct.


http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-http-handler.cc
File be/src/service/impala-http-handler.cc:

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-http-handler.cc@506
PS2, Line 506: Value 
location_name(TNetworkAddressToString(be_entry.second.address).c_str(),
'be_entry.second' is of the type TBackendDescriptor, which has 3 fields that 
are of the type TNetworkAddress, i.e., 'address', 'debug_http_address', and 
'krpc_address'. Is it okay to use any of the other two instead, i.e., 
'debug_http_address' or 'krpc_address'? Thanks!


http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-server.cc
File be/src/service/impala-server.cc:

http://gerrit.cloudera.org:8080/#/c/15321/2/be/src/service/impala-server.cc@1723
PS2, Line 1723:   map> queries_to_cancel;
Is it possible to use 'backend_id' instead of 'address' to construct the map? 
Or the change would be too huge since the elements in 'queries_to_cancel' will 
also be used in other function later on, e.g., 
cancellation_thread_pool_->Offer(CancellationWork::BackendFailure(cancellation_entry.first,
 cancellation_entry.second));"?



--
To view, visit http://gerrit.cloudera.org:8080/15321
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Icf8067349ed6b765f6fed830b7140f60738e9061
Gerrit-Change-Number: 15321
Gerrit-PatchSet: 2
Gerrit-Owner: Thomas Tauber-Marshall 
Gerrit-Reviewer: Fang-Yu Rao 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 18:52:14 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
> I don't want to scope creep this, but another thought I had was that it see
Do you mean add the flag ALLOW_OZONE_FILES and set it to false by default, or 
do you mean set ALLOW_ERASURE_CODED_FILES=true by default?



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 18:38:05 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
> From what I know so far, Ozone does expose block locations.
I don't want to scope creep this, but another thought I had was that it seems 
weird to allow Ozone by default but not allow HDFS erasure coding 
(ALLOW_ERASURE_CODED_FILES=false). Maybe we should just leave as-is though... 
might be worth thinking about flipping the result.

Treating it as a remote FS by default makes sense to me, I think, since we then 
avoid the overhead of fetching and storing the block locations.

No we don't support rack-aware scheduling in Impala..



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 18:27:17 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Anonymous Coward (Code Review)
Anonymous Coward (572) has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

Ozone does support block locations (LocatedFIleStatus) and we have tested it 
against Hive. It does not however support short circuit reads.


--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Anonymous Coward (572)
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 18:03:51 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9384: Improve Impala shell usability by automatically enable live progress in the interactive mode

2020-03-02 Thread David Knupp (Code Review)
David Knupp has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15219 )

Change subject: IMPALA-9384: Improve Impala shell usability by automatically 
enable live_progress in the interactive mode
..


Patch Set 1:

> Patch Set 1:
> David and Andrew, please let me know if you agree with option 2? Thanks!

Thanks Alice -- Andrew and I both in favor of option 2.


-- 
To view, visit http://gerrit.cloudera.org:8080/15219
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I3765b775f663fa227e59728acffe4d5ea9a5e2d3
Gerrit-Change-Number: 15219
Gerrit-PatchSet: 1
Gerrit-Owner: Alice Fan 
Gerrit-Reviewer: Alice Fan 
Gerrit-Reviewer: Andrew Sherman 
Gerrit-Reviewer: David Knupp 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:42:40 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15335 )

Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..


Patch Set 1:

Build started: https://jenkins.impala.io/job/gerrit-verify-dryrun/5435/ 
DRY_RUN=false


--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:38:09 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
> I have some questions about the block locations and scheduling. Does this l
>From what I know so far, Ozone does expose block locations.

This patch changes supportsStorageIds so that it returns true for Ozone 
filesystems as well (this should make the HdfsScanNode fetch the block 
locations and perform scheduling based on the block locations). However, I 
think I might revert that part of the patch.

I believe, for the most part, we expect to deploy Ozone on a dedicated storage 
cluster that is separate from the Impala compute cluster - e.g. separate 
compute vs storage cluster. So I think, at least for now, its okay if we treat 
Ozone as remote storage (similar to what we do for ABFS and S3A).

Even with separate storage + compute clusters, block locations might help with 
rack-aware scheduling, but I don't think we support that in Impala yet?

I'll make the fixes for the DiskIoMgr.



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Sahil Takiar 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:37:17 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15335 )

Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5387/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:21:42 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15332 )

Change subject: IMPALA-7686: Allow RANGE() clause before HASH() clause for 
PARTITION BY
..


Patch Set 2:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5386/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
Gerrit-Change-Number: 15332
Gerrit-PatchSet: 2
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:08:26 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15332 )

Change subject: IMPALA-7686: Allow RANGE() clause before HASH() clause for 
PARTITION BY
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5385/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
Gerrit-Change-Number: 15332
Gerrit-PatchSet: 1
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:07:29 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15335 )

Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..


Patch Set 1: Code-Review+2


--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:03:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Tim Armstrong (Code Review)
Tim Armstrong has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15334/1//COMMIT_MSG@18
PS1, Line 18:
I have some questions about the block locations and scheduling. Does this load 
block locations and do locality-based scheduling?

It looks like there isn't any code added to DiskIoMgr to route the reads to a 
particular queue - it seems like in DiskIoMgr::AssignQueue() it falls through 
in a weird way.

OK if we want to defer this but should probably file a JIRA.



--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 17:02:03 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-4080 [part 3]: Moved codegen code for TupleRowComparator into a Config object

2020-03-02 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15281 )

Change subject: IMPALA-4080 [part 3]: Moved codegen code for TupleRowComparator 
into a Config object
..


Patch Set 3: Code-Review+1

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15281/3/be/src/util/tuple-row-compare.h
File be/src/util/tuple-row-compare.h:

http://gerrit.cloudera.org:8080/#/c/15281/3/be/src/util/tuple-row-compare.h@164
PS3, Line 164: std::vector nulls_first_;
Could be const ref as the bool->int8_t conversion is done in 
TupleRowComparatorConfig



--
To view, visit http://gerrit.cloudera.org:8080/15281
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I265604ecbae2c80ff59e697fddfeea4976b25cd9
Gerrit-Change-Number: 15281
Gerrit-PatchSet: 3
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Daniel Becker 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 16:56:47 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9439: Make --scale a mandatory option in single node perf run.py

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15335


Change subject: IMPALA-9439: Make --scale a mandatory option in 
single_node_perf_run.py
..

IMPALA-9439: Make --scale a mandatory option in single_node_perf_run.py

This makes the --scale option mandatory when running
./bin/single_node_perf_run.py. If the option is not set, the script
attempts to run the workloads against the database
'[workload-name]None_[file-format]', which is typically not what the
user wants.

Makes some minor documentation improvements to the script.

Testing:
* Confirmed that running without the --scale option set causes the
  script to error out with a help message

Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
---
M bin/single_node_perf_run.py
1 file changed, 13 insertions(+), 2 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/35/15335/1
--
To view, visit http://gerrit.cloudera.org:8080/15335
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I9ad13580f8f74388981a37d6960087d95cde574b
Gerrit-Change-Number: 15335
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 


[Impala-ASF-CR] IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and sink configs where they were created

2020-03-02 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15318 )

Change subject: IMPALA-4080 [part 5]: Invoke close on Exprs from plan nodes and 
sink configs where they were created
..


Patch Set 2: Code-Review+2

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/aggregation-node-base.cc
File be/src/exec/aggregation-node-base.cc:

http://gerrit.cloudera.org:8080/#/c/15318/2/be/src/exec/aggregation-node-base.cc@50
PS2, Line 50: void AggregationPlanNode::Close() {
:   for (AggregatorConfig* config : aggs_) config->Close();
:   PlanNode::Close();
: }
I don't want to hold this up, just wanted to mention that the whole 
Expr/PlanNode closing logic looks unnecessary to me: if I didn't miss 
something, then the only resources that need closing are 'cache_entry_'s that 
are used by ScalarFnCall to  get UDF/UDA files. These could be registered to 
the fragment instance when we get the file and released during closing.



--
To view, visit http://gerrit.cloudera.org:8080/15318
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I7b79d8b8f8e740609676dd13bd300a5514c65b0d
Gerrit-Change-Number: 15318
Gerrit-PatchSet: 2
Gerrit-Owner: Bikramjeet Vig 
Gerrit-Reviewer: Bikramjeet Vig 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 16:24:51 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

2020-03-02 Thread Adam Tamas (Code Review)
Adam Tamas has uploaded a new patch set (#2). ( 
http://gerrit.cloudera.org:8080/15332 )

Change subject: IMPALA-7686: Allow RANGE() clause before HASH() clause for 
PARTITION BY
..

IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

Modified the sql_parser.cup to accept the reversed syntax.

Testing:
-Added an extra AnalyzesOk into ‘AnalyzeKuduDDLTest()’ ->
‘testDDlsOnKuduTable()’ in AnalyzeKuduDDLTest.java where the RANGE() and
HASH() pair is swapped.

Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
---
M fe/src/main/cup/sql-parser.cup
M fe/src/test/java/org/apache/impala/analysis/AnalyzeKuduDDLTest.java
2 files changed, 10 insertions(+), 0 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/32/15332/2
--
To view, visit http://gerrit.cloudera.org:8080/15332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
Gerrit-Change-Number: 15332
Gerrit-PatchSet: 2
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

2020-03-02 Thread Adam Tamas (Code Review)
Adam Tamas has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15332


Change subject: IMPALA-7686: Allow RANGE() clause before HASH() clause for 
PARTITION BY
..

IMPALA-7686: Allow RANGE() clause before HASH() clause for PARTITION BY

Modified the sql_parser.cup to accept the reversed syntax.

Testing:
-swapped an instance of RANGE() + HASH() pair in test_kudu.py ->
‘test_primary_key_and_distribution()’ and left the rest to check if
it is still create the right table with ‘assert_show_create_equals()’.

Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
---
M fe/src/main/cup/sql-parser.cup
M tests/query_test/test_kudu.py
2 files changed, 10 insertions(+), 3 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/32/15332/1
--
To view, visit http://gerrit.cloudera.org:8080/15332
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I914e340e9acfb0f49c7d4f78705dbd9bde0aec8c
Gerrit-Change-Number: 15332
Gerrit-PatchSet: 1
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 8:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5384/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 8
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 16:11:37 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15334 )

Change subject: IMPALA-9400: Initial Ozone support
..


Patch Set 1:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5383/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Joe McDonnell 
Gerrit-Reviewer: Tim Armstrong 
Gerrit-Comment-Date: Mon, 02 Mar 2020 15:54:34 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Csaba Ringhofer (Code Review)
Csaba Ringhofer has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 8: Code-Review+2

(4 comments)

http://gerrit.cloudera.org:8080/#/c/15104/5//COMMIT_MSG
Commit Message:

http://gerrit.cloudera.org:8080/#/c/15104/5//COMMIT_MSG@20
PS5, Line 20: Note,
typo: testing


http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-columnar-scanner-ir.cc
File be/src/exec/hdfs-columnar-scanner-ir.cc:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-columnar-scanner-ir.cc@23
PS5, Line 23: scratch_batch_
nit: the != nullptr in written out in most of Impala


http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-orc-scanner.cc
File be/src/exec/hdfs-orc-scanner.cc:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-orc-scanner.cc@702
PS5, Line 702:
nit: the != nullptr in written out in most of Impala


http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.cc
File be/src/exec/orc-column-readers.cc:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.cc@264
PS5, Line 264: >col_id_path_map_[
Can you mention it in the commit message that no scratch batch is used if there 
are collections? This will lead to also not having codegen in this case if the 
we want to use the same functions as Parquet, which always uses scratch batches.



--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 8
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 15:30:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Gabor Kaszab has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 7:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15104/7/be/src/exec/hdfs-scanner.h
File be/src/exec/hdfs-scanner.h:

http://gerrit.cloudera.org:8080/#/c/15104/7/be/src/exec/hdfs-scanner.h@176
PS7, Line 176: Not inlined in IR so it can be replaced with a constant.
> nit: this comment is applicable to both 'tuple_byte_size' functions, so it
Done



--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 7
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 15:26:44 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15104

to look at the new patch set (#8).

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..

IMPALA-9228: ORC scanner reads rows into scratch batch

Because of performance considerations this change enhances ORC
scanner to populate a scratch batch on a column-by-column manner
using data from the column readers. Once this is done the parquet
code was reused to apply runtime filter and conjuncts and to
populate the outgoing row batch.

This approach reduces the number of virtual function calls and takes
advantage of the columnar orientation of the data to enhance scan
performance. Additionally, introducing the scratch batch concept also
opens the door for codegen runtime filtering and applying conjuncts.

Note, this change doesn't cover collection types just primitive types
and struct.

Tesing:
  - Re-run the full test suite to verify that no regression is
introduced.
  - Checked the performance impact by running TPCH workload on a
scale 25 database using single_node_perf_run.py. The total query
runtime is decreased by 0-20% depending on how scan heavy the
particular query was. The more scan heavy the query is the more
performance gain I observe.

Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/exec/CMakeLists.txt
R be/src/exec/hdfs-columnar-scanner-ir.cc
A be/src/exec/hdfs-columnar-scanner.cc
A be/src/exec/hdfs-columnar-scanner.h
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
M be/src/exec/hdfs-scanner.h
M be/src/exec/orc-column-readers.cc
M be/src/exec/orc-column-readers.h
M be/src/exec/parquet/CMakeLists.txt
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M be/src/exec/parquet/hdfs-parquet-scanner.h
R be/src/exec/scratch-tuple-batch.h
15 files changed, 425 insertions(+), 144 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/04/15104/8
--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 8
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-9400: Initial Ozone support

2020-03-02 Thread Sahil Takiar (Code Review)
Sahil Takiar has uploaded this change for review. ( 
http://gerrit.cloudera.org:8080/15334


Change subject: IMPALA-9400: Initial Ozone support
..

IMPALA-9400: Initial Ozone support

Ozone is a block-based filesystem store and is similar to HDFS in
many aspects. This patch adds initial support for reading data stored
on Ozone and writing data to Ozone. It adds Ozone to the whitelist of
Impala writable filesystems and adds the necessary Ozone client jars
to the Impala classpath.

Rather than updating to a new CDP GBN with the added Ozone client jars,
I just modified the existing GBN to include the new jars in the Maven
repo.

Testing:
* Manually tested that connecting to an external Ozone cluster works and
  that Impala can create an Ozone-backed table, write to the
  table, read from the table, and drop the table
* Ran core tests with and without USE_CDP_HIVE=true

Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
---
M be/src/exec/hdfs-table-sink.cc
M be/src/util/hdfs-util.cc
M be/src/util/hdfs-util.h
M bin/impala-config.sh
M fe/pom.xml
M fe/src/main/java/org/apache/impala/common/FileSystemUtil.java
M impala-parent/pom.xml
7 files changed, 55 insertions(+), 4 deletions(-)



  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/34/15334/1
--
To view, visit http://gerrit.cloudera.org:8080/15334
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newchange
Gerrit-Change-Id: I2544ff61e8767f061253a62a01ce2b7dfc384a99
Gerrit-Change-Number: 15334
Gerrit-PatchSet: 1
Gerrit-Owner: Sahil Takiar 


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5382/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 7
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 14:34:12 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 7:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5381/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 7
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:58:43 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Adam Tamas (Code Review)
Adam Tamas has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..


Patch Set 7:

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15288/6/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/15288/6/tests/webserver/test_web_pages.py@423
PS6, Line 423:
> nit: This should be in the same line as the string, also the second line sh
Done



--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 7
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:48:28 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5380/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 6
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:48:25 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Adam Tamas (Code Review)
Adam Tamas has uploaded a new patch set (#7). ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..

IMPALA-6360: Don't show full query statement on Impala webUI by default

Added the ‘query_stmt_size’ flag to impala-server.cc with default value
of 250 and modified the ‘ImpalaHttpHandler::QueryStateToJson()’ to
truncate the end of the statements if they are too long.

Testing:
-manual testing in the WebUI.
-added 'test_query_stmt()' to test_web_pages.py

Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
---
M be/src/service/impala-http-handler.cc
M be/src/service/impala-server.cc
M tests/webserver/test_web_pages.py
3 files changed, 32 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/88/15288/7
--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 7
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Impala Public Jenkins (Code Review)
Impala Public Jenkins has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 6:

Build Successful

https://jenkins.impala.io/job/gerrit-code-review-checks/5379/ : Initial code 
review checks passed. Use gerrit-verify-dryrun-external or gerrit-verify-dryrun 
to run full precommit tests.


--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 6
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:41:58 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Tamas Mate (Code Review)
Tamas Mate has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..


Patch Set 6: Code-Review+1

(1 comment)

Thanks for the updates. Just found one more nit, other than that LGTM!

http://gerrit.cloudera.org:8080/#/c/15288/6/tests/webserver/test_web_pages.py
File tests/webserver/test_web_pages.py:

http://gerrit.cloudera.org:8080/#/c/15288/6/tests/webserver/test_web_pages.py@423
PS6, Line 423: """
nit: This should be in the same line as the string, also the second line should 
start 3 characters earlier. Some words might also fit in the first line till 
col 90.



--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 6
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:40:11 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Zoltan Borok-Nagy (Code Review)
Zoltan Borok-Nagy has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 7: Code-Review+1

(1 comment)

http://gerrit.cloudera.org:8080/#/c/15104/7/be/src/exec/hdfs-scanner.h
File be/src/exec/hdfs-scanner.h:

http://gerrit.cloudera.org:8080/#/c/15104/7/be/src/exec/hdfs-scanner.h@176
PS7, Line 176: Not inlined in IR so it can be replaced with a constant.
nit: this comment is applicable to both 'tuple_byte_size' functions, so it 
should be put before L174, similarly to Base.



-- 
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 7
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:35:23 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Gabor Kaszab has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 7:

(3 comments)

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-scanner.h
File be/src/exec/hdfs-scanner.h:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/hdfs-scanner.h@a566
PS5, Line 566:
 :
 :
 :
 :
> nit: maybe these could be moved together to public.
Done


http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.h
File be/src/exec/orc-column-readers.h:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.h@188
PS5, Line 188:   Status ReadValue(int row_idx, Tuple* tuple, MemPool* pool) 
final WARN_UNUSED_RESULT;
> Since ReadValue() is public I think you don't need the friend declarations
Friend declarations are still needed for accessing 'derived->batch_' from 
OrcPrimitiveColumnReader::ReadValueBatch()


http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.cc
File be/src/exec/orc-column-readers.cc:

http://gerrit.cloudera.org:8080/#/c/15104/5/be/src/exec/orc-column-readers.cc@221
PS5, Line 221: ErrorMsg msg(errorCode, scanner_->filename(), 
orc_column_id_);
 : return scanner_-
> nit: fits single line again
Done



--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 7
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 13:14:45 +
Gerrit-HasComments: Yes


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15104

to look at the new patch set (#7).

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..

IMPALA-9228: ORC scanner reads rows into scratch batch

Because of performance considerations this change enhances ORC
scanner to populate a scratch batch on a column-by-column manner
using data from the column readers. Once this is done the parquet
code was reused to apply runtime filter and conjuncts and to
populate the outgoing row batch.

This approach reduces the number of virtual function calls and takes
advantage of the columnar orientation of the data to enhance scan
performance. Additionally, introducing the scratch batch concept also
opens the door for codegen runtime filtering and applying conjuncts.

Tesing:
  - Re-run the full test suite to verify that no regression is
introduced.
  - Checked the performance impact by running TPCH workload on a
scale 25 database using single_node_perf_run.py. The total query
runtime is decreased by 0-20% depending on how scan heavy the
particular query was. The more scan heavy the query is the more
performance gain I observe.

Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/exec/CMakeLists.txt
R be/src/exec/hdfs-columnar-scanner-ir.cc
A be/src/exec/hdfs-columnar-scanner.cc
A be/src/exec/hdfs-columnar-scanner.h
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
M be/src/exec/hdfs-scanner.h
M be/src/exec/orc-column-readers.cc
M be/src/exec/orc-column-readers.h
M be/src/exec/parquet/CMakeLists.txt
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M be/src/exec/parquet/hdfs-parquet-scanner.h
R be/src/exec/scratch-tuple-batch.h
15 files changed, 426 insertions(+), 144 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/04/15104/7
--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 7
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 


[Impala-ASF-CR] IMPALA-6360: Don't show full query statement on Impala webUI by default

2020-03-02 Thread Adam Tamas (Code Review)
Adam Tamas has uploaded a new patch set (#6). ( 
http://gerrit.cloudera.org:8080/15288 )

Change subject: IMPALA-6360: Don't show full query statement on Impala webUI by 
default
..

IMPALA-6360: Don't show full query statement on Impala webUI by default

Added the ‘query_stmt_size’ flag to impala-server.cc with default value
of 250 and modified the ‘ImpalaHttpHandler::QueryStateToJson()’ to
truncate the end of the statements if they are too long.

Testing:
-manual testing in the WebUI.
-added 'test_query_stmt()' to test_web_pages.py

Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
---
M be/src/service/impala-http-handler.cc
M be/src/service/impala-server.cc
M tests/webserver/test_web_pages.py
3 files changed, 34 insertions(+), 1 deletion(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/88/15288/6
--
To view, visit http://gerrit.cloudera.org:8080/15288
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: Ib7109a0be5d1022b4f8d6e72441cf5dc1dc42605
Gerrit-Change-Number: 15288
Gerrit-PatchSet: 6
Gerrit-Owner: Adam Tamas 
Gerrit-Reviewer: Adam Tamas 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Tamas Mate 


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Gabor Kaszab has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/15104 )

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..


Patch Set 6:

PS6 is a rebase with master to resolve conflict with the ORC string allocations 
enhancement.


--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 6
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 
Gerrit-Comment-Date: Mon, 02 Mar 2020 12:57:33 +
Gerrit-HasComments: No


[Impala-ASF-CR] IMPALA-9228: ORC scanner reads rows into scratch batch

2020-03-02 Thread Gabor Kaszab (Code Review)
Hello Zoltan Borok-Nagy, Csaba Ringhofer, Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

http://gerrit.cloudera.org:8080/15104

to look at the new patch set (#6).

Change subject: IMPALA-9228: ORC scanner reads rows into scratch batch
..

IMPALA-9228: ORC scanner reads rows into scratch batch

Because of performance considerations this change enhances ORC
scanner to populate a scratch batch on a column-by-column manner
using data from the column readers. Once this is done the parquet
code was reused to apply runtime filter and conjuncts and to
populate the outgoing row batch.

This approach reduces the number of virtual function calls and takes
advantage of the columnar orientation of the data to enhance scan
performance. Additionally, introducing the scratch batch concept also
opens the door for codegen runtime filtering and applying conjuncts.

Tesing:
  - Re-run the full test suite to verify that no regression is
introduced.
  - Checked the performance impact by running TPCH workload on a
scale 25 database using single_node_perf_run.py. The total query
runtime is decreased by 0-20% depending on how scan heavy the
particular query was. The more scan heavy the query is the more
performance gain I observe.

Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
---
M be/src/codegen/gen_ir_descriptions.py
M be/src/codegen/impala-ir.cc
M be/src/exec/CMakeLists.txt
R be/src/exec/hdfs-columnar-scanner-ir.cc
A be/src/exec/hdfs-columnar-scanner.cc
A be/src/exec/hdfs-columnar-scanner.h
M be/src/exec/hdfs-orc-scanner.cc
M be/src/exec/hdfs-orc-scanner.h
M be/src/exec/hdfs-scanner.h
M be/src/exec/orc-column-readers.cc
M be/src/exec/orc-column-readers.h
M be/src/exec/parquet/CMakeLists.txt
M be/src/exec/parquet/hdfs-parquet-scanner.cc
M be/src/exec/parquet/hdfs-parquet-scanner.h
R be/src/exec/scratch-tuple-batch.h
15 files changed, 423 insertions(+), 140 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/04/15104/6
--
To view, visit http://gerrit.cloudera.org:8080/15104
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I56db0325dee283d73742ebbae412d19693fac0ca
Gerrit-Change-Number: 15104
Gerrit-PatchSet: 6
Gerrit-Owner: Gabor Kaszab 
Gerrit-Reviewer: Csaba Ringhofer 
Gerrit-Reviewer: Gabor Kaszab 
Gerrit-Reviewer: Impala Public Jenkins 
Gerrit-Reviewer: Zoltan Borok-Nagy 


  1   2   >