Riza Suminto has posted comments on this change. ( 
http://gerrit.cloudera.org:8080/20770 )

Change subject: IMPALA-12426: Query History Table
......................................................................


Patch Set 9:

(4 comments)

Will do another pass, but here are few comments from me:

http://gerrit.cloudera.org:8080/#/c/20770/9/be/src/exprs/expr-test.cc
File be/src/exprs/expr-test.cc:

http://gerrit.cloudera.org:8080/#/c/20770/9/be/src/exprs/expr-test.cc@4350
PS9, Line 4350: TestStringValue("prettyprint_duration(-1)", "-1000.000ns");
Can you please add the new string functions to BenchmarkStringFunctions in 
be/src/benchmarks/expr-benchmark.cc?

If possible, please update the perf table there as well.
To get consistent number, I usually do the following before running 
microbenchmark.

sudo cpupower frequency-set --governor performance
echo "1" | sudo tee /sys/devices/system/cpu/intel_pstate/no_turbo


http://gerrit.cloudera.org:8080/#/c/20770/9/be/src/exprs/string-functions-ir.cc
File be/src/exprs/string-functions-ir.cc:

http://gerrit.cloudera.org:8080/#/c/20770/9/be/src/exprs/string-functions-ir.cc@1811
PS9, Line 1811: const string& fmt_str = PrettyPrinter::Print(val, unit);
PrettyPrinter::Print use stringstream internally. I'm worried using it as row 
function will cause memory and performance issue like IMPALA-10984. Perhaps a 
more performant variant that involve less string allocation and copying is 
needed here.


http://gerrit.cloudera.org:8080/#/c/20770/9/tests/custom_cluster/test_query_log.py
File tests/custom_cluster/test_query_log.py:

http://gerrit.cloudera.org:8080/#/c/20770/9/tests/custom_cluster/test_query_log.py@51
PS9, Line 51: sleep(4)
This might takes longer in sanitized build. Some test delay for longer like 
this:
https://github.com/apache/impala/blob/9ecf0cbfc79cd4ee0b1b0ff9143f3eda20cbdcb4/tests/query_test/test_runtime_filters.py#L37

Maybe it is more deterministic to wait until 
impala-server.completed_queries.written metric increased? example:
https://github.com/apache/impala/blob/9ecf0cbfc79cd4ee0b1b0ff9143f3eda20cbdcb4/tests/custom_cluster/test_executor_groups.py#L175-L176


http://gerrit.cloudera.org:8080/#/c/20770/9/tests/custom_cluster/test_query_log.py@359
PS9, Line 359: shell_result = run_impala_shell_cmd_no_expect(vector,
             :                                       
["--query={0}".format(create_tbl_sql),
             :                                         "--show_profiles"])
Why not use self.execute_query() and retrive the text profile using query 
handle?



--
To view, visit http://gerrit.cloudera.org:8080/20770
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: comment
Gerrit-Change-Id: I2d2da9d450fba4e789400cfa62927fc25d34f844
Gerrit-Change-Number: 20770
Gerrit-PatchSet: 9
Gerrit-Owner: Jason Fehr <jf...@cloudera.com>
Gerrit-Reviewer: Andrew Sherman <asher...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>
Gerrit-Reviewer: Jason Fehr <jf...@cloudera.com>
Gerrit-Reviewer: Michael Smith <michael.sm...@cloudera.com>
Gerrit-Reviewer: Riza Suminto <riza.sumi...@cloudera.com>
Gerrit-Comment-Date: Wed, 17 Jan 2024 01:52:15 +0000
Gerrit-HasComments: Yes

Reply via email to