[ 
https://issues.apache.org/jira/browse/IMPALA-9560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17069116#comment-17069116
 ] 

ASF subversion and git services commented on IMPALA-9560:
---------------------------------------------------------

Commit e9dd5d3f8c1d533bc5ae94c7e0677820fcd851aa in impala's branch 
refs/heads/master from Joe McDonnell
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=e9dd5d3 ]

IMPALA-9560: Fix TestStatsExtrapolation for release versions

When changing the Impala version from 3.4.0-SNAPSHOT to 3.4.0-RELEASE,
TestStatsExtrapolation::test_stats_extrapolation started failing due
to a difference in the expected cardinality (expected: 17.91K,
actual 17.90K). This is because the Impala version gets embedded into
parquet files, and this causes a slight difference in file size, which
translates into a slight difference in expected cardinality.

This modifies TestStatsExtrapolation::test_stats_extrapolation to
allow any 17.9*K cardinality.

Testing:
 - Tested on master and on branch-3.4.0

Change-Id: Iebe538936f23c095ef58c808e425cfb7b31edd94
Reviewed-on: http://gerrit.cloudera.org:8080/15569
Reviewed-by: Tim Armstrong <tarmstr...@cloudera.com>
Tested-by: Impala Public Jenkins <impala-public-jenk...@cloudera.com>


> Changing version from 3.4.0-SNAPSHOT to 3.4.0-RELEASE breaks 
> TestStatsExtrapolation
> -----------------------------------------------------------------------------------
>
>                 Key: IMPALA-9560
>                 URL: https://issues.apache.org/jira/browse/IMPALA-9560
>             Project: IMPALA
>          Issue Type: Bug
>          Components: Frontend
>    Affects Versions: Impala 3.4.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Critical
>              Labels: broken-build
>
> When working on the Impala 3.4 release, we changed the version on 
> branch-3.4.0 from 3.4.0-SNAPSHOT to 3.4.0-RELEASE. 
> metadata/test_stats_extrapolation.py::TestStatsExtrapolation::test_stats_extrapolation()
>  now fails with the following error:
> {noformat}
> metadata/test_stats_extrapolation.py:44: in test_stats_extrapolation
>     self.run_test_case('QueryTest/stats-extrapolation', vector, 
> unique_database)
> common/impala_test_suite.py:690: in run_test_case
>     self.__verify_results_and_errors(vector, test_section, result, use_db)
> common/impala_test_suite.py:523: in __verify_results_and_errors
>     replace_filenames_with_placeholder)
> common/test_result_verifier.py:456: in verify_raw_results
>     VERIFIER_MAP[verifier](expected, actual)
> common/test_result_verifier.py:246: in verify_query_result_is_subset
>     assert expected_literal_strings <= actual_literal_strings
> E   assert Items in expected results not found in actual results:
> E     '   tuple-ids=0 row-size=4B cardinality=17.91K'
> E     Items in actual results:
> E     '|  output exprs: id'
> E     ''
> E     '     table: rows=unavailable size=unavailable'
> E     '   stored statistics:'
> E     'Max Per-Host Resource Reservation: Memory=8.00KB Threads=2'
> E     '     columns: unavailable'
> E     '     partitions: 0/24 rows=unavailable'
> E     '00:SCAN HDFS [test_stats_extrapolation_5c6bdfd.alltypes]'
> E     '   tuple-ids=0 row-size=4B cardinality=17.90K'
> E     '|'
> E     'Analyzed query: SELECT id FROM 
> test_stats_extrapolation_5c6bdfd.alltypes'
> E     'F00:PLAN FRAGMENT [UNPARTITIONED] hosts=1 instances=1'
> E     '   HDFS partitions=24/24 files=36 size=281.43KB'
> E     'test_stats_extrapolation_5c6bdfd.alltypes'
> E     'PLAN-ROOT SINK'
> E     '|  mem-estimate=0B mem-reservation=0B thread-reservation=0'
> E     '|  Per-Host Resources: mem-estimate=16.00MB mem-reservation=8.00KB 
> thread-reservation=2'
> E     '   in pipelines: 00(GETNEXT)'
> E     '   extrapolated-rows=unavailable max-scan-range-rows=unavailable'
> E     'Per-Host Resource Estimates: Memory=16MB'
> E     'WARNING: The following tables are missing relevant table and/or column 
> statistics.'
> E     '   mem-estimate=16.00MB mem-reservation=8.00KB 
> thread-reservation=1'{noformat}
> The output is expecting a cardinality of 17.91K, but instead the cardinality 
> is 17.90K.
> The RELEASE version has one character fewer than the SNAPSHOT version. The 
> version gets embedded in parquet files, so the parquet file is slightly 
> smaller than before. The test is estimating cardinality by looking at the 
> size of the parquet file. Apparently, this is right on the edge.
> This test should tolerate this difference.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to