Sahil Takiar has posted comments on this change. ( http://gerrit.cloudera.org:8080/16098 )
Change subject: IMPALA-9744: Treat corrupt table stats as missing to avoid bad plans ...................................................................... Patch Set 6: (5 comments) adding Tim to the review as well http://gerrit.cloudera.org:8080/#/c/16098/6/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java File fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java: http://gerrit.cloudera.org:8080/#/c/16098/6/fe/src/main/java/org/apache/impala/planner/HdfsScanNode.java@1218 PS6, Line 1218: if (numRows < -1 || (numRows == 0 && tbl_.getTotalHdfsBytes() > 0)) { : hasCorruptTableStats_ = true; : } : return numRows; what about for unpartitioned tables? it looks like if the table stats are corrupted, and numRows == 0, we still return numRows = 0 from this method. would be good to add a test case for unpartitioned tables as well. http://gerrit.cloudera.org:8080/#/c/16098/6/testdata/data/alltypes_parquet_year2009_month01.parquet File testdata/data/alltypes_parquet_year2009_month01.parquet: http://gerrit.cloudera.org:8080/#/c/16098/6/testdata/data/alltypes_parquet_year2009_month01.parquet@1 PS6, Line 1: PAR1°ºLì Ø ô×a b c d e f g h i j k l m n o p q r s N O P Q R S T U V W X Y Z [ \ ] ^ _ ` t u v w x y z { | } ~ ! "