Sahil Takiar created IMPALA-10083:
-------------------------------------

             Summary: Improve row count estimates when stats are not available
                 Key: IMPALA-10083
                 URL: https://issues.apache.org/jira/browse/IMPALA-10083
             Project: IMPALA
          Issue Type: Improvement
          Components: Frontend
            Reporter: Sahil Takiar


There are various improvements that we can make to estimate row count stats 
even if stats are not available for a table.

There are various factors to consider here:
 * Handling for partitioned vs. non-partitioned tables
 ** Handling for partitioned tables can be a bit tricky if the table is in a 
mixed state - some partitions have row counts while other don't
 * Interoperability with other systems such as Hive and Spark
 * Users can run alter table statements to manually set the value of the row 
count

The JIRA will be used to track the various improvements via sub-tasks.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to