Navis created SPARK-8312:
----------------------------

             Summary: Populate statistics info of hive tables if it's needed to be
                 Key: SPARK-8312
                 URL: https://issues.apache.org/jira/browse/SPARK-8312
             Project: Spark
          Issue Type: Improvement
          Components: SQL
            Reporter: Navis
            Priority: Minor


Currently, spark-sql uses the stats in the metastore to estimate the size of a 
hive table, which means the analyze command should be executed before accessing 
the table for better planning, especially for joins. But even with the stats, 
the estimate cannot reflect the real input size of the query when a partition 
pruning predicate exists in it.
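
As a rough illustration of the gap (a sketch, not the proposed patch), the 
snippet below sums file sizes only for the partitions that survive a pruning 
predicate. It assumes a single-level key=value partition directory layout and 
uses the local filesystem as a stand-in for HDFS. The function name 
estimate_input_size and its parameters are hypothetical.

```python
import os


def estimate_input_size(table_dir, partition_filter):
    """Sum the sizes of files in the partitions accepted by partition_filter.

    Assumes (hypothetically) that table_dir contains one directory per
    partition, named key=value, as in the usual hive layout.
    """
    total = 0
    for entry in sorted(os.listdir(table_dir)):
        part_path = os.path.join(table_dir, entry)
        # Skip plain files and anything that is not a key=value directory.
        if not os.path.isdir(part_path) or "=" not in entry:
            continue
        key, value = entry.split("=", 1)
        # Partition pruning: drop partitions the predicate rejects.
        if not partition_filter(key, value):
            continue
        for root, _dirs, files in os.walk(part_path):
            for name in files:
                total += os.path.getsize(os.path.join(root, name))
    return total
```

A table-level stat corresponds to a filter that accepts every partition, 
while a query with a predicate on the partition column would only read the 
subset the filter accepts, so the two numbers can differ widely.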

Even worse, hive cannot update metastore stats for external tables, a bug 
that was fixed recently in HIVE-6727. The issue detail says the bug applies 
to all hive versions between 0.13.0 and 1.2.0.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
