[
https://issues.apache.org/jira/browse/HIVE-3777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ashutosh Chauhan updated HIVE-3777:
-----------------------------------
Status: Patch Available (was: Open)
> add a property in the partition to figure out if stats are accurate
> -------------------------------------------------------------------
>
> Key: HIVE-3777
> URL: https://issues.apache.org/jira/browse/HIVE-3777
> Project: Hive
> Issue Type: Improvement
> Components: Query Processor
> Affects Versions: 0.13.0
> Reporter: Namit Jain
> Assignee: Ashutosh Chauhan
> Attachments: HIVE-3777.2.patch, HIVE-3777.2.patch, HIVE-3777.3.patch,
> HIVE-3777.4.patch, HIVE-3777.patch
>
>
> Currently, stats task tries to update the statistics in the table/partition
> being updated after the table/partition is loaded. In case of a failure to
> update these stats (due to the any reason), the operation either succeeds
> (writing inaccurate stats) or fails depending on whether hive.stats.reliable
> is set to true. This can be bad for applications who do not always care about
> reliable stats, since the query may have taken a long time to execute and then
> fail eventually.
> Another property should be added to the partition: areStatsAccurate. If
> hive.stats.reliable is
> set to false, and stats could not be computed correctly, the operation would
> still succeed, update the stats, but set areStatsAccurate to false.
> If the application cares about accurate stats, it can be obtained in the
> background.
--
This message was sent by Atlassian JIRA
(v6.1#6144)