[ 
https://issues.apache.org/jira/browse/HIVE-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Prasanth J updated HIVE-6300:
-----------------------------

    Attachment: HIVE-6300.2.patch

[~leftylev] Thanks for pointing out the nitty-gritty mistakes. That was really 
usefull. I removed the hive config hive.stats.avg.row.size as it is no longer 
required. Average row size will be computed from data size and schema in very 
stages of operator tree (TableScanOperator), so it should be available to all 
operators (including LIMIT). Can you please skim through it again to see if it 
looks good?

> Add documentation for stats configs to hive-default.xml.template
> ----------------------------------------------------------------
>
>                 Key: HIVE-6300
>                 URL: https://issues.apache.org/jira/browse/HIVE-6300
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Query Processor, Statistics
>    Affects Versions: 0.13.0
>            Reporter: Prasanth J
>            Assignee: Prasanth J
>            Priority: Minor
>             Fix For: 0.13.0
>
>         Attachments: HIVE-6300.1.patch, HIVE-6300.2.patch
>
>
> Add documentation for the following configs
> hive.stats.max.variable.length
> hive.stats.list.num.entries
> hive.stats.map.num.entries
> hive.stats.map.parallelism
> hive.stats.fetch.column.stats
> hive.stats.avg.row.size
> hive.stats.join.factor
> hive.stats.deserialization.factor
> hive.stats.fetch.partition.stats



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to