[ 
https://issues.apache.org/jira/browse/HIVE-12882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15102644#comment-15102644
 ] 

Prasanth Jayachandran commented on HIVE-12882:
----------------------------------------------

noscan cannot get row count and raw data size. partial scan cannot get raw data 
size. If you need all basic stats then full scan is the only way to go which is 
slow. 

> Automatically choose to use noscan for stats collection
> -------------------------------------------------------
>
>                 Key: HIVE-12882
>                 URL: https://issues.apache.org/jira/browse/HIVE-12882
>             Project: Hive
>          Issue Type: Sub-task
>            Reporter: Pengcheng Xiong
>
> noscan is leveraging the file system to derive the #rows and rawDataSize. 
> According to [~ashutoshc], it now only works with RC and ORC file type. We 
> would like Hive to automatically choose to use noscan or scan based on the 
> file system when stats task starts or when user issues the same query 
> "Analyze ...."



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to