Re: Reg:Column Statistics with Parquet

2014-07-25 Thread Suma Shivaprasad
Hi, I tried the same with compute statistics for columns a, b, c as above and am still seeing the same results in the explain plan. How do I confirm whether it is generating all the column stats for a given column? If this is confirmed, we can debug why Hive is still not using them. Thanks, Suma On Thu, Jul

RE: Reg:Column Statistics with Parquet

2014-07-25 Thread Navdeep Agrawal
then statistics are not getting created. From: Suma Shivaprasad [mailto:sumasai.shivapra...@gmail.com] Sent: Friday, July 25, 2014 12:43 PM To: user@hive.apache.org Subject: Re: Reg:Column Statistics with Parquet Hi, I tried the same with compute statistics for columns a, b, c as above and still

Reg:Column Statistics with Parquet

2014-07-24 Thread Sandeep Samudrala
I am trying to enable column statistics usage with Parquet tables. This is the query I am executing. However, on explain, I see that even though *Basic stats: COMPLETE* is shown, *Column stats* is shown as *NONE*. Can someone please explain what else I need to do to debug/fix this? set

Re: Reg:Column Statistics with Parquet

2014-07-24 Thread Prasanth Jayachandran
You have to explicitly specify the column list in the analyze command to gather column stats. This command will only collect basic stats like number of rows, total file size, raw data size, and number of files: analyze table user_table partition(dt='2014-06-01',hour='00') compute statistics; To
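
For reference, a minimal HiveQL sketch of the two forms of the analyze command discussed in this thread. It reuses the table and partition from the message above; the column names a, b, c are placeholders taken from the follow-up reply, and the SELECT in the EXPLAIN is purely illustrative. Whether the optimizer then reports column stats may depend on the Hive version and file format, so treat this as a starting point for debugging rather than a confirmed fix.

```sql
-- Basic stats only (number of rows, number of files, total and raw data size),
-- exactly as in the message above.
ANALYZE TABLE user_table PARTITION (dt='2014-06-01', hour='00')
  COMPUTE STATISTICS;

-- Column stats require the FOR COLUMNS clause with an explicit column list.
-- a, b, c are placeholder column names, not taken from the real schema.
ANALYZE TABLE user_table PARTITION (dt='2014-06-01', hour='00')
  COMPUTE STATISTICS FOR COLUMNS a, b, c;

-- Ask the optimizer to fetch column stats from the metastore,
-- then re-check what the plan reports for "Column stats".
SET hive.stats.fetch.column.stats=true;
EXPLAIN SELECT a FROM user_table WHERE dt='2014-06-01' AND hour='00';
```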