Hi  dear drill devloper,    Now we are deploy the 0.7 version drill for 
statistics analysis. I found that the parquet file store the column summary 
info in pageheader (like min,max,count and so on), but in the datareader these 
info seems not to be used for pre-filtering files. For example, when I search 
the records that attribute_A = 10, if the column's (min,max) =(1,9) , skip to 
scan the data seems the best choice. I want to check if drill will do this 
operation in analysis process.btw:In TableStatsCalculator.getRegionSizeInBytes 
method, if avgRowSizeInBytes is to large, the return value will be out of int 
range. So the code should be fixed like "return 
((long)avgRowSizeInBytes)*1024L*1024L".                                         
                                                                                
                                   Thanks&Regards

Reply via email to