[ 
https://issues.apache.org/jira/browse/HIVE-4561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008534#comment-14008534
 ] 

Ashutosh Chauhan commented on HIVE-4561:
----------------------------------------

Both testCliDriver_columnstats_tbllvl.q and 
testCliDriver_compute_stats_empty_table.q failed for me on the latest patch 
with the following exception, which is the same one the Hive QA run reported at 
http://ec2-174-129-184-35.compute-1.amazonaws.com/jenkins/job/PreCommit-HIVE-Build/291/testReport/org.apache.hadoop.hive.cli/TestCliDriver/testCliDriver_columnstats_tbllvl/
{code}
java.lang.Exception: java.lang.RuntimeException: Hive Runtime Error while closing operators
        at org.apache.hadoop.mapred.LocalJobRunner$Job.runTasks(LocalJobRunner.java:462)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:522)
Caused by: java.lang.RuntimeException: Hive Runtime Error while closing operators
        at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.close(ExecMapper.java:260)
        at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:61)
        at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:430)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:342)
        at org.apache.hadoop.mapred.LocalJobRunner$Job$MapTaskRunnable.run(LocalJobRunner.java:243)
        at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
        at java.util.concurrent.FutureTask.run(FutureTask.java:262)
        at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
        at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
        at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: java.lang.NullPointerException
        at org.apache.hadoop.hive.ql.exec.GroupByOperator.closeOp(GroupByOperator.java:1141)
        at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:579)
        at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:591)
        at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:591)
        at org.apache.hadoop.hive.ql.exec.Operator.close(Operator.java:591)
        at org.apache.hadoop.hive.ql.exec.mr.ExecMapper.close(ExecMapper.java:227)
        ... 9 more
Caused by: java.lang.NullPointerException
        at org.apache.hadoop.hive.serde2.objectinspector.primitive.WritableIntObjectInspector.get(WritableIntObjectInspector.java:36)
        at org.apache.hadoop.hive.serde2.objectinspector.primitive.PrimitiveObjectInspectorUtils.getInt(PrimitiveObjectInspectorUtils.java:598)
        at org.apache.hadoop.hive.ql.udf.generic.GenericUDAFComputeStats$GenericUDAFLongStatsEvaluator.iterate(GenericUDAFComputeStats.java:468)
        at org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluator.aggregate(GenericUDAFEvaluator.java:183)
        at org.apache.hadoop.hive.ql.exec.GroupByOperator.closeOp(GroupByOperator.java:1132)
{code}
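
The innermost frames suggest that iterate() was handed a null value and unboxed it without checking; that reading is an assumption drawn from the trace alone, not from the patch. Below is a minimal sketch of the kind of null guard that would avoid the NPE. NullSafeLongStats and its fields are hypothetical names for illustration and are not Hive classes.

```java
// Hypothetical stand-in for a long-stats aggregation buffer; not Hive's actual code.
final class NullSafeLongStats {
    long min = Long.MAX_VALUE;   // sentinel: any real value is <= this
    long max = Long.MIN_VALUE;   // sentinel: any real value is >= this
    long countNulls = 0;
    long count = 0;

    // Mirrors the shape of an evaluator's iterate(): the incoming value may be null,
    // so it is checked before being unboxed.
    void iterate(Integer value) {
        if (value == null) {
            countNulls++;        // record the null instead of dereferencing it
            return;
        }
        long v = value;
        min = Math.min(min, v);
        max = Math.max(max, v);
        count++;
    }

    public static void main(String[] args) {
        NullSafeLongStats stats = new NullSafeLongStats();
        stats.iterate(null);     // would throw without the guard
        stats.iterate(7);
        System.out.println(stats.countNulls + " null(s), min=" + stats.min + ", max=" + stats.max);
    }
}
```

Whether a null seen here should count toward NUM_NULLS or simply be skipped is a design decision this sketch does not settle.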

> Column stats :  LOW_VALUE (or HIGH_VALUE) will always be 0.0000 ,if all the 
> column values larger than 0.0 (or if all column values smaller than 0.0)
> ----------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-4561
>                 URL: https://issues.apache.org/jira/browse/HIVE-4561
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 0.12.0, 0.13.0
>            Reporter: caofangkun
>            Assignee: Navis
>         Attachments: HIVE-4561.1.patch, HIVE-4561.2.patch, HIVE-4561.3.patch, 
> HIVE-4561.4.patch.txt, HIVE-4561.5.patch.txt, HIVE-4561.6.patch.txt
>
>
> If all column values are larger than 0.0, DOUBLE_LOW_VALUE will always be 0.0; 
> or if all column values are less than 0.0, DOUBLE_HIGH_VALUE will always be 0.0.
> hive (default)> create table src_test (price double);
> hive (default)> load data local inpath './test.txt' into table src_test;
> hive (default)> select * from src_test;
> OK
> 1.0
> 2.0
> 3.0
> Time taken: 0.313 seconds, Fetched: 3 row(s)
> hive (default)> analyze table src_test compute statistics for columns price;
> mysql> select * from TAB_COL_STATS \G;
>                  CS_ID: 16
>                DB_NAME: default
>             TABLE_NAME: src_test
>            COLUMN_NAME: price
>            COLUMN_TYPE: double
>                 TBL_ID: 2586
>         LONG_LOW_VALUE: 0
>        LONG_HIGH_VALUE: 0
>       DOUBLE_LOW_VALUE: 0.0000   # Wrong Result ! Expected is 1.0000
>      DOUBLE_HIGH_VALUE: 3.0000
>  BIG_DECIMAL_LOW_VALUE: NULL
> BIG_DECIMAL_HIGH_VALUE: NULL
>              NUM_NULLS: 0
>          NUM_DISTINCTS: 1
>            AVG_COL_LEN: 0.0000
>            MAX_COL_LEN: 0
>              NUM_TRUES: 0
>             NUM_FALSES: 0
>          LAST_ANALYZED: 1368596151
> 2 rows in set (0.00 sec)
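
The symptom in the quoted report is consistent with a running minimum and maximum that both start at 0.0. That cause is an assumption, since the report does not show the code. The sketch below reproduces the behavior with the three values from the report and contrasts it with sentinel initialization. DoubleRange and both method names are hypothetical.

```java
// Hypothetical illustration of the reported symptom; not Hive's actual code.
final class DoubleRange {
    // Zero-initialized bounds: min can never rise above 0.0 and max can never fall below it.
    static double[] rangeWithZeroInit(double[] values) {
        double min = 0.0;
        double max = 0.0;
        for (double v : values) {
            min = Math.min(min, v);
            max = Math.max(max, v);
        }
        return new double[] {min, max};
    }

    // Sentinel-initialized bounds: the first value always replaces both sentinels.
    static double[] rangeWithSentinelInit(double[] values) {
        double min = Double.POSITIVE_INFINITY;
        double max = Double.NEGATIVE_INFINITY;
        for (double v : values) {
            min = Math.min(min, v);
            max = Math.max(max, v);
        }
        return new double[] {min, max};
    }

    public static void main(String[] args) {
        double[] prices = {1.0, 2.0, 3.0};   // same rows as src_test in the report
        double[] buggy = rangeWithZeroInit(prices);
        double[] fixed = rangeWithSentinelInit(prices);
        System.out.println("zero init:     low=" + buggy[0] + " high=" + buggy[1]);
        System.out.println("sentinel init: low=" + fixed[0] + " high=" + fixed[1]);
    }
}
```

With zero initialization the low value stays at 0.0 for all-positive input, matching the DOUBLE_LOW_VALUE shown above; with sentinels it becomes 1.0. Note that sentinel initialization leaves the bounds at infinity for empty input, so an empty table needs separate handling.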



--
This message was sent by Atlassian JIRA
(v6.2#6252)
