Hi Eva,
   After a close look at the code, I think this is not a bug. We need to find out
how to avoid this.

Thanks,
Yongqiang
On 09-9-9 1:31 PM, "He Yongqiang" <heyongqi...@software.ict.ac.cn> wrote:

> Hi Eva,
>     Can you open a new jira for this?  And let’s discuss and resolve this
> issue. 
> I guess this is because the partition metadata is added before the data is
> available. 
> 
> Thanks
> Yongqiang
> On 09-9-9 1:18 PM, "Eva Tse" <e...@netflix.com> wrote:
> 
>> 
>> We are planning to enable ad-hoc querying on our Hive warehouse. We tested
>> some concurrent queries and found the following issue:
>> 
>> Query 1 – doing ‘insert overwrite table yyy .... partition (dateint = xxx)
>> select ...  from yyy where dateint = xxx’. This is done to merge small files
>> within a partition in table yyy.
>> Query 2 – doing some select on the same table joining another table.
>> 
>> What we found is that query 2 would fail with the following exceptions in
>> multiple reducers.
>> java.io.FileNotFoundException: File does not exist: hdfs://ip-10-251-98-80.ec2.internal:9000/user/hive/dataeng/warehouse/nccp_session_facts/dateint=20090908/hour=9/sessionsFacts_P20090909T021823L20090908T09-r-00006
>>  at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:457)
>>  at org.apache.hadoop.fs.FileSystem.getLength(FileSystem.java:671)
>>  at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1417)
>>  at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1412)
>>  at org.apache.hadoop.mapred.SequenceFileRecordReader.<init>(SequenceFileRecordReader.java:43)
>>  at org.apache.hadoop.mapred.SequenceFileInputFormat.getRecordReader(SequenceFileInputFormat.java:63)
>>  at org.apache.hadoop.hive.ql.io.HiveInputFormat.getRecordReader(HiveInputFormat.java:236)
>>  at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:336)
>>  at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>>  at org.apache.hadoop.mapred.Child.main(Child.java:170)
>> 
>> Is this expected? If so, is there a jira or is it planned to be addressed? We
>> are trying to think of a workaround, but haven’t come up with a good one, as
>> swapping of files would ideally be handled inside Hive.
>> 
>> Please let us know your feedback.
>> 
>> Thanks,
>> Eva.
> 
