agreed but it still needs to know where the hive top node directory starts
from, which is normally under ../../ warehouse

Dr Mich Talebzadeh



LinkedIn * 
https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
<https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*



http://talebzadehmich.wordpress.com



On 19 May 2016 at 20:32, Al Pivonka <alpivo...@gmail.com> wrote:

> Read about hive partitions and bucketing.
> Since your location has multiple directories, Hive needs to know how to
> traverse them..
>
> Hope this helps.
> On May 19, 2016 5:51 AM, "Abhishek Dubey" <abhishek.du...@xoriant.com>
> wrote:
>
>> Hi,
>>
>>
>>
>> In hdfs I have a directory structure like this.
>>
>> /user/hdfs/Data/Data1/File1
>>
>> /user/hdfs/Data/Data2/File2
>>
>>
>>
>> And I am creating an external table like:
>>
>> CREATE external TABLE db.tablename
>>
>> (
>>
>> amt1 STRING,
>>
>> amt2 STRING,
>>
>> amt3 STRING
>>
>> )
>>
>> row format delimited
>>
>> fields terminated by ','
>>
>> location '/user/hdfs/Data/';
>>
>>
>>
>> Also, I have set two properties:
>>
>> set mapred.input.dir.recursive=true;
>>
>> set hive.mapred.supports.subdirectories=true;
>>
>>
>>
>> This setup is working perfectly fine on my local single node vm, Having
>> all vanilla apache installations and setup,
>>
>>
>>
>> But. on cloudera 5.3.3 cluster of 4 nodes, above mentioned properties for
>> recursive lookup of sub directories for an external hive table is not
>> working.
>>
>> In the Cloudera manager i have added the properties in Hive-Site.xml,
>> deployed configuration and restarted Hive service but still not working.
>>
>> <property>
>>
>>   <name> mapred.input.dir.recursive</name>
>>
>>   <value>true</value>
>>
>> </property>
>>
>> <property>
>>
>>   <name>hive.mapred.supports.subdirectories</name>
>>
>>   <value>true</value>
>>
>> </property>
>>
>>
>>
>> When querying select *  on CDH What i get is this, Zero rows.
>>
>> hive> select * from tablename;
>>
>> OK
>>
>> Time taken: 0.322 seconds
>>
>> hive>
>>
>>
>>
>> Whereas on local vm it is giving desired output.
>>
>>
>>
>> Is there anything else on CDH that we need to take care to pick data from
>> subdirectories into hive table?
>>
>>
>>
>> Thanks in advance.
>> *Abhishek Dubey*
>>
>>
>>
>>
>>
>

Reply via email to