subject:"Re\: Populating tables using hive and spark"

Re: Populating tables using hive and spark

2016-08-22 Thread Mich Talebzadeh

Ok This is my test 1) create table in Hive and populate it with two rows hive> create table testme (col1 int, col2 string); OK hive> insert into testme values (1,'London'); Query ID = hduser_20160821212812_2a8384af-23f1-4f28-9395-a99a5f4c1a4a OK hive> insert into testme values (2,'NY'); Query ID

Re: Populating tables using hive and spark

2016-08-22 Thread Nitin Kumar

Hi Furcy, If I execute the command "ANALYZE TABLE TEST_ORC COMPUTE STATISTICS" before checking the count from hive, Hive returns the correct count albeit it does not spawn a map-reduce job for computing the count. I'm running a HDP 2.4 Cluster with Hive 1.2.1.2.4 and Spark 1.6.1 If others can