subject:"Populating tables using hive and spark"

Fwd: Populating tables using hive and spark

2016-08-26 Thread Timur Shenkao

Hello! I just wonder: do you (both of you) use the same user for HIVE & Spark? Or different ? Do you use Kerberized Hadoop? On Mon, Aug 22, 2016 at 2:20 PM, Mich Talebzadeh wrote: > Ok This is my test > > 1) create table in Hive and populate it with two rows > >

Re: Populating tables using hive and spark

2016-08-22 Thread Mich Talebzadeh

Ok This is my test 1) create table in Hive and populate it with two rows hive> create table testme (col1 int, col2 string); OK hive> insert into testme values (1,'London'); Query ID = hduser_20160821212812_2a8384af-23f1-4f28-9395-a99a5f4c1a4a OK hive> insert into testme values (2,'NY'); Query ID

Re: Populating tables using hive and spark

2016-08-22 Thread Nitin Kumar

Hi Furcy, If I execute the command "ANALYZE TABLE TEST_ORC COMPUTE STATISTICS" before checking the count from hive, Hive returns the correct count albeit it does not spawn a map-reduce job for computing the count. I'm running a HDP 2.4 Cluster with Hive 1.2.1.2.4 and Spark 1.6.1 If others can

Populating tables using hive and spark

2016-08-22 Thread Nitin Kumar

Hi! I've noticed that hive has problems in registering new data records if the same table is written to using both the hive terminal and spark sql. The problem is demonstrated through the commands listed below hive> use