RE: Issue joining 21 HUGE Hive tables

2016-03-24 Thread Loudongfeng
for columns; set hive.cbo.enable=true; Then try the join with 21 tables. Regards, Nemon From: Sanka, Himabindu [mailto:himabindu_sa...@optum.com] Sent: Thursday, March 24, 2016 9:50 AM To: user@hive.apache.org Subject: Issue joining 21 HUGE Hive tables Hi Team, I need some inputs from you. I have

Re: Issue joining 21 HUGE Hive tables

2016-03-24 Thread Mich Talebzadeh
Posting a typical query that you are using will help to clarify the issue. Also you may use TEMPORARY TABLEs to keep the intermediate stage results. On the face of it you can time every query itself to find out the longest components etc select from_unixtime(unix_timestamp(), 'dd/MM/

Re: Issue joining 21 HUGE Hive tables

2016-03-24 Thread Jörn Franke
Joining so many external tables is always an issue with any component. Your problem is not Hive specific; but your data model seems to be messed up. First of all you should have them in an appropriate format, such as ORC or parquet and the tables should not be external. Then you should use the

RE: Issue joining 21 HUGE Hive tables

2016-03-23 Thread Ryan Harris
failing, you may just need more resources. From: Sanka, Himabindu [mailto:himabindu_sa...@optum.com] Sent: Wednesday, March 23, 2016 7:50 PM To: user@hive.apache.org Subject: Issue joining 21 HUGE Hive tables Hi Team, I need some inputs from you. I have a requirement for my project where I have

Issue joining 21 HUGE Hive tables

2016-03-23 Thread Sanka, Himabindu
Hi Team, I need some inputs from you. I have a requirement for my project where I have to join 21 hive external tables. Out of which 6 tables are HUGE having 500 million records of data. Other 15 tables are smaller ones around 100 to 1000 records each. When I am doing inner joins/ left outer