Hi,
I have a question on the Hive optimizer. I have a table with partition columns,
e.g., Sales partitioned by year, month, day. Assume that I have two years'
worth of data in this table. I'm running two queries on this table.
Query 1: Select * from Sales where year=2015 and month = 5 and day between 1
it filtering or scanning all table.
Of course, it is a lazy approach, but you can give it a try.
I think query 1 should work fine, because I run a lot of queries of that
kind and they work fine for me.
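One way to check this, as a sketch: EXPLAIN DEPENDENCY lists the partitions a
query will read without running it. This assumes a Hive release recent enough
to support it. The upper bound of the day range below is a placeholder, since
the original query is cut off after "between 1".

```sql
-- Shows the input tables and partitions the query depends on.
-- The upper bound 7 is an assumed placeholder; the original range is truncated.
EXPLAIN DEPENDENCY
SELECT * FROM Sales
WHERE year = 2015 AND month = 5 AND day BETWEEN 1 AND 7;
```

If partition pruning is applied, the input_partitions list in the output
should contain only the year=2015/month=5 partitions for those days rather
than every partition of the table.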
Thanks,
mungeol
On Fri, May 15, 2015 at 8:31 AM, Appan Thirumaligai
appanhiv...@gmail.com wrote
Create a new table and copy the data from your source table, applying
lower(col1) as you copy; then drop the old table and rename the new table to
the old table's name.
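A minimal sketch of those steps, assuming an unpartitioned table. The names
mytable_new and col2 are hypothetical: the first is a staging name, the second
stands in for the remaining columns, which the original query does not show.

```sql
-- Copy the data into a new table, lowercasing col1 on the way.
-- mytable_new and col2 are illustrative names, not from the original thread.
CREATE TABLE mytable_new AS
SELECT lower(col1) AS col1, col2
FROM mytable;

-- Replace the old table with the new one.
DROP TABLE mytable;
ALTER TABLE mytable_new RENAME TO mytable;
```

Note that DROP TABLE deletes the data of a managed table, so it is worth
verifying the copy first. For a partitioned table, an explicit CREATE TABLE
followed by INSERT is likely needed instead of CREATE TABLE AS SELECT on
older Hive versions.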
On Fri, Mar 16, 2012 at 3:56 AM, Richard codemon...@163.com wrote:
If I want to update a table, e.g.,
insert overwrite table mytable
select lower(col1),
I'm running a bunch of Hive queries (20+ and adding more) through Oozie and
they are all in one file :-( Is there a way I can tell Oozie to run them
all simultaneously without writing a separate Oozie action for each one of
them?
Thanks!
Andy,
Here is a short-term workaround: if your goal is to back up data in SQL Server,
just copy the data out of Hadoop and import it into SQL Server (DTS / BCP / SSIS,
depending on which version you are on). Email Viral (I guess he is still active
on the email list) and I'm sure he is using Hive, Hadoop
BY A.some_string, B.some_string
Does Hive not like to join on the partitioned columns? When I create a join
on just the partitioned column, the reduce step never finishes.
I am using Hive 0.5.0.
Thanks,
Viral
Appan Thirumaligai
ap...@ngmoco.com
Ph:1-818-472-8427
ngmoco:)