Re: Using capacity scheduler

2011-05-02 Thread Sreekanth Ramakrishnan
The queue specific configurations are not hive client specific, they have to be configured on JobTracker before JT is started up. All the Hive Cli should try setting is which queue they will want the DAG from hive query to be submitted to. So your capacity-scheduler.xml in $HADOOP_CONF_DIR sho

Export data with column names

2011-05-02 Thread Raghunath, Ranjith
Does anyone know how to export data out with column names? Any help here is appreciated. Thanks, Ranjith

HIVE Server multiple instances

2011-05-02 Thread V.Senthil Kumar
Hello, I have one instance of HIVE JDBC server running on port 1. Can I run another instance on different port ? Would it cause a concurrency issue on the underlying data warehouse files ? Please clarify. Thanks, V.Senthil Kumar

Re: Using capacity scheduler

2011-05-02 Thread Rosanna Man
Hi Sreekanth, When you mention about setting the max task limit, do you mean by executing set mapred.capacity-scheduler.queue..maximum-capacity = ? Is it only available on hadoop 0.21? Thanks, Rosanna On 5/1/11 8:42 PM, "Sreekanth Ramakrishnan" wrote: > > The design goal of CapacitySchedul

Re: Cross join in Hive.

2011-05-02 Thread Raghunath, Ranjith
Very cool. What is the non strict option for? Thanks, Ranjith From: Ashish Thusoo To: Sent: Mon May 02 16:00:39 2011 Subject: Re: Cross join in Hive. you could probably just say (1 = 1) in the on clause for the join. set hive.mapred.mode=nonstrict; select ...

Re: Cross join in Hive.

2011-05-02 Thread Ashish Thusoo
you could probably just say (1 = 1) in the on clause for the join. set hive.mapred.mode=nonstrict; select ... from T1 join T2 on (1 = 1); Ashish On May 1, 2011, at 10:27 PM, Raghunath, Ranjith wrote: Forgot to mentionthe condition for the inner join should be the column set to 1 in the fir

Re: setting job name when issuing a JDBC Hive query

2011-05-02 Thread Edward Capriolo
On Mon, May 2, 2011 at 4:29 PM, Zeus Courtois wrote: > Hi, > I am able to issue Hive queries through the Hive JDBC but I cannot find a > way to set config parameters nor change the job name. > Is there a way to pass/modify the HiveConf when issuing a JDBC statement? > I have tried creating my own

setting job name when issuing a JDBC Hive query

2011-05-02 Thread Zeus Courtois
Hi, I am able to issue Hive queries through the Hive JDBC but I cannot find a way to set config parameters nor change the job name. Is there a way to pass/modify the HiveConf when issuing a JDBC statement? I have tried creating my own Session but had not much success by using: org.apache.hadoop

Can remnant partitions cause Hive to slow down?

2011-05-02 Thread Mayuran Yogarajah
We've noticed that our Hive jobs appear to be getting slower and slower every day even though the data set isn't really growing by much. Here are some run times taken from last month which shows the date and the duration of the job in minutes: 2010/12/31 -> 19.21667 2011/01/31 -> 24.55