Help IN CHD

2011-08-11 Thread Vikas Srivastava
Hey All,


Please tell me where to enter datanode IP's in CHD3U2 , actally i installed
all the components in namenode and datanode but confuse where to put
datanode IPS in namenode so thet they get connected.


-- 
With Regards
Vikas Srivastava

DWH  Analytics Team
Mob:+91 9560885900
One97 | Let's get talking !


Re: Help IN CHD

2011-08-11 Thread Harsh J
Vikas,

This question belongs to Hadoop's lists. I'm moving it to
hdfs-u...@hadoop.apache.org.

To answer your question:

DN hostnames must exist in the dfs.hosts pointed file if you want
selective inclusion. Else you just have to start the DN with the right
config and network access to the NN, and it will automatically join
the cluster.

On Thu, Aug 11, 2011 at 6:26 PM, Vikas Srivastava
vikas.srivast...@one97.net wrote:
 Hey All,


 Please tell me where to enter datanode IP's in CHD3U2 , actally i installed
 all the components in namenode and datanode but confuse where to put
 datanode IPS in namenode so thet they get connected.


 --
 With Regards
 Vikas Srivastava

 DWH  Analytics Team
 Mob:+91 9560885900
 One97 | Let's get talking !




-- 
Harsh J


Reducer Issue in New Setup

2011-08-11 Thread hadoop n00b
Hello,

We have just setup Hive on a new Hadoop cluster.

When I run a select * on a table, it works fine but when I run any query
which needs a reducer, like count(1) or a where condition, the query just
sits there doing nothing (map 0%). I see some message like no reducers to
run. How do I fix this? The configuration et al is  exactly like another
environment that is working fine.

We are running CDH2 with Hadoop 0.20.1 and Hive 0.4.1

Cheers!


RE: Reducer Issue in New Setup

2011-08-11 Thread Travis Powell
Have you checked your logs? These are often the best places to start.

Look at the running job and click on the running count, the current
task, then the task logs.

 

Sometimes they're helpful, sometimes they're not.

 

http://hadoop-master:50030/jobtracker.jsp

 

Travis Powell / tpow...@tealeaf.com 

 

From: hadoop n00b [mailto:new2h...@gmail.com] 
Sent: Thursday, August 11, 2011 9:32 AM
To: user@hive.apache.org
Subject: Reducer Issue in New Setup

 

Hello,

 

We have just setup Hive on a new Hadoop cluster. 

 

When I run a select * on a table, it works fine but when I run any query
which needs a reducer, like count(1) or a where condition, the query
just sits there doing nothing (map 0%). I see some message like no
reducers to run. How do I fix this? The configuration et al is  exactly
like another environment that is working fine.

 

We are running CDH2 with Hadoop 0.20.1 and Hive 0.4.1

 

Cheers!



Re: Reducer Issue in New Setup

2011-08-11 Thread Loren Siebert
Can you run normal MR jobs, like the example Pi calculation? Sometimes a 
no-reducer problem stems from DNS issues— reducers use node names, not IP 
addresses, so you need to have each machine knows how to resolve the names of 
all the other machines in the cluster.

If it's a new cluster, you may want to look at CDH3, as it is a pretty big leap 
forward.

On Aug 11, 2011, at 9:35 AM, Travis Powell wrote:

 Have you checked your logs? These are often the best places to start.
 Look at the running job and click on the running count, the current task, 
 then the task logs.
  
 Sometimes they’re helpful, sometimes they’re not.
  
 http://hadoop-master:50030/jobtracker.jsp
  
 Travis Powell / tpow...@tealeaf.com
  
 From: hadoop n00b [mailto:new2h...@gmail.com] 
 Sent: Thursday, August 11, 2011 9:32 AM
 To: user@hive.apache.org
 Subject: Reducer Issue in New Setup
  
 Hello,
  
 We have just setup Hive on a new Hadoop cluster.
  
 When I run a select * on a table, it works fine but when I run any query 
 which needs a reducer, like count(1) or a where condition, the query just 
 sits there doing nothing (map 0%). I see some message like no reducers to 
 run. How do I fix this? The configuration et al is  exactly like another 
 environment that is working fine.
  
 We are running CDH2 with Hadoop 0.20.1 and Hive 0.4.1
  
 Cheers!



RE: Reducer Issue in New Setup

2011-08-11 Thread Aggarwal, Vaibhav
Are you using a custom scheduler?
I have seen issues with jobs having 0 mappers and 1 reducer with Fair scheduler.

From: hadoop n00b [mailto:new2h...@gmail.com]
Sent: Thursday, August 11, 2011 9:32 AM
To: user@hive.apache.org
Subject: Reducer Issue in New Setup

Hello,

We have just setup Hive on a new Hadoop cluster.

When I run a select * on a table, it works fine but when I run any query which 
needs a reducer, like count(1) or a where condition, the query just sits there 
doing nothing (map 0%). I see some message like no reducers to run. How do I 
fix this? The configuration et al is  exactly like another environment that is 
working fine.

We are running CDH2 with Hadoop 0.20.1 and Hive 0.4.1

Cheers!


Running Hive from Eclipse

2011-08-11 Thread john smith
Hi folks,

I am trying to run Hive from eclipse. I've set it up correctly and it is
building the jars and stuff. However I face execeptions when I try to run
hive queries like show tables etc. There  has been a discussion on this in
the mailing list previously but there was no solution provided. It runs
perfectly from command line .

I am making a few changes to the hive source and every time I need to jar it
from the command line and run it .Is there some way to run it directly from
eclipse?

Please help,

Thanks,
JS


Re: Running Hive from Eclipse

2011-08-11 Thread Carl Steinbach
Hi John,

Can you please include the error messages/exceptions that you're
encountering?

Thanks.

Carl

On Thu, Aug 11, 2011 at 1:40 PM, john smith js1987.sm...@gmail.com wrote:

 Hi folks,

 I am trying to run Hive from eclipse. I've set it up correctly and it is
 building the jars and stuff. However I face execeptions when I try to run
 hive queries like show tables etc. There  has been a discussion on this
 in
 the mailing list previously but there was no solution provided. It runs
 perfectly from command line .

 I am making a few changes to the hive source and every time I need to jar
 it
 from the command line and run it .Is there some way to run it directly from
 eclipse?

 Please help,

 Thanks,
 JS



multiple tables join with only one hug table.

2011-08-11 Thread Daniel,Wu
if the retailer fact table is sale_fact with 10B rows, and join with 3 small 
tables: stores (10K), products(10K), period (1K). What's the best join solution?

In oracle, it can first build hash for stores, and hash for products, and hash 
for stores. Then probe using the fact table, if the row matched in stores, that 
row can go up further to map with products by hashing check, if pass, then go 
up further to try to match period. In this way, the sale_fact only needs to be 
scanned once which save lots of disk IO.  Is this doable in hive, if doable, 
what hint need to use?


Re: Running Hive from Eclipse

2011-08-11 Thread john smith
Hi Carl,

This is the stack trace I get .. http://pastebin.com/3pASqvDq

I configured mysql as my metastore and its perfectly getting updated when
ever I am adding tables via commandline.

Also one more thing is ..I am not getting any log statements while using
command line . I haven't messed up with log4j props but I wonder why this is
happening.

THanks

On Fri, Aug 12, 2011 at 2:12 AM, Carl Steinbach c...@cloudera.com wrote:

 Hi John,

 Can you please include the error messages/exceptions that you're
 encountering?

 Thanks.

 Carl


 On Thu, Aug 11, 2011 at 1:40 PM, john smith js1987.sm...@gmail.comwrote:

 Hi folks,

 I am trying to run Hive from eclipse. I've set it up correctly and it is
 building the jars and stuff. However I face execeptions when I try to run
 hive queries like show tables etc. There  has been a discussion on this
 in
 the mailing list previously but there was no solution provided. It runs
 perfectly from command line .

 I am making a few changes to the hive source and every time I need to jar
 it
 from the command line and run it .Is there some way to run it directly
 from
 eclipse?

 Please help,

 Thanks,
 JS





Re: Running Hive from Eclipse

2011-08-11 Thread john smith
Hi,

See in the line that log4j props is not in found .. I added Hive_conf dir to
the classpath while running and now I get this trace ..

http://pastebin.com/vXs98aZ5

I am completely clueless !

Thanks
JS



On Fri, Aug 12, 2011 at 9:54 AM, john smith js1987.sm...@gmail.com wrote:

 Hi Carl,

 This is the stack trace I get .. http://pastebin.com/3pASqvDq

 I configured mysql as my metastore and its perfectly getting updated when
 ever I am adding tables via commandline.

 Also one more thing is ..I am not getting any log statements while using
 command line . I haven't messed up with log4j props but I wonder why this is
 happening.

 THanks


 On Fri, Aug 12, 2011 at 2:12 AM, Carl Steinbach c...@cloudera.com wrote:

 Hi John,

 Can you please include the error messages/exceptions that you're
 encountering?

 Thanks.

 Carl


 On Thu, Aug 11, 2011 at 1:40 PM, john smith js1987.sm...@gmail.comwrote:

 Hi folks,

 I am trying to run Hive from eclipse. I've set it up correctly and it is
 building the jars and stuff. However I face execeptions when I try to run
 hive queries like show tables etc. There  has been a discussion on this
 in
 the mailing list previously but there was no solution provided. It runs
 perfectly from command line .

 I am making a few changes to the hive source and every time I need to jar
 it
 from the command line and run it .Is there some way to run it directly
 from
 eclipse?

 Please help,

 Thanks,
 JS