Re: hive mapjoin decision process

2011-07-19 Thread Koert Kuipers
thanks. changing mapred.child.java.opts from -Xmx512m to -Xmx1024m did the trick allocating more memory to the On Tue, Jul 19, 2011 at 6:49 PM, yongqiang he wrote: > >> i thought only one table needed to be small? > Yes. > > >> hive.mapjoin.maxsize also apply to big table? > No. > > >> i made s

Re: hive mapjoin decision process

2011-07-19 Thread yongqiang he
>> i thought only one table needed to be small? Yes. >> hive.mapjoin.maxsize also apply to big table? No. >> i made sure hive.mapjoin.smalltable.filesize and hive.mapjoin.maxsize are >> set large enough to accomodate the small table. yet hive does not attempt to >> do a mapjoin. There are phys

Re: hive mapjoin decision process

2011-07-19 Thread Koert Kuipers
thanks! i only see hive create the hashmap dump and perform mapjoin if both tables are small. i thought only one table needed to be small? i try to merge a very large table with a small table. i made sure hive.mapjoin.smalltable.filesize and hive.mapjoin.maxsize are set large enough to accomodate

Re: hive mapjoin decision process

2011-07-19 Thread yongqiang he
in most cases, the mapjoin falls back to normal join because of one of these three reasons: 1) the input table size is very big, so there will be no try on mapjoin 2) if one of the input table is small (let's say less than 25MB which is configurable), hive will try a local hashmap dump. If it cause

remote hive metastore

2011-07-19 Thread Koert Kuipers
i am testing running a remote hive metastore. i understand that the client communicates with the metastore via thrift. now is it the case that the client still communicates with HDFS directly? in the metastore i see logs for all the actions that i perform on the client. but they show up like this:

hive mapjoin decision process

2011-07-19 Thread Koert Kuipers
note: this is somewhat a repost of something i posted on the CDH3 user group. apologies if that is not appropriate. i am exploring map-joins in hive. with hive.auto.convert.join=true hive tries to do a map-join and then falls back on a mapreduce-join if certain conditions are not met. this sounds

Re: Problem in Hadoop(0.20.2) with hive

2011-07-19 Thread Viral Bajaria
Vikas, I don't think it's the ping from name-node that is an issue here, you should run a ping command from data-node to all data-nodes/name-node. Thanks, Viral On Tue, Jul 19, 2011 at 6:50 AM, Edward Capriolo wrote: > > > On Tue, Jul 19, 2011 at 9:46 AM, Vikas Srivastava < > vikas.srivast...@o

Re: Problem in Hadoop(0.20.2) with hive

2011-07-19 Thread Edward Capriolo
On Tue, Jul 19, 2011 at 9:46 AM, Vikas Srivastava < vikas.srivast...@one97.net> wrote: > Hey Edward, > > thanks for responding but i try to ping all the *data-node* from * > name-node* and they all are responding.. > > i won't be able to figure it out where the problem persist. > > query is runnin

Re: Problem in Hadoop(0.20.2) with hive

2011-07-19 Thread Vikas Srivastava
Hey Edward, thanks for responding but i try to ping all the *data-node* from *name-node* and they all are responding.. i won't be able to figure it out where the problem persist. query is running fine when i dont use any map reduce.. but while using and map tasks...its get stuck into that.. Reg

Re: Problem in Hadoop(0.20.2) with hive

2011-07-19 Thread Edward Capriolo
It must be a hostname or DNS problem. Use dig and ping to find out what is wrong. On Tue, Jul 19, 2011 at 9:05 AM, Vikas Srivastava < vikas.srivast...@one97.net> wrote: > > > On Tue, Jul 19, 2011 at 6:29 PM, Vikas Srivastava < > vikas.srivast...@one97.net> wrote: > >> >> HI Team, >>> >>> >>> we a

Re: Problem in Hadoop(0.20.2) with hive

2011-07-19 Thread Vikas Srivastava
On Tue, Jul 19, 2011 at 6:29 PM, Vikas Srivastava < vikas.srivast...@one97.net> wrote: > > HI Team, >> >> >> we are using 1 namenode with 11 Datanode each of (16GB ram and 1.4 tb hdd) >> >> i m getting this error while running any query , simple its not working >> when we use any map tasks. >> >>

RE: ODBC/JDBC Bridge

2011-07-19 Thread Guy Doulberg
Update... I eventually succeed in making the ODBC/JDBC bridge from Easysoft to work. So I made simple reports on top of hive So far so good From: Guy Doulberg [mailto:guy.doulb...@conduit.com] Sent: Monday, July 18, 2011 2:31 PM To: user@hive.apache.org Subject: RE: ODBC/JDBC Bridge I