Re: Issue with loading the Snappy Codec

2012-04-15 Thread john smith
Can you restart tasktrackers once and run the job again? It refreshes the class path. On Sun, Apr 15, 2012 at 11:58 AM, Bas Hickendorff hickendorff...@gmail.comwrote: Thanks. The native snappy libraries I have installed. However, I use the normal jars that you get when downloading Hadoop, I

Reduce shuffle bytes in GUI

2011-09-27 Thread john smith
Hey folks, I have my job tracker GUI which shows a lot of information about the running/completed jobs. I am interested in the field Reduce shuffle bytes. I want to know how it is computed... Is it just the sum of all the bytes received per reducer during shuffle ? Any help? Thanks

Re: Reducer hanging ( swapping? )

2011-09-22 Thread john smith
: 2GB for a task tracker? Here are some possible thoughts. Compress map output. Change mapred.reduce.slowstart.completed.maps By the way I see no swapping. Anything interesting from the task tracker log? System log? Raj From: john smith js1987.sm

Reducer hanging ( swapping? )

2011-09-21 Thread john smith
Hi Folks, I am running hive on a 10 node cluster. Since my hive queries have joins in them, their reduce phases are a bit heavy. I have 2GB RAM on each TT . The problem is that my reducer hangs at 76% for a large amount of time. I guess this is due to excessive swapping from disk to memory. My

Out of heap space errors on TTs

2011-09-19 Thread john smith
Hey guys, I am running hive and I am trying to join two tables (2.2GB and 136MB) on a cluster of 9 nodes (replication = 3) Hadoop version - 0.20.2 Each data node memory - 2GB HADOOP_HEAPSIZE - 1000MB other heap settings are defaults. My hive launches 40 Maptasks and every task failed with the

Re: Out of heap space errors on TTs

2011-09-19 Thread john smith
by default it will be 200mb. But your io.sort.mb(300) is more than that. So, configure more heap space for child tasks. ex: -Xmx512m Regards, Uma - Original Message - From: john smith js1987.sm...@gmail.com Date: Monday, September 19, 2011 6:14 pm Subject: Out of heap space errors

Re: Out of heap space errors on TTs

2011-09-19 Thread john smith
...@gmail.com wrote: John Can you share the hive QL you are using for joins? Regards Bejoy K S -Original Message- From: john smith js1987.sm...@gmail.com Date: Mon, 19 Sep 2011 19:02:02 To: common-user@hadoop.apache.org Reply-To: common-user@hadoop.apache.org Subject: Re: Out

Ganglia for hadoop monitoring

2011-09-17 Thread john smith
Hi all, First of all, ganglia integration with hadoop is an awesome feature. Kudos to the hadoop devs. Unfortunately its not working out for me. I am unable to see hadoop specific metrics in my ganglia frontend. My configurations are as follows: gmetad.coinf : data_source hadoop test host-name

Re: Datanodes going down frequently

2011-09-16 Thread john smith
or not. --Aaron -Original Message- From: john smith [mailto:js1987.sm...@gmail.com] Sent: Thursday, September 15, 2011 9:46 PM To: common-user@hadoop.apache.org Subject: Re: Datanodes going down frequently Hi All, Thanks for your inputs, @Aaron : No, they aren't recovering

Datanodes going down frequently

2011-09-15 Thread john smith
Hi all, I am running a 10 node cluster (1NN + 9DN, ubuntu server 10.04, 2GB RAM each). I am facing a strange problem. My datanodes go down randomly and nothing showup in the logs. They lose their network connectivity suddenly and NN declares them as dead. Any one faced this problem? Is it because

Re: Datanodes going down frequently

2011-09-15 Thread john smith
of blocks its gotta report (which would equate to a small files issue given your cluster size possibly, but that's a different discussion). On Fri, Sep 16, 2011 at 3:36 AM, john smith js1987.sm...@gmail.com wrote: Hi all, I am running a 10 node cluster (1NN + 9DN, ubuntu server 10.04, 2GB RAM

Re: Disable Sorting?

2011-09-11 Thread john smith
, but a use-case will help. IAC, the only way to achieve what you are trying to do is to run to jobs with the first a map-only job (i.e. #reduces = 0). Arun On Sep 10, 2011, at 10:19 PM, john smith wrote: Hey, I have reduce phases too. But for each reduce, I dont need sorted input (map

Disable Sorting?

2011-09-10 Thread john smith
Hi, Some of the MR jobs I run doesn't need sorting of map-output in each partition. Is there someway I can disable it? Any help? Thanks jS

Re: Disable Sorting?

2011-09-10 Thread john smith
a map-only job with #reduces set to 0. Arun On Sep 10, 2011, at 2:06 AM, john smith wrote: Hi, Some of the MR jobs I run doesn't need sorting of map-output in each partition. Is there someway I can disable it? Any help? Thanks jS

Map phase hanging for wordcount example

2011-09-06 Thread john smith
Hi Folks, I am working on a 3 node cluster (1 NN + 2 DNs) . I loaded some test data with replication factor 3 (around 400MB data). However when I run wordcount example , it hangs at map 0%. bin/hadoop jar hadoop-examples-0.20.3-SNAPSHOT.jar wordcount /test_data /out2 11/09/06 13:07:28 INFO

Re: Map phase hanging for wordcount example

2011-09-06 Thread john smith
check the tail of JobTracker logs after a submit is done to see if an error/warn there is causing this. And then dig further on why/what/how. Hard to tell what your problem specifically is without logs :) On Tue, Sep 6, 2011 at 1:18 PM, john smith js1987.sm...@gmail.com wrote: Hi Folks, I

Re: Map phase hanging for wordcount example

2011-09-06 Thread john smith
at 3:00 PM, john smith js1987.sm...@gmail.com wrote: Hi Harsh, My jt log : http://pastebin.com/rXAEeDkC I have some startup exceptions (which doesn't matter much I guess) but the tail indicates that its locating the splits correctly and then it hangs ! Any idea? Thanks On Tue, Sep 6

Re: Map phase hanging for wordcount example

2011-09-06 Thread john smith
That should do it, so long as the slave hosts can freely access the master hosts (no blockage of ports via firewall and such). On Tue, Sep 6, 2011 at 3:05 PM, john smith js1987.sm...@gmail.com wrote: Hey My TT logs show this , 2011-09-06 13:22:41,860 ERROR org.apache.hadoop.mapred.TaskTracker

Out of Memory Exception while building hadoop

2011-09-04 Thread john smith
Hey folks, Strangely I get a out of memory exception while building hadoop from source. I have 2gigs of ram and I've tried building it from both eclipse and commandline http://pastebin.com/9pcHg1P9 is the full stack trace. Can anyone help me out on this? Thanks, John Smith

getSplits() in TableInputFormatBase

2010-04-11 Thread john smith
Hi , In the method public org.apache.hadoop.mapred.InputSplit[] *getSplits* (org.apache.hadoop.mapred.JobConf job, int numSplits) how is the numSplits decided ? I've seen differnt values of numSplits for different MR jobs . Any reason for

Re: getSplits() in TableInputFormatBase

2010-04-11 Thread john smith
startKeys.length? startKeys.length: numSplits; Here startKeys.length is the number of regions... Am I true? Thanks j.S On Sun, Apr 11, 2010 at 1:33 PM, Amandeep Khurana ama...@gmail.com wrote: The number of splits is equal to the number of regions... On Sun, Apr 11, 2010 at 12:54 AM, john smith

Re: getSplits() in TableInputFormatBase

2010-04-11 Thread john smith
. If you keep it lower, then it combines regions in a single split. Amandeep Khurana Computer Science Graduate Student University of California, Santa Cruz On Sun, Apr 11, 2010 at 1:15 AM, john smith js1987.sm...@gmail.com wrote: Amandeep, I guess that is not true ,.. See the explanation

Re: getSplits() in TableInputFormatBase

2010-04-11 Thread john smith
, 2010 at 1:39 AM, john smith js1987.sm...@gmail.com wrote: Amandeep , Thanks for the explanation . What is the default value to the num of maps ? Is it not equal to the num of regions ? Right now I am running HBase in pseudo distributed mode . If I set num of map tasks to 10

Re: getSplits() in TableInputFormatBase

2010-04-11 Thread john smith
? are you counting root and meta also? Amandeep Khurana Computer Science Graduate Student University of California, Santa Cruz On Sun, Apr 11, 2010 at 1:57 AM, john smith js1987.sm...@gmail.com wrote: From the web interface... number of regions =5 number of tables = 3 Thanks

Region not getting served

2010-04-11 Thread john smith
Hi all, I wrote my own getSplits() function for HBase-MR . A is a table involved in MR . I am getting the following stack trace. It seems that it couldn't access the region. But my region server is up and running. Does it indicate that my splitting is wrong? http://pastebin.com/YBK4JQBu Thanks

Re: Region not getting served

2010-04-11 Thread john smith
your splitting isn't wrong, the region server is trying to parse the column family and there's something null where it shouldn't be. J-D On Sun, Apr 11, 2010 at 10:55 AM, john smith js1987.sm...@gmail.com wrote: Hi all, I wrote my own getSplits() function for HBase-MR . A is a table

Re: Region not getting served

2010-04-11 Thread john smith
at 6:07 PM, john smith js1987.sm...@gmail.com wrote: J.D. Thanks for replying. My hbase version is 0.19.3. Because I wrote many codes for this version, I haven't updated it. Also i'll check if therz any problem with my column family naming ..such as missing : etc and I'll let you know

Re: Region not getting served

2010-04-11 Thread john smith
and raising NullPointer exception? I am not sure though.. Any help is appreciated! Thanks.. On Sun, Apr 11, 2010 at 11:02 PM, john smith js1987.sm...@gmail.com wrote: J.D, I tried working with the 0.20+ branch of hadoop and Hbase. I changed my build paths in eclipse and I found out the following

Region assignment in Hbase

2010-03-29 Thread john smith
Hi all, I read the issue HBase-57 ( https://issues.apache.org/jira/browse/HBASE-57 ) . I don't really understand the use of assigning regions keeping DFS in mind. Can anyone give an example usecase showing its advantages . Can map-reduce exploit it's advantage in any way (if data is distributed

Re: Region assignment in Hbase

2010-03-29 Thread john smith
J-D thanks for your reply. I have some doubts which I posted inline . Kindly help me On Tue, Mar 30, 2010 at 2:23 AM, Jean-Daniel Cryans jdcry...@apache.orgwrote: Inline. J-D On Mon, Mar 29, 2010 at 11:45 AM, john smith js1987.sm...@gmail.com wrote: Hi all, I read the issue HBase-57

Table size from API

2010-01-27 Thread john smith
Hi guys, Is there a way to get the rowcount of the table from java API .. I looked at HTable and HTableDesc... but I couldn't find it .. It is similar to count tablename frm Hbase shell. Thanks J-S

Re: MR in HBase

2010-01-09 Thread john smith
delegate.createRecordReader(); else if (split for table2) return delegate.createRecordReader(); else throw exception } } --- end pseudo code --- Regards, Mridul john smith wrote: Mridul Can you be more clear .. I didn't get you ! On Fri, Jan 8, 2010 at 6:13 PM, Mridul

Re: MR in HBase

2010-01-08 Thread john smith
, 2010 at 7:51 PM, john smith js1987.sm...@gmail.com wrote: Hi all, My requirement is that , I must read two tables (belonging to the same region server) in the same Map . Normally TableMap supports only 1 table at a time and right now I am reading the entire 2nd table in any one

Re: MR in HBase

2010-01-08 Thread john smith
to both tables input formats ? Regards, Mridul john smith wrote: Stack, The requirement is that I need to I need to scan two tables A,B for an MR job ,Order is not important . That is , the reduce phase contains both keys from both A,B. Presently what iam doing is that I am using

MR in HBase

2010-01-07 Thread john smith
Hi all, My requirement is that , I must read two tables (belonging to the same region server) in the same Map . Normally TableMap supports only 1 table at a time and right now I am reading the entire 2nd table in any one of the maps , This is a big overhead . So can any one suggest some

Doubt in HBase

2009-08-20 Thread john smith
Hi all , I have one small doubt . Kindly answer it even if it sounds silly. Iam using Map Reduce in HBase in distributed mode . I have a table which spans across 5 region servers . I am using TableInputFormat to read the data from the tables in the map . When i run the program , by default how

Re: Doubt in HBase

2009-08-20 Thread john smith
as essentially a grid scheduler -- something like job.setNumReducers(0) will do the trick. Best regards, - Andy From: john smith js1987.sm...@gmail.com To: hbase-user@hadoop.apache.org Sent: Friday, August 21, 2009 12:42:36 AM Subject: Doubt

Re: Implementing TableMap interface

2009-07-24 Thread john smith
Yes i too have the same problem .. Can anyone tell me in detail how to add new classes to the existing hbase jar or do we have a different method to include our own classes in the program .. On 7/24/09, bharath vissapragada bharathvissapragada1...@gmail.com wrote: Thanks it worked fine .. Do i

Re: Implementing TableMap interface

2009-07-24 Thread john smith
2009/7/24 john smith js1987.sm...@gmail.com: Yes i too have the same problem .. Can anyone tell me in detail how to add new classes to the existing hbase jar or do we have a different method to include our own classes in the program .. On 7/24/09, bharath vissapragada bharathvissapragada1