Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
Thanks, I found it. I was looking in the upstream branch instead of the stumbleupon branch in my local repo. 2010/9/30 Alexey Kovyrin : > http://github.com/stumbleupon/hbase/blob/master/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java#L1418 >

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
Ohh... sorry, I found it. Thanks for the patch. 2010/9/30 Andrey Stepachev : > Against what version this patch.  I can't find "Only 1 KV, does" in > any stumbleupon or > upstream repositories > > > 2010/9/29 Jean-Daniel Cryans : >> The fix is here: http://pastebin.com/zuL23e0U >> >> We're going to do a

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Alexey Kovyrin
http://github.com/stumbleupon/hbase/blob/master/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java#L1418 On Thu, Sep 30, 2010 at 1:17 AM, Andrey Stepachev wrote: > Against what version this patch.  I can't find "Only 1 KV, does" in > any stumbleupon or > upstream repositories > > > 2

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
Against what version is this patch? I can't find "Only 1 KV, does" in any stumbleupon or upstream repositories. 2010/9/29 Jean-Daniel Cryans : > The fix is here: http://pastebin.com/zuL23e0U > > We're going to do a push to github later today, along with other > patches that require more testing. > >

New HBase YCSB changes - improves speed drastically

2010-09-29 Thread Ryan Rawson
Hi, There is a new commit to YCSB: http://github.com/brianfrankcooper/YCSB http://github.com/brianfrankcooper/YCSB/commit/6676f080d4c624eeb0ae56b548c8786742be3be3 This fixes performance problems in the HBase DB adapter. In my own tests I found that my short scans, which were configured to read

Re: HBase Eclipse compilation problem

2010-09-29 Thread Ryan Rawson
HBase is a maven project, please use m2eclipse: http://m2eclipse.sonatype.org/ -ryan 2010/9/29 Petrucci Andreas : > > Hi there, i'm trying to compile HBase in Eclipse so as to modify it. However, > i can't compile it due to error appearing. I followed the instructions from > the HBase wiki si

Re: Upgrading 0.20.6 -> 0.89

2010-09-29 Thread Dmitriy Lyubimov
Ok, thank you, Ryan. I certainly can't claim a deep intrinsic knowledge of GC; I just read Sun's GC guide, which doesn't seem to draw that clear a distinction between full GC and tenured gen GC, and since then acted on their examples given for figuring pause times -- which seem to refer to tenured g

Re: Upgrading 0.20.6 -> 0.89

2010-09-29 Thread Ryan Rawson
FullGC is not the CMS cycle... During a FullGC the entire main heap is rewritten and compacted. This can happen due to fragmentation issues, and can happen if the CMS cycle does not finish releasing enough memory before the minor GC needs to promote (aka: concurrent promotion failure, or something

Re: region doesn't split after 32+ GB

2010-09-29 Thread Andrew Purtell
Matt, Since you are using ZooKeeper already, conceivably you could keep a hosts file in ZooKeeper somewhere, use a strategy for updates similar to what is done for implementing locking to ensure a new slave gets and updates the latest version "atomically", and use Twitcher to trigger updates on

Re: region doesn't split after 32+ GB

2010-09-29 Thread Matt Corgan
Thanks for your help again, Stack... sorry I don't have the logs. I will do a better job of saving them. By the way, this time the insert job maintained about 22k rows/sec all night without any pauses, and even though it was sequential insertion, it did a nice job of rotating the active region aroun

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Jean-Daniel Cryans
The fix is here: http://pastebin.com/zuL23e0U We're going to do a push to github later today, along with other patches that require more testing. J-D On Wed, Sep 29, 2010 at 10:54 AM, Andrey Stepachev wrote: > wow. i'll wait. thanks for reply. > > 2010/9/29 Jean-Daniel Cryans : >> Ok I found th

HBase Eclipse compilation problem

2010-09-29 Thread Petrucci Andreas
Hi there, I'm trying to compile HBase in Eclipse so that I can modify it. However, I can't compile it because of errors. I followed the instructions from the HBase wiki site (SVN repository) but couldn't get it to work. There are about 1000 errors, for example: Description Resource Path

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
Wow, I'll wait. Thanks for the reply. 2010/9/29 Jean-Daniel Cryans : > Ok I found the bug, I think it's only in our distro. > > Stay tuned! > > J-D > > On Wed, Sep 29, 2010 at 9:26 AM, Jean-Daniel Cryans > wrote: >> Weird indeed, even after the WAL was rolled 4 times (theoretically >> 256MB of data)

Re: Upgrading 0.20.6 -> 0.89

2010-09-29 Thread Dmitriy Lyubimov
> Full GCs do happen. We have it at 40 seconds here. Jean-Daniel, is that the total including concurrent CMS? 40 seconds is a plausible number for full CMS, even more plausible for i-CMS, so I assume that's what you are quoting here. But CMS doesn't pause the JVM for that much time. Most of that time is spent

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Jean-Daniel Cryans
Ok I found the bug, I think it's only in our distro. Stay tuned! J-D On Wed, Sep 29, 2010 at 9:26 AM, Jean-Daniel Cryans wrote: > Weird indeed, even after the WAL was rolled 4 times (theoretically > 256MB of data) I don't even see a flush request... although you're > running at INFO level inste

Re: region doesn't split after 32+ GB

2010-09-29 Thread Stack
On Wed, Sep 29, 2010 at 9:22 AM, Matt Corgan wrote: > Everything is working fine now. > > My best guess is that when we upgraded from 0.20.6 to 0.89.20100726 there > was a change in hostname resolution (either by hbase, hdfs, or us). Resolution is done differently in 0.89. RS checks into master.

Re: Upgrading 0.20.6 -> 0.89

2010-09-29 Thread Jean-Daniel Cryans
I'd say it mostly depends on your tolerance to regions being unavailable while the recovery happens. You have to account for the ZK timeout (60 secs by default), plus the time to split (I don't have any good metric for that, usually it's kinda fast but you should try it with your data), plus the ti

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Jean-Daniel Cryans
Weird indeed, even after the WAL was rolled 4 times (theoretically 256MB of data) I don't even see a flush request... although you're running at INFO level instead of DEBUG. Could you switch that and send us just the full log. Thanks a lot! J-D On Wed, Sep 29, 2010 at 4:25 AM, Andrey Stepachev

Re: region doesn't split after 32+ GB

2010-09-29 Thread Matt Corgan
Everything is working fine now. My best guess is that when we upgraded from 0.20.6 to 0.89.20100726 there was a change in hostname resolution (either by hbase, hdfs, or us). In 0.20.6, our regionservers looked each other up via IP address, but after the upgrade it switched to hostname, and some o

Re: How to make HBase redundant?

2010-09-29 Thread Jean-Daniel Cryans
On Wed, Sep 29, 2010 at 3:49 AM, Matthew LeMieux wrote: > The problem I referred to was also being addressed in a separate thread, > thanks to the contributors to the mailing list and mostly to J-D and Stack. > > I have recently upgraded to the 0.89.20100924 version and after more than 24 > ho

Re: Upgrading 0.20.6 -> 0.89

2010-09-29 Thread Daniel Einspanjer
Question regarding configuration and tuning... Our current configuration/schema has fairly low hlog rollover sizes to keep the possibility of data loss to a minimum. When we upgrade to .89 with append support, I imagine we'll be able to safely set this to a much larger size. Are there any r

Re: Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
The data is a simple table with two column families. info: a JSON object (small). rows: ~300 BigDecimal columns per row.

Can't insert data into hbase, OOME.

2010-09-29 Thread Andrey Stepachev
Hi all, I'm stuck. I can't insert any meaningful amount of data into HBase. The data is around 20 million rows (20G). I try to insert them into non-distributed HBase with 4 parallel jobs. The MR job runs until all memory given to HBase is exhausted, and then HBase produces an hprof file. As the profiler shows, a

Re: How to make HBase redundant?

2010-09-29 Thread Matthew LeMieux
The problem I referred to was also being addressed in a separate thread, thanks to the contributors to the mailing list and mostly to J-D and Stack. I have recently upgraded to the 0.89.20100924 version and after more than 24 hours am very happy with the results. I think I must have missed t