Re: crafting your key - scan vs. get

2012-10-19 Thread Neil Yalowitz
warrant a look, however, are the filters on the scan. We are already filtering on CF but we can now look at filtering on qualifiers as well. Thanks again, Neil Yalowitz neilyalow...@gmail.com On Thu, Oct 18, 2012 at 4:59 PM, Ian Varley wrote: > Hi Neil, > > Mike summed it up well, as usual

Re: crafting your key - scan vs. get

2012-10-17 Thread Neil Yalowitz
, but I don't trust my crude testing much and hoped someone could describe the performance trade-off between Scan vs. Get. Thanks again for anyone who read this far. Neil Yalowitz neilyalow...@gmail.com On Wed, Oct 17, 2012 at 10:45 AM, Michael Segel wrote: > Neil, > > > Sin

crafting your key - scan vs. get

2012-10-16 Thread Neil Yalowitz
ill greatly increase StoreFile size." ...found here: http://hbase.apache.org/book/schema.versions.html So, are there any performance considerations between Scan vs. Get in this use case? Which choice would you go for? Neil Yalowitz neilyalow...@gmail.com

Re: replication - how do I know the status?

2012-09-13 Thread Neil Yalowitz
inserting a lot of data. > > J-D > > On Thu, Sep 13, 2012 at 1:18 PM, Neil Yalowitz > wrote: > > Hi all, > > > > I'm using HBase replication between two clusters running CDH3u3 and I > > recently noticed that a replicated column family was "lagging&q

replication - how do I know the status?

2012-09-13 Thread Neil Yalowitz
ould I look for to tell me that there is a problem with replication? Neil Yalowitz neilyalow...@gmail.com

length and size of a column family name or qualifier vs. amount of disk storage

2012-02-14 Thread Neil Yalowitz
ifier:-- mycolfam1:myqual1 We are assuming the longer cf/qual would be written to HDFS billions of times and would be wasteful. Is that a correct assumption? Does the answer change if you use Snappy compression? Thanks, Neil Yalowitz neilyalow...@gmail.com

Re: the occasion of the major compact?

2012-01-26 Thread Neil Yalowitz
all files being processed into one ( hbase.hstore.compaction.max ) Can someone confirm if this is correct, or if there are other factors to causing a major compaction that I am missing? Neil Yalowitz On Thu, Jan 26, 2012 at 10:06 AM, yonghu wrote: > Hello Mikael, > > I think disabling

Re: RegionServer dying every two or three days

2012-01-24 Thread Neil Yalowitz
ildly different performance (off by 400%). Occasionally, an instance will become unresponsive to pings and SSH and will completely fall out of the cluster. It seems the strategy for EC2 deployment is to expect everything to fail and plan accordingly. It hasn't been a good experience. Neil Y

Re: RegionServer dying every two or three days

2012-01-19 Thread Neil Yalowitz
We have experienced many problems with our cluster on EC2. The blunt solution was to increase the Zookeeper timeout to 5 minutes or even more. Even with a long timeout, however, it's not uncommon for us to see an EC2 instance to become unresponsive to pings and SSH several times during a week. I

multi-threaded HTablePool, incrementColumnValue, compaction and large data set

2012-01-15 Thread Neil Yalowitz
nces. I can provide code examples if anyone is curious. Neil Yalowitz

Re: heavy writing and compaction storms

2012-01-12 Thread Neil Yalowitz
GB? Also, how does this change with a table with more than one column family? As I understand it, each column family has a memstore. Thanks for your responses so far. Neil Yalowitz On Thu, Jan 12, 2012 at 1:12 PM, Jean-Daniel Cryans wrote: > Hi, > > First you should consider using bul

upgrade 0.90 to 0.92 - HFile v2

2012-01-12 Thread Neil Yalowitz
Is anyone familiar with the upgrade path from HBase 0.90 to 0.92 or greater? Specifically, is there a way to upgrade the existing HFiles to v2, or should this not be attempted?

bulk import and counting increments

2012-01-12 Thread Neil Yalowitz
Hi all, When performing a bulk import into HBase, what methods are available to increment a counter? To describe the problem: a large dataset comes in, and the most efficient way to get that data into an HBase table is to bulk load, as described here: http://hbase.apache.org/bulk-loads.html The

heavy writing and compaction storms

2012-01-12 Thread Neil Yalowitz
lower writes (#1)? Something completely different? Thanks, Neil Yalowitz