Re: HBase/Hadoop backups in EC2

2008-03-20 Thread stack
Joost Ouwerkerk wrote: Ideally we'd have some kind of incremental replication mechanism. That's a fine idea. Add any notions you have of how this might work to HBASE-50. I imagine that this kind of question is among the least of your concerns at this stage in HBase's evolution, though. We

Re: HBase/Hadoop backups in EC2

2008-03-20 Thread Joost Ouwerkerk
We're actually hoping to use HBase in an online production capacity, so I don't think we could shut down regularly enough to protect the data. We could probably stand to lose about 15 minutes' worth of data, but couldn't shut down more than once a day. Ideally we'd have some kind of incremen

Re: HBase/Hadoop backups in EC2

2008-03-20 Thread stack
Can you afford to shut down HBase? Shutting down HBase will force it to dump what's in memory out to the filesystem. Once it's down, run a distcp between your EC2 HDFS and S3? Would that work for you? Otherwise, there is currently no mechanism for taking a snapshot of HBase (HBASE-50 is about t

HBase/Hadoop backups in EC2

2008-03-20 Thread Joost Ouwerkerk
Does anyone have any experience with backing up data for an HBase cluster on Amazon EC2 instances? We could take snapshots of the filesystem on a regular basis and dump to S3, but I was wondering if anyone had any other strategies to recommend? EC2 instances can theoretically drop at anyt

RE: remote connection to hbase

2008-03-20 Thread Peeyush Bishnoi
Hi, Yes, you can connect remotely to HBase. For that: 1. The HBase version should be the same on both machines. 2. Both machines should have a similar HBase configuration. 3. HMaster should be running on node A to accept connections from B. Just try this out and also check the Hadoop logs directory for HBase related

Re: Filter omitting columns

2008-03-20 Thread Clint Morgan
That's great, but it won't do what he wants, as he wants only rows where the value for the status: column equals getBytes(1). Hence the filter. On Thu, Mar 20, 2008 at 9:30 AM, Jim Kellerman <[EMAIL PROTECTED]> wrote: > If you want to get all the family members, you can just specify > 'familyname:
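
To illustrate the distinction drawn in this reply, here is a minimal sketch in plain Java. It deliberately does not use the HBase client API: the table is modelled as an in-memory map, and the class name, the method names, and the `intBytes` helper (a stand-in for the `getBytes(1)` call mentioned in the thread, whose actual encoding is not shown) are all assumptions made for illustration. Selecting by family prefix chooses which columns come back, while a value filter chooses which rows come back.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Map;
import java.util.TreeMap;

public class ColumnSelectionSketch {

    // Hypothetical stand-in for getBytes(1) in the thread:
    // encodes an int as 4 big-endian bytes (an assumption).
    public static byte[] intBytes(int value) {
        return ByteBuffer.allocate(4).putInt(value).array();
    }

    // Column selection: keep every column of one row whose name starts
    // with the family prefix, e.g. "status:". No values are inspected.
    public static Map<String, byte[]> selectFamily(Map<String, byte[]> row, String familyPrefix) {
        Map<String, byte[]> selected = new TreeMap<>();
        for (Map.Entry<String, byte[]> column : row.entrySet()) {
            if (column.getKey().startsWith(familyPrefix)) {
                selected.put(column.getKey(), column.getValue());
            }
        }
        return selected;
    }

    // Row filtering: keep only rows whose named column holds exactly
    // the expected bytes. Rows that lack the column are dropped.
    public static Map<String, Map<String, byte[]>> filterRows(
            Map<String, Map<String, byte[]>> table, String column, byte[] expected) {
        Map<String, Map<String, byte[]>> matching = new TreeMap<>();
        for (Map.Entry<String, Map<String, byte[]>> row : table.entrySet()) {
            byte[] actual = row.getValue().get(column);
            if (actual != null && Arrays.equals(actual, expected)) {
                matching.put(row.getKey(), row.getValue());
            }
        }
        return matching;
    }

    // Small in-memory table (row key -> column name -> value), illustration only.
    public static Map<String, Map<String, byte[]>> sampleTable() {
        Map<String, Map<String, byte[]>> table = new TreeMap<>();
        int[] statuses = {1, 0, 1};
        for (int i = 0; i < statuses.length; i++) {
            Map<String, byte[]> row = new TreeMap<>();
            row.put("status:", intBytes(statuses[i]));
            row.put("info:name", ("name" + i).getBytes(StandardCharsets.UTF_8));
            table.put("row" + (i + 1), row);
        }
        return table;
    }

    public static void main(String[] args) {
        Map<String, Map<String, byte[]>> table = sampleTable();
        // Row filter: only rows whose status: value equals intBytes(1).
        System.out.println(filterRows(table, "status:", intBytes(1)).keySet()); // prints [row1, row3]
        // Family selection: row2 still comes back, just with fewer columns.
        System.out.println(selectFamily(table.get("row2"), "status:").keySet()); // prints [status:]
    }
}
```

In this sketch the family selection returns row2 even though its status value is 0, which is why a wild-card column alone does not answer the original question.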

RE: Filter omitting columns

2008-03-20 Thread Jim Kellerman
If you want to get all the family members, you can just specify 'familyname:' as the column. This creates a wild-card scanner which will do what you want without filters. --- Jim Kellerman, Senior Engineer; Powerset > -Original Message- > From: Clint Morgan [mailto:[EMAIL PROTECTED] > Se

Re: Filter omitting columns

2008-03-20 Thread Clint Morgan
I was having a similar problem as well, though I've never used just the column families to specify the columns (e.g. I always use fully qualified column names like family:col). Maybe you can try my patch and see if it fixes your problem. https://issues.apache.org/jira/browse/HBASE-527 Also you can give a n

remote connection to hbase

2008-03-20 Thread cure
Hi, is there any possibility of connecting to HBase from a remote machine? I have two machines, A and B, and I have set up Hadoop 0.17 on A with the newest HBase from svn, but I can't connect from B to A:6. (all firewalls are disabled). In netstat I see that connection to HBase is enabled only f

Filter omitting columns

2008-03-20 Thread Goel, Ankur
Hi, I am trying to obtain a set of rows by obtaining a scanner on HTable. I also specify the RowFilterCriteria like this. /* Code Start */ Map columnFilter = new HashMap(); columnFilter.put(new Text("status:"), getBytes(1)); RowFilterInterface rowFilter = new RegExpRowFilter(".*", columnFilte