Re: Running RowCounter as Standalone

2009-02-12 Thread Jean-Daniel Cryans
Philipp, For HBase-related questions, please post to hbase-user@hadoop.apache.org Try importing commons-cli-2.0-SNAPSHOT.jar as well as any other jar in the lib folder just to be sure you won't get any other missing class def error. J-D On Thu, Feb 12, 2009 at 6:32 PM, Philipp Dobrigkeit wrote:

Re: Backup again

2009-02-12 Thread Billy Pearson
because we append edits to the edit logs by default of 300 edits per append. It would not be wise to copy the files with out shutting down the cluster. multi reasons not to do this thank memcache flushes/compactions/updates all working on the data files and directories at the same time. Bu

Hadoop User Group Meeting (Bay Area) 2/18

2009-02-12 Thread Ajay Anand
The next Bay Area Hadoop User Group meeting is scheduled for Wednesday, February 18th at Yahoo! 2811 Mission College Blvd, Santa Clara, Building 2, Training Rooms 5 & 6 from 6:00-7:30 pm. Agenda: Fair Scheduler for Hadoop - Matei Zaharia Interfacing with MySQL - Aaron Kimball Registration

RE: usecase: tagged key/values

2009-02-12 Thread Jonathan Gray
Bill, So out of the box, with a straight-forward schema, you can store the data in the way you want and have the efficient query of "get all key/vals for this identifier". In order to also implement "get all identifiers which have key X = value Y" queries, you'd need to store your data in the inv

Re: usecase: tagged key/values

2009-02-12 Thread Bill de hOra
Jonathan Gray wrote: Bill, It's hard to say whether hbase is a good fit without knowing a bit more. HBase is very well suited for storing data in the format you describe. If your primary problem is scaling the persistence of this dataset, it can certainly do that. You can have any number of a

Re: Backup again

2009-02-12 Thread Michael Dagaev
Erik, thank you for the answer. I am afraid my question not clear. I agree that we can backup database just by scanning it and writing the data somewhere while the database is running. However, if we want to backup the database _by copying the underlying HDFS file system_ we have

Re: Backup again

2009-02-12 Thread Erik Holstad
Hi Michael! I don't think that it is true that you can only backup system that are down, the backup tools that are posted in HBASE-974 can be used on system that are up and running. What that will do for you is to scan the tables and serialize those to disk and the other way around when importing.

Backup again

2009-02-12 Thread Michael Dagaev
Hi, all As I understand from the discussions on the list, we can backup the file system only if Hbase is down. If we want to run backup when Hbase is up we can work on the Hbase level, i.e. use scan, MR export/import utils, etc. but we cannot backup on the file system level. Is it correct?