RE: performance on getRow and get

2008-07-15 Thread Jim Kellerman
get(row, column) is more efficient than get(row) because get(row) must access multiple HStores and do multiple reads, while get(row, column) only accesses one HStore.
---
Jim Kellerman, Senior Engineer; Powerset
> -Original Message-
> From: Daniel [mailto:[EMAIL PROTECTED]
> Sent: Tuesd

RE: Adding version number and timestamps to Hbase map reduce

2008-07-15 Thread Jim Kellerman
In HBase-0.1.x, you cannot get the timestamps. In trunk (0.2.0) you will be able to.
---
Jim Kellerman, Senior Engineer; Powerset
> -Original Message-
> From: Yair Even-Zohar [mailto:[EMAIL PROTECTED]
> Sent: Tuesday, July 15, 2008 8:05 AM
> To: hbase-user@hadoop.apache.org
> Subject: Adding versio

Adding version number and timestamps to Hbase map reduce

2008-07-15 Thread Yair Even-Zohar
I have several questions:
1) Can we retrieve the timestamp for a given version of a data point in a column? I see in the documentation that HTable.get() may return an array of versions, but I don't see how to extract the timestamp for each version.
2) Did anyone code a mapReduc

Re: performance on getRow and get

2008-07-15 Thread Sébastien Rainville
Hi Daniel,
Yes, get(row) is more expensive than get(row, column name). Keep in mind that HBase is column oriented: when you fetch data from columns in different column families, it needs to access multiple files (1 per column family) in order to get the data for the whole row.
Sebastien
On Tue
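
The cost difference described in this thread can be illustrated with a toy model. The sketch below is not the HBase API: ToyTable, store_reads, and the "family:qualifier" column naming are hypothetical names made up for illustration, and each column family is modelled as a single in-memory dict standing in for one HStore. It only counts how many per-family stores a call has to consult.

```python
from collections import defaultdict


class ToyTable:
    """Toy column-oriented table: one in-memory store per column family.

    Hypothetical illustration only; this is not how HBase is implemented.
    """

    def __init__(self, families):
        # family name -> {row key -> {qualifier -> value}}
        self.stores = {family: defaultdict(dict) for family in families}
        # Number of per-family stores consulted so far.
        self.store_reads = 0

    def put(self, row, column, value):
        # Columns are named "family:qualifier" in this sketch.
        family, qualifier = column.split(":", 1)
        self.stores[family][row][qualifier] = value

    def _read_store(self, family, row):
        # Every lookup in a family's store counts as one store read.
        self.store_reads += 1
        return self.stores[family].get(row, {})

    def get(self, row, column):
        # A single-column read consults exactly one store.
        family, qualifier = column.split(":", 1)
        return self._read_store(family, row).get(qualifier)

    def get_row(self, row):
        # A whole-row read consults every family's store.
        result = {}
        for family in self.stores:
            for qualifier, value in self._read_store(family, row).items():
                result[f"{family}:{qualifier}"] = value
        return result


table = ToyTable(["info", "stats", "raw"])
table.put("row1", "info:name", "example")
table.put("row1", "stats:hits", 3)
table.put("row1", "raw:payload", "bytes")

table.get("row1", "info:name")
single_reads = table.store_reads  # 1: only the "info" store was consulted

table.store_reads = 0
table.get_row("row1")
row_reads = table.store_reads  # 3: one read per column family
```

The model ignores client-to-server round trips, so it says nothing about whether one get(row) beats several get(row, column) calls when most of the row is needed; it only shows why a whole-row read touches more stores than a single-column read.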

performance on getRow and get

2008-07-15 Thread Daniel
Hi all, I'm writing a program to access my HBase table in an MR job. My first version fetched each value with get(row, column name); I'm now changing it to fetch one whole row at a time into a map and query that map instead, for one reduce job. I think it would be better to access HBase only

Re: Problem Accessing Hbase

2008-07-15 Thread Jean-Daniel Cryans
Srikanth, Interesting, the META table is never scanned at all! I think that happened to me once when I started messing around with HBase. Since the META table is missing, I guess you don't have any data in HBase, so I suggest shutting down HBase, then Hadoop, format HDFS, restart Hadoop and then rest

Re: table design questions

2008-07-15 Thread ZhaoWei
Thanks JD, I'm using 0.1.3 currently and will upgrade to 0.2 soon. I tried the scanner in 0.1.3, but it doesn't seem to perform very well: reading all serialized objects from just one column is much faster than reading from a whole column family with the scanner.
From: "Jean-Daniel Cryans" <[EMAIL PROTECTED]>

Re: Problem Accessing Hbase

2008-07-15 Thread srikanth . bondalapati
Hi, I am using HBase version 0.1.2 and Hadoop version 0.16.4. These are the contents of the HBase log files after starting HBase and issuing a query from the HBase shell:
1) HBase master
2008-07-15 09:29:01,298 INFO org.apache.hadoop.hbase.HMaster: Root region dir: file:/home/srikanth/