Re: Unit Test: HBase Map/Reduce

2010-04-21 Thread Karthik K
On Mon, Apr 19, 2010 at 10:55 AM, Renaud Delbru wrote:
> Also, does anyone have any advice/examples on how to write and run unit
> tests involving hbase?
Specifically about this - I found HBaseTestingUtility very useful, available in trunk (backported to 0.20.5+ as well, I guess). For PE and

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Andrey
Thank you, guys. Your comments are quite helpful for me. Andrey

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Andrey
Jean-Daniel Cryans writes:
> That can be done in a couple of seconds using the normal HBase client
> in a multithreaded process, fed by a message queue if you feel like
> it. What were you trying to achieve using MR?
>
> J-D
Yes, that's a point as well, to use or not to use the MapReduce.

RE: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Andrew Purtell
My advice is to use a scanner with explicit start/end key (best) or filters (still good), not temporary tables (not so good). HBase 0.21 will have MultiGet, so that would be another option then.
- Andy
> From: Geoff Hendrey [...]
> As I understand it, you have a table, and you need to do
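A minimal sketch of what "a scanner with explicit start/end key" buys you. This is not HBase client code: a `TreeMap` stands in for the table (both keep rows sorted by key), and the class name `RangeScanSketch`, the method `scanRange`, and the row keys are hypothetical, made up for illustration. The one HBase fact it relies on is that a scan's start row is inclusive and its stop row is exclusive.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.NavigableMap;
import java.util.TreeMap;

// Hypothetical stand-in: a TreeMap plays the role of an HBase table,
// because both store rows sorted by row key.
final class RangeScanSketch {

    // Returns the row keys in [startRow, stopRow): start inclusive,
    // stop exclusive, mirroring the semantics of an HBase scan range.
    static List<String> scanRange(NavigableMap<String, String> table,
                                  String startRow, String stopRow) {
        // subMap is a view over the sorted keys, so only rows inside the
        // range are visited; rows outside it are never touched.
        return new ArrayList<>(
                table.subMap(startRow, true, stopRow, false).keySet());
    }

    public static void main(String[] args) {
        NavigableMap<String, String> table = new TreeMap<>();
        table.put("row-001", "a");
        table.put("row-002", "b");
        table.put("row-003", "c");
        table.put("row-004", "d");

        // Prints [row-002, row-003]
        System.out.println(scanRange(table, "row-002", "row-004"));
    }
}
```

The point of the range is that work is proportional to the rows between the two keys rather than to the size of the table, which is why it beats filtering a full scan or copying rows to a temporary table.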

RE: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Geoff Hendrey
el_se...@hotmail.com] Sent: Tuesday, April 20, 2010 9:37 AM To: hbase-user@hadoop.apache.org Subject: RE: Get operation in HBase Map-Reduce methods Going back to the OP's question... using get() within a M/R, the answer is yes. However you have a problem in that you need to have to somehow deter

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Richard Weber
ng a mapper that takes a list in memory as its input source. > > Now here's the point where someone from Cloudera, Yahoo! or somewhere else > says that even that piece is trivial and here's how to do it. :-) > > -Mike > > >> > Date: Tue, 20 Apr 201

RE: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Michael Segel
source. Now here's the point where someone from Cloudera, Yahoo! or somewhere else says that even that piece is trivial and here's how to do it. :-) -Mike > Date: Tue, 20 Apr 2010 10:15:52 +0200 > Subject: Re: Get operation in HBase Map-Reduce methods > From: jdcry...@apache.or

Re: Unit Test: HBase Map/Reduce

2010-04-20 Thread Renaud Delbru
Hi Andrey, thanks a lot, your classes are quite useful. However, my problem was not related to the HBaseClusterTestCase, but it was a problem in my code. Thanks -- Renaud Delbru On 19/04/10 19:39, Andrey S wrote: 2010/4/19 Renaud Delbru Hi, I am trying to create a unit test using the

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Jean-Daniel Cryans
That can be done in a couple of seconds using the normal HBase client in a multithreaded process, fed by a message queue if you feel like it. What were you trying to achieve using MR?
J-D
On Tue, Apr 20, 2010 at 12:54 PM, Andrey wrote:
> Yes, about 1k rows currently. In the future it may happen
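A rough sketch of the multithreaded approach described above, under stated assumptions. It contains no HBase calls: the `getRow` function is a hypothetical placeholder for whatever per-row lookup the real client would perform, and `ParallelGetSketch` and `fetchAll` are names invented for this example. The thread pool's internal work queue plays the part of the "message queue" feeding the workers.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

final class ParallelGetSketch {

    // Looks up every row ID on a fixed-size thread pool and returns the
    // results in the same order as the input list.
    // getRow is a hypothetical stand-in for a single-row client lookup.
    static Map<String, String> fetchAll(List<String> rowIds,
                                        Function<String, String> getRow,
                                        int threads) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            // Submit one task per row ID; the pool queues them for the workers.
            List<Future<String>> futures = new ArrayList<>();
            for (String id : rowIds) {
                Callable<String> task = () -> getRow.apply(id);
                futures.add(pool.submit(task));
            }
            // Collect results in input order, blocking on each future in turn.
            Map<String, String> results = new LinkedHashMap<>();
            for (int i = 0; i < rowIds.size(); i++) {
                try {
                    results.put(rowIds.get(i), futures.get(i).get());
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    throw new IllegalStateException("interrupted while waiting", e);
                } catch (ExecutionException e) {
                    throw new IllegalStateException(
                            "lookup failed for row " + rowIds.get(i), e.getCause());
                }
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) {
        // The lambda below fakes a lookup; a real one would call the client.
        Map<String, String> rows =
                fetchAll(Arrays.asList("r1", "r2", "r3"), id -> "value-of-" + id, 2);
        System.out.println(rows);
    }
}
```

For the roughly 1k rows mentioned in the thread, a handful of threads is likely enough; the pool size here is a tuning knob, not a recommendation.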

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Andrey
Yes, about 1k rows currently. In the future it may happen to be more: some tens of thousands. Andrey

Re: Get operation in HBase Map-Reduce methods

2010-04-20 Thread Jean-Daniel Cryans
What are the numbers like? Is it 1k rows you need to process? 1M? 10B? Your question is more about scaling (or the need to). J-D On Tue, Apr 20, 2010 at 8:39 AM, Andrey wrote: > Dear All, > > Assumed, I've got a list of rowIDs of a HBase table. I want to get each row by > its rowID, do some oper

Get operation in HBase Map-Reduce methods

2010-04-19 Thread Andrey
Dear All, Assume I've got a list of rowIDs of an HBase table. I want to get each row by its rowID, do some operations with its values, and then store the results somewhere. Is there a good way to do this in a Map-Reduce manner? As far as I understand, a mapper usually takes a Scan to for

Re: Unit Test: HBase Map/Reduce

2010-04-19 Thread Andrey S
2010/4/19 Renaud Delbru > Hi, > > I am trying to create a unit test using the HBaseClusterTestCase and the > RowCounter example. > I am able to spin up a hbase table, load data inside, access the data > (lookup and scan), but whenever I am trying to launch a map/reduce job > (TableMapper), the ma

Unit Test: HBase Map/Reduce

2010-04-19 Thread Renaud Delbru
Hi, I am trying to create a unit test using the HBaseClusterTestCase and the RowCounter example. I am able to spin up a hbase table, load data inside, access the data (lookup and scan), but whenever I am trying to launch a map/reduce job (TableMapper), the map/reduce functions are never execut

hbase map reduce tutorial ... seeking feedback

2010-04-10 Thread Sujee Maniyam
Hi All, I have a tutorial on HBase MapReduce here: http://sujee.net/tech/articles/hbase-map-reduce-freq-counter/ It is rated PG-13 (i.e. for beginners). It uses the v0.20+ mapreduce APIs. I'd appreciate any comments & feedback from this group. Thanks, Sujee http://sujee.net

Re: Looking for simple hbase map/reduce example

2009-09-28 Thread tigertail
> >>> what I have read the comparison portion would occur during the map >>> step, >>> >>> and >>> >>> the reduce step would then aggregate the totals to come up with a >>> >>> percent >>> >>> matched for eac

Re: Looking for simple hbase map/reduce example

2009-09-28 Thread terryg
t;>> >> >>> What i'm trying to find are some simple map/reduce hbase examples for >> >>> 0.20 >> >>> API. Can anyone point me in the direction of some? >> >> >> >> >> >> See content of org.apache.ha

Re: Looking for simple hbase map/reduce example

2009-09-26 Thread stack
t > >>> matched for each field. > >>> > >>> What i'm trying to find are some simple map/reduce hbase examples for > >>> 0.20 > >>> API. Can anyone point me in the direction of some? > >> > >> > >> See conte

Re: Looking for simple hbase map/reduce example

2009-09-26 Thread terryg
ield. >>> >>> What i'm trying to find are some simple map/reduce hbase examples for >>> 0.20 >>> API. Can anyone point me in the direction of some? >> >> >> See content of org.apache.hadoop.hbase.mapreduce package. Checkout in >> part

Re: Looking for simple hbase map/reduce example

2009-09-25 Thread stack
Or better, http://kdpeterson.net/blog/2009/09/minimal-hbase-mapreduce-example.html St.Ack On Fri, Sep 25, 2009 at 2:32 PM, stack wrote: > On Fri, Sep 25, 2009 at 2:09 PM, terryg wrote: > >> >> I'm working on a map/reduce java program for performing a comparison of >> columns in a single record a

Re: Looking for simple hbase map/reduce example

2009-09-25 Thread stack
On Fri, Sep 25, 2009 at 2:09 PM, terryg wrote: > > I'm working on a map/reduce java program for performing a comparison of > columns in a single record and then reporting the percent matched. From > what I have read the comparison portion would occur during the map step, > and > the reduce step

Looking for simple hbase map/reduce example

2009-09-25 Thread terryg
rcent matched for each field. What i'm trying to find are some simple map/reduce hbase examples for 0.20 API. Can anyone point me in the direction of some? -- View this message in context: http://www.nabble.com/Looking-for-simple-hbase-map-reduce-example-tp25619336p25619336.html Sen

Re: Hbase Map-reduce Scheduler

2009-08-26 Thread Billy Pearson
I think you are looking for this JIRA for reduce assignment: https://issues.apache.org/jira/browse/HBASE-1199 Map tasks are assigned to run on the server hosting their region, but if there is an idle server it will pull the next map task. Billy "bharath vissapragada" wrote in mess

Hbase Map-reduce Scheduler

2009-08-20 Thread bharath vissapragada
Hi all, can anyone tell me where I can access some docs which give a good explanation of how the map-reduce scheduler works in HBase, i.e., how map regions are created (to minimize data flow through the network) and how the reduce phases are performed so that we can minimize the flow of keys and values a

HBase Map/Reduce

2009-06-01 Thread llpind
w.nabble.com/file/p23823017/HBaseDBUploader.java HBaseDBUploader.java http://www.nabble.com/file/p23823017/DBInputFormat.java DBInputFormat.java -- View this message in context: http://www.nabble.com/HBase-Map-Reduce-tp23823017p23823017.html Sent from the HBase User mailing list archive at Nabble.com.

Re: HBase Map Reduce

2008-08-23 Thread Jean-Daniel Cryans
Rafael, Like it says in the documentation, the RowCounter is a very good example. Else have a look at: http://wiki.apache.org/hadoop/Hbase/MapReduce J-D On Sat, Aug 23, 2008 at 10:35 AM, Rafael Turk <[EMAIL PROTECTED]> wrote: > Hello, > > I've been trying to implement a

HBase Map Reduce

2008-08-23 Thread Rafael Turk
Hello, I've been trying to implement an HBase map & reduce job. Going through the API documentation at http://hadoop.apache.org/hbase/docs/r0.2.0/api/index.html I've found a nice example, but it's outdated, as it refers to the HBase 0.1 branch. Can anyone provide an updated ver

RE: Adding version number and timestamps to Hbase map reduce

2008-08-13 Thread Jonathan Gray
To: hbase-user@hadoop.apache.org Subject: Re: Adding version number and timestamps to Hbase map reduce (Sent this morning from the wrong email address, sorry if this gets in the moderators way ...) I need to be able to retrieve timestamps through the thrift interface - after sending the query

Re: Adding version number and timestamps to Hbase map reduce

2008-08-13 Thread Toby White
(Sent this morning from the wrong email address, sorry if this gets in the moderators way ...) I need to be able to retrieve timestamps through the thrift interface - after sending the query below this morning, I got my hands dirty & looked into it properly. I've now implemented two thrift

RE: Adding version number and timestamps to Hbase map reduce

2008-08-13 Thread Toby White
From the mailing list last month: In HBase-0.1.x, you cannot get the timestamps. In trunk (0.2.0) you will. --- Jim Kellerman, Senior Engineer; Powerset -Original Message- From: Yair Even-Zohar [mailto:[EMAIL PROTECTED] ] Sent: Tuesday, July 15, 2008 8:05 AM I got several questions:

RE: Adding version number and timestamps to Hbase map reduce

2008-07-15 Thread Jim Kellerman
Subject: Adding version number and timestamps to Hbase map reduce > > I got several questions: > > > > 1) Can we retrieve the timestamp for a given version for a data > point in a column? I see in the documentation that > HTable.get() may return array of versions but I

Adding version number and timestamps to Hbase map reduce

2008-07-15 Thread Yair Even-Zohar
I have several questions:
1) Can we retrieve the timestamp for a given version of a data point in a column? I see in the documentation that HTable.get() may return an array of versions, but I don't see how to extract the timestamp for each version.
2) Did anyone code a mapReduc