Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-24 Thread Li Li
On Wed, Jul 23, 2014 at 6:31 AM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 11:11 PM, Li Li fancye...@gmail.com wrote: On Tue, Jul 22, 2014 at 1:57 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 10:53 PM, Li Li fancye...@gmail.com wrote: Sorry, I pressed Tab

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-24 Thread Li Li
information on the subject. -Vladimir Rodionov On Tue, Jul 22, 2014 at 3:31 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 11:11 PM, Li Li fancye...@gmail.com wrote: On Tue, Jul 22, 2014 at 1:57 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 10:53 PM, Li Li fancye

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-22 Thread Li Li
On Tue, Jul 22, 2014 at 1:54 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 10:47 PM, Li Li fancye...@gmail.com wrote: Sorry, I had not finished it. 1. Yes, I have 20 concurrently running mappers. 2. I can't add more mappers because I set io.sort.mb to 500 MB and if I set 8 mappers

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-22 Thread Li Li
On Tue, Jul 22, 2014 at 1:57 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 10:53 PM, Li Li fancye...@gmail.com wrote: Sorry, I pressed Tab and it sent my unfinished post. See the following mail for answers to the other questions. I forget the details of the exception. It throws exception

Fwd: how to do parallel scanning in map reduce using hbase as input?

2014-07-21 Thread Li Li
Could anyone help? I now have about 1.1 billion nodes, and it takes 2 hours to finish a map reduce job. -- Forwarded message -- From: Li Li fancye...@gmail.com Date: Thu, Jun 26, 2014 at 3:34 PM Subject: how to do parallel scanning in map reduce using hbase as input? To: u

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-21 Thread Li Li
:30 PM, Li Li fancye...@gmail.com wrote: Could anyone help? I now have about 1.1 billion nodes, and it takes 2 hours to finish a map reduce job. -- Forwarded message -- From: Li Li fancye...@gmail.com Date: Thu, Jun 26, 2014 at 3:34 PM Subject: how to do parallel scanning

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-21 Thread Li Li
432066k
mphbase3,60020,1405730850546 46 46 53592m 53610mb 42858k 110748k
mphbase4,60020,1405730850585 43 101 109790m 109827mb 83236k 295068k
mphbase5,60020,1405730850652 41 81 89073m 89099mb 66622k 243354k
9. url_db has 84 regions
On Tue, Jul 22, 2014 at 1:32 PM, Li Li fancye...@gmail.com wrote: 1

Re: how to do parallel scanning in map reduce using hbase as input?

2014-07-21 Thread Li Li
/regionserver has 16 GB of memory, but the free memory for map-reduce is only about 5 GB, so I can't add more mappers. On Tue, Jul 22, 2014 at 1:37 PM, Stack st...@duboce.net wrote: On Mon, Jul 21, 2014 at 10:32 PM, Li Li fancye...@gmail.com wrote: 1. Yes, I have 20 concurrently running mappers. 2. I can't add

Re: anyone interested in openhft?

2014-04-16 Thread Li Li
is ConcurrentSkipListMap as far as I remember. -Vladimir Rodionov On Tue, Apr 15, 2014 at 10:24 PM, Nick Dimiduk ndimi...@gmail.com wrote: Yes, now that we have memstore abstracted, this is something worth experimenting with. On Tuesday, April 15, 2014, Li Li fancye...@gmail.com wrote

anyone interested in openhft?

2014-04-15 Thread Li Li
http://www.infoq.com/articles/Open-JDK-and-HashMap-Off-Heap http://openhft.blogspot.com/2014/03/javautilconcurrentconcurrenthashmap-vs.html I found this off-heap solution and remember that HBase faces GC problems with large heaps