Re: Distributed table processing is slower that local table processing

2012-03-31 Thread Alexander Goryunov
Hi Anil, Yes, I'm sure I'm running cluster in distributed mode (I see 21 parallel map tasks in job tracker and processes on each node). max map tasks set to 7 for each node. I run my job with the same cluster configuration on two tables: 1. Table located only on 1 node (I see it on HBase master

Re: Distributed table processing is slower that local table processing

2012-03-30 Thread Alexander Goryunov
Hi Anil, Yes, the second table is distributed, the first is not and I have 3х better results for nondistrubuted table. I use distributed hadoop mode for all cases. Thanks. On Fri, Mar 30, 2012 at 3:26 AM, anil gupta anilg...@buffalo.edu wrote: Hi Alexander, Is data properly distributed

Re: Distributed table processing is slower that local table processing

2012-03-30 Thread anil gupta
Hi Alexander, If you can provide more details of the stuff you are doing then it would be helpful. Are you sure that your cluster is running in distributed mode? Did you ran the job with 1 node in cluster and then added 2 additional node to the same cluster? Thanks, Anil 2012/3/30 Alexander

Distributed table processing is slower that local table processing

2012-03-29 Thread Alexander Goryunov
Hello, I'm running 3 data node cluster (8core Xeon, 16G) + 1 node for jobtracker and namenode with Hadoop and HBase and have strange performance results. The same map job runs with speed about 300 000 records per second for 1 node table and 100 000 records per second for table distributed to 3

Re: Distributed table processing is slower that local table processing

2012-03-29 Thread anil gupta
Hi Alexander, Is data properly distributed over the cluster in Distributed Mode? If the data is not then you wont get good results in distributed mode. Thanks, Anil Gupta On Thu, Mar 29, 2012 at 8:37 AM, Alexander Goryunov a.goryu...@gmail.comwrote: Hello, I'm running 3 data node cluster