Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread Keith Turner
Do you know how much data is being brought back (i.e. 100 megabytes)? I am wondering what the data rate is in MB/s. Do you know how many files per tablet you have? Do most of the 10,000 ids you are querying for exist? On Tue, May 12, 2015 at 1:58 PM, vaibhav thapliyal

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread Keith Turner
How many tablets do you have? The batch scanner does not parallelize operations within a tablet. If you give the batch scanner more threads than there are tservers, it will make multilple parallel rpc calls to each tserver if the tserver has multiple tablets. Each rpc may include multiple

BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
Hi, I am using BatchScanner to scan rows from a accumulo table. The table has around 187m entries and I am using a 3 node cluster which has accumulo 1.6.1. I have passed 1 ids which are stored as row id in my table as a list in the setRanges() method. This whole process takes around 50

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread David Medinets
On the monitor page, you should see how many threads are running in each tserver, if I remember correctly. There are also graphs to show response rates. On Tue, May 12, 2015 at 2:39 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: I also tried to increase threads to a bigger number

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
I also tried to increase threads to a bigger number about 500, but yes I will try using batchscanner with 194 threads too. I will get back with the info that Keith has asked in some time. Thanks Vaibhav On 13-May-2015 12:04 am, David Medinets david.medin...@gmail.com wrote: Try using 194

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread David Medinets
Try using 194 threads if your hardware can support them. The worst that'll happen is the client program crashes during testing. If that happens, cut the number of threads in half. And so on. On Tue, May 12, 2015 at 1:58 PM, vaibhav thapliyal vaibhav.thapliyal...@gmail.com wrote: I have 194

Re: BatchScanner taking too much time to scan rows

2015-05-12 Thread vaibhav thapliyal
I have 194 tablets. Currently I am using 20 threads to create the batchscanner inside the createBatchScanner method. On 12-May-2015 11:19 pm, Keith Turner ke...@deenlo.com wrote: How many tablets do you have? The batch scanner does not parallelize operations within a tablet. If you give the