Do you know how much data is being brought back (i.e. 100 megabytes)? I am
wondering what the data rate is in MB/s. Do you know how many files per
tablet you have? Do most of the 10,000 ids you are querying for exist?
On Tue, May 12, 2015 at 1:58 PM, vaibhav thapliyal
How many tablets do you have? The batch scanner does not parallelize
operations within a tablet.
If you give the batch scanner more threads than there are tservers, it will
make multilple parallel rpc calls to each tserver if the tserver has
multiple tablets. Each rpc may include multiple
Hi,
I am using BatchScanner to scan rows from a accumulo table. The table has
around 187m entries and I am using a 3 node cluster which has accumulo
1.6.1.
I have passed 1 ids which are stored as row id in my table as a list in
the setRanges() method.
This whole process takes around 50
On the monitor page, you should see how many threads are running in
each tserver, if I remember correctly. There are also graphs to show
response rates.
On Tue, May 12, 2015 at 2:39 PM, vaibhav thapliyal
vaibhav.thapliyal...@gmail.com wrote:
I also tried to increase threads to a bigger number
I also tried to increase threads to a bigger number about 500, but yes I
will try using batchscanner with 194 threads too. I will get back with the
info that Keith has asked in some time.
Thanks
Vaibhav
On 13-May-2015 12:04 am, David Medinets david.medin...@gmail.com wrote:
Try using 194
Try using 194 threads if your hardware can support them. The worst
that'll happen is the client program crashes during testing. If that
happens, cut the number of threads in half. And so on.
On Tue, May 12, 2015 at 1:58 PM, vaibhav thapliyal
vaibhav.thapliyal...@gmail.com wrote:
I have 194
I have 194 tablets. Currently I am using 20 threads to create the
batchscanner inside the createBatchScanner method.
On 12-May-2015 11:19 pm, Keith Turner ke...@deenlo.com wrote:
How many tablets do you have? The batch scanner does not parallelize
operations within a tablet.
If you give the