It is not scanning HBase. It loops through your list of row keys and
fetches the data for each key, one at a time.
Example: your Solr result has 5 records, with row keys R1...R5.
The list will then be [R1,R2,...R5].
table.get(list) will then do something like:
res=[]
for k in list:
v =
get(List<Get> gets) will call:
Object [] r1 = batch((List)gets);
where batch() would do:
AsyncRequestFuture ars = multiAp.submitAll(pool, tableName, actions,
null, results);
ars.waitUntilDone();
multiAp is an AsyncProcess.
In short, the client contacts the region servers to get the results.
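To make the batching idea concrete, here is a rough, self-contained sketch of how a client could group a list of row keys by the region server that hosts them, so that each server is contacted once rather than once per key. This is not the HBase source. The class name MultiGetSketch, the method groupByServer, the region layout, and the server names rs1/rs2 are all made up for illustration, and the real AsyncProcess does much more (retries, region location lookups, threading).

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative model only: none of these names come from the HBase client.
class MultiGetSketch {

    // regionStartToServer maps a region's start key to the server hosting it
    // (a hypothetical layout). A row belongs to the region with the greatest
    // start key that is <= the row key. The map is assumed to contain the
    // empty start key "", so every row key falls into some region.
    static Map<String, List<String>> groupByServer(
            TreeMap<String, String> regionStartToServer, List<String> rowKeys) {
        Map<String, List<String>> perServer = new LinkedHashMap<>();
        for (String row : rowKeys) {
            Map.Entry<String, String> region = regionStartToServer.floorEntry(row);
            String server = region.getValue();
            // Collect all row keys destined for the same server into one batch.
            perServer.computeIfAbsent(server, s -> new ArrayList<>()).add(row);
        }
        return perServer;
    }

    public static void main(String[] args) {
        TreeMap<String, String> regions = new TreeMap<>();
        regions.put("", "rs1");   // first region starts at the empty key
        regions.put("R3", "rs2"); // second region starts at R3

        List<String> keys = List.of("R1", "R2", "R3", "R4", "R5");
        // One batch per server instead of one request per row key.
        System.out.println(groupByServer(regions, keys));
        // prints {rs1=[R1, R2], rs2=[R3, R4, R5]}
    }
}
```

With the five row keys from the example, this model yields two batches, one per server. Either way the data still comes from the region servers, not from a table scan.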
Hi,
I am trying to integrate HBase, Solr, and Spark.
Solr indexes all the documents from HBase through hbase-indexer.
I manipulate all the datasets through Spark. The thing is, after getting
the Solr documents from the Solr query, each one has the row key and row values. So
I got the row keys directly and
Thanks Ted, :)
If everything gets indexed from HBase into Solr, then there is no need to go to
the region servers again.
Thanks
Beesh
On Wed, Jan 20, 2016 at 5:05 AM, Ted Yu wrote:
> get(List gets) will call:
>
> Object [] r1 = batch((List)gets);
>
> where batch() would