
Re: is Hbase Scan really need thorough Get (Hbase+solr+spark)

2016-01-19 Thread ayan guha

It is not scanning HBase. What it does is loop through your list of row keys and fetch the data for each one, one at a time. Example: your Solr result has 5 records, with row keys R1...R5. The list will then be [R1,R2,...R5], and table.get(list) will do something like: res=[] for k in list: v =
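The idea described above, keyed lookups instead of a scan, can be sketched as follows. This is only an illustration of that mental model, not the truncated pseudocode from the message and not HBase client code: the in-memory map `TABLE` and the method `getAll` are hypothetical stand-ins.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class PerKeyLookupSketch {
    // Hypothetical stand-in for an HBase table: row key -> row value.
    static final Map<String, String> TABLE = new LinkedHashMap<>();
    static {
        for (int i = 1; i <= 5; i++) {
            TABLE.put("R" + i, "value-" + i);
        }
    }

    // Looks up each requested row key individually; no range of rows is scanned.
    static List<String> getAll(List<String> rowKeys) {
        List<String> results = new ArrayList<>();
        for (String key : rowKeys) {
            results.add(TABLE.get(key)); // null when the key is absent
        }
        return results;
    }

    public static void main(String[] args) {
        // Only the three requested keys are touched, however large the table is.
        System.out.println(getAll(Arrays.asList("R1", "R3", "R5")));
    }
}
```

The cost here grows with the number of requested keys, not with the size of the table, which is the distinction from a scan.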

Re: is Hbase Scan really need thorough Get (Hbase+solr+spark)

2016-01-19 Thread Ted Yu

get(List gets) will call:

    Object [] r1 = batch((List)gets);

where batch() would do:

    AsyncRequestFuture ars = multiAp.submitAll(pool, tableName, actions, null, results);
    ars.waitUntilDone();

multiAp is an AsyncProcess. In short, the client would access the region servers for the results.
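A minimal sketch of what batching buys, assuming the client groups the requested row keys by the region server that hosts them and sends one request per server. The region layout, the server names, and the methods `serverFor` and `groupByServer` are hypothetical and exist only for illustration; this is not the AsyncProcess implementation.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class BatchGetSketch {
    // Hypothetical region layout: start row key of each region -> hosting server.
    static final TreeMap<String, String> REGION_STARTS = new TreeMap<>();
    static {
        REGION_STARTS.put("", "server-a");   // first region starts at the empty key
        REGION_STARTS.put("R3", "server-b");
        REGION_STARTS.put("R5", "server-c");
    }

    // Picks the region whose start key is the greatest one <= rowKey.
    static String serverFor(String rowKey) {
        return REGION_STARTS.floorEntry(rowKey).getValue();
    }

    // Groups row keys by server so that each server receives a single multi-get.
    static Map<String, List<String>> groupByServer(List<String> rowKeys) {
        Map<String, List<String>> perServer = new TreeMap<>();
        for (String key : rowKeys) {
            perServer.computeIfAbsent(serverFor(key), s -> new ArrayList<>()).add(key);
        }
        return perServer;
    }

    public static void main(String[] args) {
        Map<String, List<String>> requests =
                groupByServer(Arrays.asList("R1", "R2", "R3", "R4", "R5"));
        // One line per server: the keys that would travel in that server's request.
        requests.forEach((server, keys) -> System.out.println(server + " <- " + keys));
    }
}
```

With this layout, five row keys turn into three requests rather than five. Either way, the rows are still read from the region servers, which is the point made above.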

is Hbase Scan really need thorough Get (Hbase+solr+spark)

2016-01-19 Thread beeshma r

Hi, I am trying to integrate HBase, Solr, and Spark. Solr indexes all the documents from HBase through hbase-indexer, and I manipulate all the datasets through Spark. The thing is, after getting the Solr documents from the Solr query, they contain the row key and row values. So I directly got the row keys and

Re: is Hbase Scan really need thorough Get (Hbase+solr+spark)

2016-01-19 Thread beeshma r

Thanks Ted :) If everything gets indexed from HBase into Solr, then there is no need to go to the region servers once again.

Thanks,
Beesh

On Wed, Jan 20, 2016 at 5:05 AM, Ted Yu wrote:
> get(List gets) will call:
>
> Object [] r1 = batch((List)gets);
>
> where batch() would