Re: How spark writes to HBASE

2018-01-22 Thread vignesh
to the region (via staging directory). > > Please see: > http://hbase.apache.org/book.html#arch.bulk.load > > On Mon, Jan 22, 2018 at 8:52 AM, vignesh wrote: > > > If it is a bulk load I use spark hbase connector provided by hortonworks. > > For time series writes

RE: How spark writes to HBASE

2018-01-22 Thread vignesh
on't provide the affinity > described above. > > On Mon, Jan 22, 2018 at 7:19 AM, vignesh wrote: > > > Hi, > > > > I have a Spark job which reads some timeseries data and pushes that to > > HBASE using HBASE client API. I am executing this Spark job on

Re: How spark writes to HBASE

2018-01-22 Thread vignesh
't think that is the case. Multiple writes may be performed which would end up on different region servers. Spark won't provide the affinity described above. On Mon, Jan 22, 2018 at 7:19 AM, vignesh wrote: > Hi, > > I have a Spark job which reads some timeseries data and pushes

How spark writes to HBASE

2018-01-22 Thread vignesh
Hi, I have a Spark job which reads some timeseries data and pushes that to HBASE using HBASE client API. I am executing this Spark job on a 10 node cluster. Say at first when spark kicks off it picks machine1,machine2,machine3 as its executors. Now when the job inserts a row to HBASE. Below is wha