Re: Query regarding pre-split Major compaction

2016-09-11 Thread Manjeet Singh
Hi I am attaching screenshot [image: Inline image 2] can anyone help me to figure out I can see that my first region was empty as their was no start rowkey same with end row key second my data actually disturbed on only 2 nodes i have 5 nodes Thanks Manjeet On Mon, Sep 12, 2016 at 10:38 AM,

Re: Query regarding pre-split Major compaction

2016-09-11 Thread Manjeet Singh
Thanks Ted for your inputs I have write some algorithm to convert my some String to single char like # $ ! etc and its my salt so based on these I know whats my salt as my input data was so random and I need to know in advance what is my rowkey (Hash like Md5 generates long string , which coz some

Re: Query regarding pre-split Major compaction

2016-09-09 Thread Manjeet Singh
Thanks Ted for links links will help to determine how region split what should be the size etc which will really helpful but can you correct me if I am not wrong does my understanding was correct as I asked in trailing mail? I know what will be the salt based on my Mobile number coming in my data

Re: Query regarding pre-split Major compaction

2016-09-09 Thread Ted Yu
Please take a look at: http://hbase.apache.org/book.html#table_schema_rules_of_thumb http://hbase.apache.org/book.html#arch.regions.size http://hbase.apache.org/book.html#ops.capacity.regions http://hbase.apache.org/book.html#ops.capacity.regions.total On Fri, Sep 9, 2016 at 5:35 PM, Manjeet

Re: Query regarding pre-split Major compaction

2016-09-09 Thread Manjeet Singh
Yeah its in weekdays Yeah default is 10 gb so what is the way/forumla to knw what shuld be the size of RS On 9 Sep 2016 19:03, "Ted Yu" wrote: > Can you clarify whether the incoming data rate is for weekdays ? > > At 6-7 Gb /Hour, you need to set larger region size. >

Query regarding pre-split Major compaction

2016-09-09 Thread Manjeet Singh
Hi All I have some basic question can anyone help me out Q1. this is my understanding To perform splitting I need to create table like below create 'test_table','c1', SPLITS=>['#", '!', '$''] and I have to design row key in this way #_123456789 !_123456789 $_123456789 so my data distributed