Re: Salting based on partial rowkeys

2018-09-16 Thread Gerald Sangudi
Jaanai, Thomas, Thanks for the feedback. I or my colleague will reply in this thread in the dev list. Gerald On Thu, Sep 13, 2018 at 10:01 PM, Thomas D'Silva wrote: > For the usage example that you provided when you write data how does the > values of id_1, id_2 and other_key vary? > I

Re: Salting based on partial rowkeys

2018-09-14 Thread Sergey Soldatov
Thomas is absolutely right that there will be a possibility of hotspotting. Salting is the mechanism that should prevent that in all cases (because all rowids are different). The partitioning described above actually can be implemented by using id2 as a first column of the PK and using presplit by

Re: Salting based on partial rowkeys

2018-09-14 Thread Josh Elser
Yeah, I think that's his point :) For a fine-grained facet, the hotspotting is desirable to co-locate the data for query. To try to make an example to drive this point home: Consider a primary key constraint(col1, col2, col3, col4); If I defined the SALT_HASH based on "col1" alone, you'd get

Re: Salting based on partial rowkeys

2018-09-13 Thread Thomas D'Silva
For the usage example that you provided when you write data how does the values of id_1, id_2 and other_key vary? I assume id_1 and id_2 remain the same while other_key is monotonically increasing, and thats why the table is salted. If you create the salt bucket only on id_2 then wouldn't you run

Re: Salting based on partial rowkeys

2018-09-13 Thread Jaanai Zhang
Sorry, I don't understander your purpose. According to your proposal, it seems that can't achieve. You need a hash partition, However, Some things need to clarify that HBase is a range partition engine and the salt buckets were used to avoid hotspot, in other words, HBase as a storage engine

Re: Salting based on partial rowkeys

2018-09-13 Thread Gerald Sangudi
Hi folks, Any thoughts or feedback on this? Thanks, Gerald On Mon, Sep 10, 2018 at 1:56 PM, Gerald Sangudi wrote: > Hello folks, > > We have a requirement for salting based on partial, rather than full, > rowkeys. My colleague Mike Polcari has identified the requirement and > proposed an

Salting based on partial rowkeys

2018-09-10 Thread Gerald Sangudi
Hello folks, We have a requirement for salting based on partial, rather than full, rowkeys. My colleague Mike Polcari has identified the requirement and proposed an approach. I found an already-open JIRA ticket for the same issue: https://issues.apache.org/jira/browse/PHOENIX-4757. I can provide