Yubao Liu created HBASE-25357:
---------------------------------

             Summary: allow specifying binary row key range to pre-split regions
                 Key: HBASE-25357
                 URL: https://issues.apache.org/jira/browse/HBASE-25357
             Project: HBase
          Issue Type: Improvement
          Components: spark
            Reporter: Yubao Liu


Currently, the Spark HBase connector uses a `String` to specify regionStart 
and regionEnd, but we often have serialized binary row keys. I made a small 
patch at [https://github.com/apache/hbase-connectors/pull/72/files] that always 
treats the `String` as ISO_8859_1, so we can put raw bytes into the String 
object and get them back unchanged.
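
To illustrate why ISO_8859_1 works as a byte-preserving wrapper, here is a 
minimal standalone sketch. It is not code from the patch; the class and method 
names (`BinaryKeyOption`, `wrap`, `unwrap`) are hypothetical and only show the 
charset round trip, assuming the connector decodes the option value with the 
same charset.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper, not part of hbase-connectors.
class BinaryKeyOption {
    // Wrap raw row key bytes in a String: each byte becomes one char in U+0000..U+00FF.
    static String wrap(byte[] rowKey) {
        return new String(rowKey, StandardCharsets.ISO_8859_1);
    }

    // Recover the original bytes from a String produced by wrap().
    static byte[] unwrap(String option) {
        return option.getBytes(StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        // Build a key containing every possible byte value.
        byte[] all = new byte[256];
        for (int i = 0; i < 256; i++) {
            all[i] = (byte) i;
        }

        // ISO_8859_1 maps each byte to exactly one char and back, so nothing is lost.
        System.out.println(Arrays.equals(all, unwrap(wrap(all))));

        // By contrast, decoding arbitrary bytes as UTF-8 replaces invalid
        // sequences with U+FFFD, so the original bytes cannot be recovered.
        byte[] viaUtf8 = new String(all, StandardCharsets.UTF_8)
                .getBytes(StandardCharsets.UTF_8);
        System.out.println(Arrays.equals(all, viaUtf8));
    }
}
```

The first comparison succeeds and the second does not, which is why a lossy 
charset such as UTF-8 cannot be used to carry arbitrary binary keys through a 
string-only option.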

This has a drawback: if your row keys are really UTF-8 strings, you have to 
convert them to UTF-8 encoded bytes and then wrap those bytes in an ISO_8859_1 
string. This is a limitation of the Spark option interface, which only accepts 
a string-to-string map.
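
The extra step for UTF-8 row keys could look like the sketch below. Again this 
is only an illustration under the assumption that the connector reads the 
option back with ISO_8859_1; `Utf8KeyOption` and `wrapUtf8Key` are hypothetical 
names, not part of the connector.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper, not part of hbase-connectors.
class Utf8KeyOption {
    // Encode a textual row key as UTF-8, then carry those bytes in an ISO_8859_1 String.
    static String wrapUtf8Key(String rowKey) {
        byte[] utf8 = rowKey.getBytes(StandardCharsets.UTF_8);
        return new String(utf8, StandardCharsets.ISO_8859_1);
    }

    public static void main(String[] args) {
        String key = "caf\u00e9"; // "café": the last character needs two bytes in UTF-8
        byte[] expected = key.getBytes(StandardCharsets.UTF_8);

        // Wrapped value, decoded the way the patched connector is assumed to decode it.
        byte[] fromWrapped = wrapUtf8Key(key).getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(Arrays.equals(expected, fromWrapped));

        // Passing the key directly yields different bytes: U+00E9 becomes the
        // single byte 0xE9 instead of the UTF-8 sequence 0xC3 0xA9.
        byte[] fromDirect = key.getBytes(StandardCharsets.ISO_8859_1);
        System.out.println(Arrays.equals(expected, fromDirect));
    }
}
```

Pure ASCII keys are unaffected, since ASCII characters encode to the same 
bytes in both charsets; only keys with characters above U+007F need the 
explicit wrapping.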



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
