Paul Rogers created DRILL-5825:
----------------------------------

             Summary: HBase storage plugin assumes local system encoding for 
column names
                 Key: DRILL-5825
                 URL: https://issues.apache.org/jira/browse/DRILL-5825
             Project: Apache Drill
          Issue Type: Bug
    Affects Versions: 1.11.0
            Reporter: Paul Rogers


The HBase storage plugin has the ability to expand wildcard columns at plan 
time as part of projection push-down. HBase column names are encoded in binary. 
Drill decodes the names using the default character set of the machine running 
Drill. While this works fine for ASCII names, there is no reason to believe it 
will work for non-ASCII names if the machine running Drill is different than 
the machine on which the column name was defined.

Offending code:

{code}
public class DrillHBaseTable extends DrillTable implements DrillHBaseConstants {
   ...
      fieldNameList.add(Bytes.toString(family));
{code}

>From {{String}} Javadoc:

bq. String(byte[] bytes):  Constructs a new String by decoding the specified 
array of bytes using the *platform's* default charset.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to