[ 
https://issues.apache.org/jira/browse/IMPALA-5323?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16998675#comment-16998675
 ] 

Adar Dembo commented on IMPALA-5323:
------------------------------------

Although it's true that Kudu _stores_ STRING and BINARY data in the same way 
and isn't opinionated on the contents, the clients behave somewhat differently: 
STRING data is expected to be encoded in UTF-8, or at least be UTF-8 compatible 
(a la US ASCII). Today there's no validation (in that I don't think Java's 
{{new String(byte[], UTF_8)}} does anything useful), and although it's tempting 
to say that we reserve the right to add such validation in the future, it's 
likely that doing so would break backwards compatibility for too many users who 
have come to expect the ability store binary data in STRING columns.

BTW, I don't think the documentation Pavel linked to has been updated; I don't 
see this restriction or its workaround documented anywhere.


> Support Kudu BINARY
> -------------------
>
>                 Key: IMPALA-5323
>                 URL: https://issues.apache.org/jira/browse/IMPALA-5323
>             Project: IMPALA
>          Issue Type: New Feature
>          Components: Backend
>            Reporter: Pavel Martynov
>            Priority: Major
>              Labels: kudu
>
> I trying to 'CREATE EXTERNAL TABLE STORED AS KUDU' on the table with BINARY 
> Kudu column data type and got an error: Kudu type 'binary' is not supported 
> in Impala.
> This limitation is not documented, checked:
> https://impala.incubator.apache.org/docs/build/html/topics/impala_kudu.html
> https://kudu.apache.org/docs/kudu_impala_integration.html#_known_issues_and_limitations
> There are some thoughts that Kudu BINARY data type may be supported by 
> Impala's STRING data type:
> https://community.cloudera.com/t5/Interactive-Short-cycle-SQL/Does-impala-support-binary-data-type/td-p/24366
> https://groups.google.com/a/cloudera.org/forum/#!msg/impala-user/muguKJU3c3I/_oArmoxSlDMJ



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to