[
https://issues.apache.org/jira/browse/DRILL-5772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16163093#comment-16163093
]
ASF GitHub Bot commented on DRILL-5772:
---------------------------------------
Github user arina-ielchiieva commented on the issue:
https://github.com/apache/drill/pull/936
@paul-rogers, the unit tests just show which influence saffron property has
on Drill (since people in the community where asking how to enable support
UTF-8 in Drill), a long with this PR we'll also add description to Drill
documentation.
So far Drill relies on Calcite saffron property to determine which charset
it supports. By default, it's ISO-8859-1. So currently if customer wants to
query UTF-8 data in Drill, he will get an error (one of the unit tests shows
it) and needs to override saffron property to proceed. I guess, we don't
support UTF-8 by default since there are some issues and Drill did not fully
tested UTF-8 support.
> Add unit tests to indicate how utf-8 support can be enabled / disabled in
> Drill
> -------------------------------------------------------------------------------
>
> Key: DRILL-5772
> URL: https://issues.apache.org/jira/browse/DRILL-5772
> Project: Apache Drill
> Issue Type: Task
> Affects Versions: 1.11.0
> Reporter: Arina Ielchiieva
> Assignee: Arina Ielchiieva
> Labels: doc-impacting
> Fix For: 1.12.0
>
>
> Add unit test to indicated how utf-8 support can be enabled in Drill.
> To select utf-8 data user needs to update system property
> {{saffron.default.charset}} to {{UTF-16LE}} before starting the drillbit.
> Calcite uses this property to get default charset, if it is not set then
> {{ISO-8859-1}} is used by default. Drill gets default charset from Calcite.
> This information should be also documented, probably in
> https://drill.apache.org/docs/data-type-conversion/.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)