[ https://issues.apache.org/jira/browse/DRILL-8509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
James Turton updated DRILL-8509: -------------------------------- Summary: Use appropriate, or no, Unicode escaping on string values in JDBC writer (was: Use approporiate, or no, Unicode escaping on string values in JDBC writer) > Use appropriate, or no, Unicode escaping on string values in JDBC writer > ------------------------------------------------------------------------ > > Key: DRILL-8509 > URL: https://issues.apache.org/jira/browse/DRILL-8509 > Project: Apache Drill > Issue Type: Bug > Components: Storage - JDBC > Affects Versions: 1.21.2 > Reporter: James Turton > Priority: Minor > Fix For: Future > > > When characters outside of the ASCII printable range appear inside string > values passed to the JDBC writer via a CTAS with a JDBC storage plugin as its > destination, the JDBC writer replaces them with escape sequences embedded in > PostgreSQL-style Unicode strings prefixed with 'u&'. An example in which a > tab character is replaced with \0009 is [visible > here|https://github.com/apache/drill/issues/2922]. > # Review character encoding and escaping JdbcRecordWriter.java and > InsertStatementBuilder.java. > # Review the SqlDialect selection made by the JdbcWriter, looking for why a > PostgreSQL dialect [appears to have been selected for a JDBC connection to > MariaDB|https://github.com/apache/drill/issues/2922]. > # Determine whether a MySQL / MariaDB SQL dialect can be selected instead, > and whether this will resolve the issue. -- This message was sent by Atlassian Jira (v8.20.10#820010)