Jim Scott created DRILL-1439:
--------------------------------

             Summary: Allow file / field text encoding type to be specifed so 
that UTF-8 default can be overriden 
                 Key: DRILL-1439
                 URL: https://issues.apache.org/jira/browse/DRILL-1439
             Project: Apache Drill
          Issue Type: Bug
          Components: Storage - Text & CSV
    Affects Versions: 0.5.0
            Reporter: Jim Scott


Drill parsed a field and found that the byte value 0xb5 was invalid. That byte 
is from ISO-8859-1 and not UTF-8. The equivalent character in UTF-8 would be 
0xc2b5. If there was a way to tell the function or the file is a different 
encoding this problem could be prevented.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to