Quanlong Huang created IMPALA-10415:
---------------------------------------

             Summary: impala-shell crash in parsing multiline queries that 
contain UTF-8 characters
                 Key: IMPALA-10415
                 URL: https://issues.apache.org/jira/browse/IMPALA-10415
             Project: IMPALA
          Issue Type: Bug
          Components: Clients
            Reporter: Quanlong Huang
            Assignee: Quanlong Huang


Reproducing the issue by:
{code:java}
[localhost:21050] default> select "你好";
Query: select "你好"
Query submitted at: 2020-12-30 11:00:40 (Coordinator: 
http://quanlong-OptiPlex-BJ:25000)
Query progress can be monitored at: 
http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=554d2348a28884c6:30835a4800000000
+--------+
| '你好' |
+--------+
| 你好   |
+--------+
Fetched 1 row(s) in 0.12s
[localhost:21050] default> select
                         > "你好";
Traceback (most recent call last):
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2062, in 
<module>
    impala_shell_main()
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2027, in 
impala_shell_main
    shell.cmdloop(intro)
  File 
"/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py",
 line 141, in cmdloop
    line = self.precmd(line)
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 631, in 
precmd
    args = self.sanitise_input(args.decode('utf-8'))  # python2
  File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 435, in 
sanitise_input
    tokens = args.strip().split(' ')
UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 8: ordinal 
not in range(128) {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to