Quanlong Huang created IMPALA-10415: ---------------------------------------
Summary: impala-shell crash in parsing multiline queries that contain UTF-8 characters Key: IMPALA-10415 URL: https://issues.apache.org/jira/browse/IMPALA-10415 Project: IMPALA Issue Type: Bug Components: Clients Reporter: Quanlong Huang Assignee: Quanlong Huang Reproducing the issue by: {code:java} [localhost:21050] default> select "你好"; Query: select "你好" Query submitted at: 2020-12-30 11:00:40 (Coordinator: http://quanlong-OptiPlex-BJ:25000) Query progress can be monitored at: http://quanlong-OptiPlex-BJ:25000/query_plan?query_id=554d2348a28884c6:30835a4800000000 +--------+ | '你好' | +--------+ | 你好 | +--------+ Fetched 1 row(s) in 0.12s [localhost:21050] default> select > "你好"; Traceback (most recent call last): File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2062, in <module> impala_shell_main() File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 2027, in impala_shell_main shell.cmdloop(intro) File "/home/quanlong/workspace/Impala/toolchain/toolchain-packages-gcc7.5.0/python-2.7.16/lib/python2.7/cmd.py", line 141, in cmdloop line = self.precmd(line) File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 631, in precmd args = self.sanitise_input(args.decode('utf-8')) # python2 File "/home/quanlong/workspace/Impala/shell/impala_shell.py", line 435, in sanitise_input tokens = args.strip().split(' ') UnicodeDecodeError: 'ascii' codec can't decode byte 0xe4 in position 8: ordinal not in range(128) {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)