Jarry Shaw created THRIFT-4677:
----------------------------------

             Summary: UnicodeDecodeError in Python3
                 Key: THRIFT-4677
                 URL: https://issues.apache.org/jira/browse/THRIFT-4677
             Project: Thrift
          Issue Type: Bug
          Components: Python - Library
         Environment: Operating System: Windows 10 Pro (Simplified Chinese)
                      Python Interpreter: Python 3.6.6
                      {{osquery}} Version: 3.3.0
                      {{osquery-python}} Version: 3.0.5
            Reporter: Jarry Shaw
         Attachments: compat.py

This issue occurs when using [osquery-python|https://github.com/osquery/osquery-python] (the Python binding of [osquery|https://osquery.io/] by Facebook).

When querying, a {{UnicodeDecodeError}} is raised from {{thrift.compat.binary_to_str}} with the error message "{{'utf-8' codec can't decode byte 0xc3 in position 0: invalid continuation byte}}". The cause is that the {{bin_val}} parameter is decoded as UTF-8 when its encoding should be "{{gbk}}".

Possible approaches are:
* add a parameter that lets the user specify the encoding
* get the system encoding through {{locale.getpreferredencoding()}}
* call {{bin_val.decode}} with the {{errors='replace'}} or {{errors='ignore'}} parameter
* introduce {{chardet}} to detect the encoding

The attachment is my hack solution to this issue (though not perfect).

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
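
For illustration, the first three approaches listed in the report could be combined roughly as follows. This is only a sketch under stated assumptions: {{decode_with_fallback}} and its {{encodings}} and {{errors}} parameters are hypothetical names, it is not the attached {{compat.py}}, and it is not Thrift's actual {{binary_to_str}}.

```python
import locale


def decode_with_fallback(bin_val, encodings=None, errors="replace"):
    """Hypothetical helper (not part of Thrift): decode bytes leniently."""
    if encodings is None:
        # Default candidates: UTF-8 first, then the encoding the OS locale
        # reports (typically a GBK code page on Simplified Chinese Windows).
        encodings = ["utf-8", locale.getpreferredencoding(False)]
    for name in encodings:
        try:
            # Strict decode: succeeds only if the bytes are valid in `name`.
            return bin_val.decode(name)
        except (UnicodeDecodeError, LookupError):
            # Invalid bytes or unknown codec name: try the next candidate.
            continue
    # Last resort: do not raise; undecodable bytes are handled per `errors`
    # ('replace' substitutes U+FFFD, 'ignore' drops them).
    return bin_val.decode("utf-8", errors=errors)


if __name__ == "__main__":
    gbk_bytes = "\u4e2d\u6587".encode("gbk")
    print(decode_with_fallback(gbk_bytes, encodings=["utf-8", "gbk"]))
```

Trying UTF-8 first keeps the behaviour unchanged for input that is already valid UTF-8. The drawback is that bytes in another encoding which happen to form valid UTF-8 would be decoded incorrectly without any error, which is why an explicit user-supplied encoding is the more predictable option.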