Chris Bartak created ARROW-1306:
-----------------------------------

             Summary: [Python] Encoding? issue with error reporting for parquet.read_table
                 Key: ARROW-1306
                 URL: https://issues.apache.org/jira/browse/ARROW-1306
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.5.0
            Reporter: Chris Bartak

This only affects error reporting: the filename in the exception raised for a file that does not exist is getting garbled somehow. Example below.

{code:python}
import pyarrow.parquet as pq

pq.read_table('non_existent_file.parquet')
---------------------------------------------------------------------------
ArrowIOError                              Traceback (most recent call last)
----> 1 pq.read_table('non_existent_file.parquet')

~\AppData\Local\Continuum\Anaconda3\envs\py36\lib\site-packages\pyarrow\parquet.py in read_table(source, columns, nthreads, metadata, use_pandas_metadata)
    709                                     metadata=metadata)
    710
--> 711     pf = ParquetFile(source, metadata=metadata)
    712     return pf.read(columns=columns, nthreads=nthreads,
    713                    use_pandas_metadata=use_pandas_metadata)

~\AppData\Local\Continuum\Anaconda3\envs\py36\lib\site-packages\pyarrow\parquet.py in __init__(self, source, metadata, common_metadata)
     52     def __init__(self, source, metadata=None, common_metadata=None):
     53         self.reader = ParquetReader()
---> 54         self.reader.open(source, metadata=metadata)
     55         self.common_metadata = common_metadata
     56

_parquet.pyx in pyarrow._parquet.ParquetReader.open()

io.pxi in pyarrow.lib.get_reader()

io.pxi in pyarrow.lib.memory_map()

io.pxi in pyarrow.lib.MemoryMappedFile._open()

error.pxi in pyarrow.lib.check_status()

ArrowIOError: IOError: Failed to open file: 潮彮硥獩整瑮晟汩慰煲敵
{code}

Versions: Python 3.6, Windows x64

{code}
arrow-cpp:   0.5.0-np112py36_vc14_1  conda-forge [vc14]
parquet-cpp: 1.2.0.pre-vc14_3        conda-forge [vc14]
pyarrow:     0.5.0-np112py36_vc14_0  conda-forge [vc14]
{code}

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
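
One possible explanation for the garbled filename, which this report does not confirm, is that the UTF-8 bytes of the path are being read back as UTF-16 code units somewhere in the error path on Windows. The sketch below is pure Python and does not call pyarrow. The helper name `misread_utf8_as_utf16` is hypothetical and exists only for illustration. It reproduces that kind of misinterpretation so the result can be compared with the text in the exception.

```python
def misread_utf8_as_utf16(text: str) -> str:
    """Encode text as UTF-8, then decode the same bytes as UTF-16-LE.

    This only imitates a narrow/wide string mix-up; it is an assumption
    about the cause, not something taken from the Arrow sources.
    """
    raw = text.encode("utf-8")
    # UTF-16 consumes bytes in pairs, so drop a trailing odd byte.
    raw = raw[: len(raw) - (len(raw) % 2)]
    return raw.decode("utf-16-le", errors="replace")


if __name__ == "__main__":
    requested = "non_existent_file.parquet"
    garbled = misread_utf8_as_utf16(requested)
    # Print the result and its code points for comparison with the
    # filename shown in the ArrowIOError message.
    print(garbled)
    print([hex(ord(ch)) for ch in garbled])
```

If the printed characters line up with the ones in the error message, that would suggest a byte-string versus wide-string confusion when the message is built, and not a problem with the file lookup itself.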