[ https://issues.apache.org/jira/browse/PARQUET-1481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16726806#comment-16726806 ]
Wes McKinney commented on PARQUET-1481:
---------------------------------------

The Thrift metadata is corrupt, but it is not checked. I'm submitting a patch.

> [C++] SEGV when reading corrupt parquet file
> --------------------------------------------
>
>                 Key: PARQUET-1481
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1481
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Hatem Helal
>            Assignee: Wes McKinney
>            Priority: Major
>         Attachments: corrupt.parquet
>
>
> >>> import pyarrow.parquet as pq
> >>> pq.read_table('corrupt.parquet')
> fish: 'python' terminated by signal SIGSEGV (Address boundary error)
>
> Stack report from macOS:
>
> 0 libsystem_kernel.dylib 0x00007fff51164cee __psynch_cvwait + 10
> 1 libsystem_pthread.dylib 0x00007fff512a1662 _pthread_cond_wait + 732
> 2 libc++.1.dylib 0x00007fff4f04acb0 std::__1::condition_variable::wait(std::__1::unique_lock<std::__1::mutex>&) + 18
> 3 libc++.1.dylib 0x00007fff4f04b728 std::__1::__assoc_sub_state::__sub_wait(std::__1::unique_lock<std::__1::mutex>&) + 46
> 4 libparquet.11.dylib 0x0000000115512d00 std::__1::__assoc_state<arrow::Status>::move() + 48
> 5 libparquet.11.dylib 0x00000001154faa15 parquet::arrow::FileReader::Impl::ReadTable(std::__1::vector<int, std::__1::allocator<int> > const&, std::__1::shared_ptr<arrow::Table>*) + 1093
> 6 libparquet.11.dylib 0x00000001154fb6fe parquet::arrow::FileReader::Impl::ReadTable(std::__1::shared_ptr<arrow::Table>*) + 350
> 7 libparquet.11.dylib 0x00000001154fce47 parquet::arrow::FileReader::ReadTable(std::__1::shared_ptr<arrow::Table>*) + 23
> 8 _parquet.so 0x000000011598d97b __pyx_pw_7pyarrow_8_parquet_13ParquetReader_9read_all(_object*, _object*, _object*) + 1035

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)