[ https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wes McKinney resolved PARQUET-1571. ----------------------------------- Resolution: Fixed Fix Version/s: cpp-1.6.0 Issue resolved by pull request 4275 [https://github.com/apache/arrow/pull/4275] > [C++] Can't read data from parquet file in C++ library > ------------------------------------------------------ > > Key: PARQUET-1571 > URL: https://issues.apache.org/jira/browse/PARQUET-1571 > Project: Parquet > Issue Type: Bug > Components: parquet-cpp > Reporter: worker24h > Priority: Critical > Labels: pull-request-available > Fix For: cpp-1.6.0 > > Time Spent: 1h 40m > Remaining Estimate: 0h > > Specified the second param *parquet::ReaderProperties* When I used > parquet::ParquetFileReader::Open, it can't work. > The following code: > {code:java} > parquet::ReaderProperties _properties; > _properties = parquet::ReaderProperties(); > _properties.enable_buffered_stream(); // used buffer stream. Don't set > buffer-size > parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties); > ... > int32_t value; > parquet::Int32Reader* int32_reader = > static_cast<parquet::Int32Reader*>(column_reader.get()); > int32_reader->Skip(_current_line_of_group);// skip lines of processed. > rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, > &values_read); > {code} > The interface *Skip* throw exception: > {color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid > data Deserializing page header failed.}}{color} > -- This message was sent by Atlassian JIRA (v7.6.3#76005)