Original Thrift schema:

struct teststruct {
  1: optional string field1;
  2: optional string field2;
  3: optional string field3;
}

struct mainstruct {
  1: optional list<teststruct> teststructs;
}

This parquet file schema was generated:

message ParquetSchema {
  optional group teststructs {
    repeated group teststruct_tuple {
      optional binary field1;
      optional binary field2;
      optional binary field3;
    }
  }
}

When i try to run queries involving this 'teststructs' column, I get this
error:

Failed with exception java.io.IOException:java.lang.RuntimeException:
Invalid parquet hive schema: repeated group teststruct_tuple {
      optional binary field1;
      optional binary field2;
      optional binary field3;
    }



On Thu, Aug 14, 2014 at 8:35 AM, Brock Noland <br...@cloudera.com> wrote:

> Hi,
>
> Can you share your parquet schema?
>
> Brock
>
>
> On Tue, Aug 12, 2014 at 5:06 PM, Raymond Lau <raymond.lau...@gmail.com>
> wrote:
>
>> Hello.  (First off, sorry if I accidentally posted to the wrong mailing
>> list before - dev - and you are getting this again)
>>
>> Regarding the ArrayWritableGroupConverter class: I was just wondering how
>> come the field count has to be either 1 or 2?  I'm trying to read a column
>> where the amount is fields is 3 and I'm getting an invalid parquet hive
>> schema (in hive 0.12) error when I try to do so.  It looks like it links
>> back to here.
>>
>> *https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/convert/ArrayWritableGroupConverter.java
>> <https://github.com/apache/hive/blob/trunk/ql/src/java/org/apache/hadoop/hive/ql/io/parquet/convert/ArrayWritableGroupConverter.java>*
>>
>>
>> Thanks,
>> -Raymond
>>
>
>


-- 
*Raymond Lau*
Software Engineer - Intern |
r...@ooyala.com | (925) 395-3806

Reply via email to