[ https://issues.apache.org/jira/browse/HIVE-10016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371002#comment-14371002 ]
Dong Chen commented on HIVE-10016: ---------------------------------- Thanks for your review! [~Ferd]. Yes, Parquet have a new instance there. The ReadSupport instance in Hive side is just for providing some info for ParquetRecordReaderWrapper creation. > Remove duplicated Hive table schema parsing in DataWritableReadSupport > ---------------------------------------------------------------------- > > Key: HIVE-10016 > URL: https://issues.apache.org/jira/browse/HIVE-10016 > Project: Hive > Issue Type: Sub-task > Reporter: Dong Chen > Assignee: Dong Chen > Attachments: HIVE-10016-parquet.patch > > > In {{DataWritableReadSupport.init()}}, the table schema is created and its > string format is set in conf. When construct the > {{ParquetRecordReaderWrapper}} , the schema is fetched from conf and parsed > several times. > We could remove these schema parsing, and improve the speed of > getRecordReader a bit. -- This message was sent by Atlassian JIRA (v6.3.4#6332)