[ https://issues.apache.org/jira/browse/HIVE-693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zheng Shao updated HIVE-693: ---------------------------- Attachment: HIVE-693.2.patch Incorporated Ashish's comments. Also removed the column definition since they will come directly from serde. @Andraz: For speed improvement: Instead of using regex, you can read in the data as org.apache.hadoop.io.Text, and do split by yourself. Each field can be stored in a Text as well, and the Text object can be reused across the rows. In this way, the processing will be much faster. > Add a AWS S3 log format deserializer > ------------------------------------ > > Key: HIVE-693 > URL: https://issues.apache.org/jira/browse/HIVE-693 > Project: Hadoop Hive > Issue Type: New Feature > Components: Serializers/Deserializers > Reporter: Zheng Shao > Assignee: Andraz Tori > Fix For: 0.3.0 > > Attachments: HIVE-693.1.patch, HIVE-693.2.patch, inputs3.q, s3.log, > s3deserializer.diff, S3LogDeserializer.java, S3LogStruct.java > > -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.