[ https://issues.apache.org/jira/browse/HIVE-21737?focusedWorklogId=527112&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-527112 ]
ASF GitHub Bot logged work on HIVE-21737: ----------------------------------------- Author: ASF GitHub Bot Created on: 22/Dec/20 09:04 Start Date: 22/Dec/20 09:04 Worklog Time Spent: 10m Work Description: iemejia edited a comment on pull request #1635: URL: https://github.com/apache/hive/pull/1635#issuecomment-749430903 @sunchao Great to know the cut is happening soon! If this is absolutely needed is a question of tradeoffs. Avro binary format has not changed since version 1.8.x when it introduced Logical Types, but more recent versions have removed dependencies from the public API (Jackson, Guava, JodaTime, etc) so I suppose catching up to the latest version can have the same risks as catching up to 1.8.x (which Hive already did) for the binary part. I suppose Hive users rarely use Avro directly from the transitive dependency and mostly rely on the Hive APIs (which I hope don't leak Avro) so this might diminish the risk, but of course there is a risk in that particular case. I am probably biased towards the upgrade because I come from the Avro side and I expect the full Big Data ecosystem to be updated and avoid issues because Hive contributors may introduce changes that are API incompatible with more recent versions of Avro and break downstream projects because of this (like the current parallel work on Spark). As usual in software it is all about tradeoffs. This decision is up to you guys as the maintainers and I might miss some other side effects in my analysis because I don't know Hive deeply. In any case if you guys decide to jump to the latest Avro version e.g. 1.10.1 and any issue happens I engage myself from the Avro side to do any fix and get out a release if required. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org Issue Time Tracking ------------------- Worklog Id: (was: 527112) Time Spent: 5h 20m (was: 5h 10m) > Upgrade Avro to version 1.10.1 > ------------------------------ > > Key: HIVE-21737 > URL: https://issues.apache.org/jira/browse/HIVE-21737 > Project: Hive > Issue Type: Improvement > Components: Hive > Reporter: Ismaël Mejía > Assignee: Fokko Driesprong > Priority: Major > Labels: pull-request-available > Attachments: > 0001-HIVE-21737-Make-Avro-use-in-Hive-compatible-with-Avr.patch > > Time Spent: 5h 20m > Remaining Estimate: 0h > > Avro >= 1.9.x bring a lot of fixes including a leaner version of Avro without > Jackson in the public API and Guava as a dependency. Worth the update. -- This message was sent by Atlassian Jira (v8.3.4#803005)