[ 
https://issues.apache.org/jira/browse/SPARK-27733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17212451#comment-17212451
 ] 

Ismaël Mejía commented on SPARK-27733:
--------------------------------------

[~sha...@uber.com] sorry, I somehow missed the previous notification. If you 
have a Parquet sync coming up I would love to join and explain the full 
details. Otherwise, the tl;dr version: the fix on the Spark side is 'easy'. The 
real problem is that Spark gets its Avro 1.8 dependency via Hive, and getting 
the fix into Hive has proven difficult (already more than a year in the 
making). Even once the fix is merged, it still needs to be backported to the 
2.3.x branch and shipped in a release. We need LOTS of goodwill and help from 
the Hive people, so if you know anyone there who can help, that would be 
appreciated.

The issue is related to stricter validation of unions with ill-defined 
defaults, starting with Avro 1.9.x. There are multiple options to deal with 
this (discussed in the ticket), but we will probably do a fix on the Avro side. 
I will post an update here once we agree on the fix. In the worst case it will 
require a new Avro release too, but the timing is good since we were already 
discussing a release soon.
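
To illustrate what an "ill-defined default" on a union can look like: the Avro 
specification says that the default value of a union-typed field must match 
the first branch of the union. The sketch below is only an illustration of 
that rule in plain Python. It is not Avro's validation code; the helper 
ill_defined_union_defaults, the PRIMITIVE_CHECKS table and the sample User 
schema are all hypothetical, and only primitive first branches are handled.

```python
import json

# Hypothetical table (not part of any Avro library): maps a primitive Avro
# type name to a predicate on the JSON-decoded default value.
PRIMITIVE_CHECKS = {
    "null": lambda v: v is None,
    "boolean": lambda v: isinstance(v, bool),
    "int": lambda v: isinstance(v, int) and not isinstance(v, bool),
    "long": lambda v: isinstance(v, int) and not isinstance(v, bool),
    "float": lambda v: isinstance(v, (int, float)) and not isinstance(v, bool),
    "double": lambda v: isinstance(v, (int, float)) and not isinstance(v, bool),
    "string": lambda v: isinstance(v, str),
    "bytes": lambda v: isinstance(v, str),
}


def ill_defined_union_defaults(schema_json):
    """Return the names of record fields whose union default does not match
    the first branch of the union (the rule stated in the Avro spec)."""
    schema = json.loads(schema_json)
    bad = []
    for field in schema.get("fields", []):
        ftype = field["type"]
        # Only union-typed fields (JSON arrays) that declare a default matter.
        if not isinstance(ftype, list) or "default" not in field:
            continue
        first = ftype[0]
        check = PRIMITIVE_CHECKS.get(first) if isinstance(first, str) else None
        if check is None:
            continue  # complex first branch: out of scope for this sketch
        if not check(field["default"]):
            bad.append(field["name"])
    return bad


# Hypothetical schema: "nickname" and "age" have defaults that do not match
# the first branch of their union; "email" is well defined.
SCHEMA = """
{"type": "record", "name": "User", "fields": [
  {"name": "nickname", "type": ["string", "null"], "default": null},
  {"name": "email",    "type": ["null", "string"], "default": null},
  {"name": "age",      "type": ["null", "int"],    "default": 0}
]}
"""

print(ill_defined_union_defaults(SCHEMA))  # ['nickname', 'age']
```

Schemas like the first and third fields above are presumably the kind that 
older readers tolerated and that stricter default validation rejects.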

> Upgrade to Avro 1.10.0
> ----------------------
>
>                 Key: SPARK-27733
>                 URL: https://issues.apache.org/jira/browse/SPARK-27733
>             Project: Spark
>          Issue Type: Improvement
>          Components: Build, SQL
>    Affects Versions: 3.1.0
>            Reporter: Ismaël Mejía
>            Priority: Minor
>
> Avro 1.9.2 was released with many nice features, including reduced size (1MB 
> smaller), removed dependencies (no paranamer, no shaded guava), and security 
> updates, so it is probably a worthwhile upgrade.
> Avro 1.10.0 has since been released and the upgrade is still not done.
> At the moment (2020/08) there is still a blocker: Hive-related transitive 
> dependencies bring in older versions of Avro, so this remains blocked until 
> HIVE-21737 is solved.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
