[ https://issues.apache.org/jira/browse/SPARK-15831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Owen updated SPARK-15831: ------------------------------ Affects Version/s: 1.5.2 1.6.1 Target Version/s: (was: 1.5.0) > Kryo 2.21 TreeMap serialization bug causes random job failures with RDDs of > HBase puts > -------------------------------------------------------------------------------------- > > Key: SPARK-15831 > URL: https://issues.apache.org/jira/browse/SPARK-15831 > Project: Spark > Issue Type: Bug > Affects Versions: 1.5.2, 1.6.1 > Reporter: Charles GariƩpy-Ikeson > > This was found on Spark 1.5, but it seems that all Spark 1.x brings in the > problematic dependency in question. > Kryo 2.21 has a bug when serializing TreeMap that causes intermittent > failures in Spark. This problem cause be seen especially when sinking data to > HBase using a RDD of HBase Puts (which internally have TreeMap). > Kryo fixed the issue in 2.21.1. Current work around involves setting > "spark.kryo.referenceTracking" to false. > For reference see: > Kryo commit: > https://github.com/EsotericSoftware/kryo/commit/00ffc7ed443e022a8438d1e4c4f5b86fe4f9912b > TreeMap Kryo Issue: https://github.com/EsotericSoftware/kryo/issues/112 > HBase Put Kryo Issue: https://github.com/EsotericSoftware/kryo/issues/428 -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org