It might mean that some partitions were computed on two nodes, because a task for one of them couldn't be scheduled locally on the first node. Did the RDD really have 426 partitions in total? You can click on it to see where the copies of each partition are.
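As a rough sketch of the arithmetic: if the UI computes "Fraction Cached" as cached block copies divided by the RDD's total partition count, and rounds half up to a whole percent (both are assumptions, not something confirmed in this thread), then 426 cached blocks at 107% would point to roughly 400 actual partitions. The helper names below are hypothetical and are not part of Spark.

```python
# Hypothetical helpers, not a Spark API. Assumption: the displayed percentage
# is (cached block copies / total partitions), rounded half up.

def fraction_cached_percent(cached_blocks, total_partitions):
    """Percentage of the RDD that appears cached, counting duplicate copies."""
    if total_partitions <= 0:
        raise ValueError("total_partitions must be positive")
    return 100.0 * cached_blocks / total_partitions


def possible_partition_counts(cached_blocks, shown_percent):
    """Partition counts whose percentage would round (half up) to shown_percent."""
    return [
        n
        for n in range(1, cached_blocks + 1)
        if int(fraction_cached_percent(cached_blocks, n) + 0.5) == shown_percent
    ]


# Values taken from the storage page quoted in this thread.
print(possible_partition_counts(426, 107))
```

Under those assumptions, a few dozen partitions would each have a second cached copy on another node.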

Matei

> On Nov 8, 2014, at 10:16 PM, Nathan Kronenfeld <nkronenf...@oculusinfo.com> wrote:
>
> RDD Name: 8 <http://hadoop-s1.oculus.guest:4042/storage/rdd?id=8>
> Storage Level: Memory Deserialized 1x Replicated
> Cached Partitions: 426
> Fraction Cached: 107%
> Size in Memory: 59.7 GB
> Size in Tachyon: 0.0 B
> Size on Disk: 0.0 B
>
> Anyone understand what it means to have more than 100% of an rdd cached?
>
> Thanks,
> -Nathan