Burak Yavuz created SPARK-14766: ----------------------------------- Summary: Attribute reference mismatch with Dataset filter + mapPartitions Key: SPARK-14766 URL: https://issues.apache.org/jira/browse/SPARK-14766 Project: Spark Issue Type: Bug Components: SQL Affects Versions: 2.0.0 Reporter: Burak Yavuz
After a filter, the Dataset references seem to be not copied properly leading to an exception. To reproduce, you may use the following code: {code} Seq((1, 1)).toDS().filter(_._1 != 0).mapPartitions { iter => iter }.count() {code} Using explain shows the problem: {code} == Physical Plan == !MapPartitions <function1>, newInstance(class scala.Tuple2), [input[0, scala.Tuple2]._1 AS _1#38521,input[0, scala.Tuple2]._2 AS _2#38522] +- WholeStageCodegen : +- Filter <function1>.apply : +- INPUT +- LocalTableScan [_1#38512,_2#38513], [[0,1,1]] {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org