[ https://issues.apache.org/jira/browse/SPARK-14766?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15250525#comment-15250525 ]
Burak Yavuz commented on SPARK-14766: ------------------------------------- cc [~cloud_fan] [~marmbrus] > Attribute reference mismatch with Dataset filter + mapPartitions > ---------------------------------------------------------------- > > Key: SPARK-14766 > URL: https://issues.apache.org/jira/browse/SPARK-14766 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 2.0.0 > Reporter: Burak Yavuz > > After a filter, the Dataset references seem to be not copied properly leading > to an exception. To reproduce, you may use the following code: > {code} > Seq((1, 1)).toDS().filter(_._1 != 0).mapPartitions { iter => iter }.count() > {code} > Using explain shows the problem: > {code} > == Physical Plan == > !MapPartitions <function1>, newInstance(class scala.Tuple2), [input[0, > scala.Tuple2]._1 AS _1#38521,input[0, scala.Tuple2]._2 AS _2#38522] > +- WholeStageCodegen > : +- Filter <function1>.apply > : +- INPUT > +- LocalTableScan [_1#38512,_2#38513], [[0,1,1]] > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org