[ https://issues.apache.org/jira/browse/FLINK-10194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17321568#comment-17321568 ]
Flink Jira Bot commented on FLINK-10194: ---------------------------------------- This issue and all of its Sub-Tasks have not been updated for 180 days. So, it has been labeled "stale-minor". If you are still affected by this bug or are still interested in this issue, please give an update and remove the label. In 7 days the issue will be closed automatically. > Serialization issue with Scala's AggregateDataSet[Row] > ------------------------------------------------------ > > Key: FLINK-10194 > URL: https://issues.apache.org/jira/browse/FLINK-10194 > Project: Flink > Issue Type: Bug > Components: API / Type Serialization System > Environment: Flink v1.6.0 > Reporter: Alexis Sarda-Espinosa > Priority: Minor > Labels: scala, serialization, stale-minor > > > Consider the following code, where I had to jump through some hoops to > manually create a DataSet[Row] that allows using groupBy and sum as shown: > {code:java} > object Main { > def main(args: Array[String]): Unit = { > val env = ExecutionEnvironment.getExecutionEnvironment > val letters = Seq("a", "a", "b").map(Row.of(_, 1.asInstanceOf[Object])) > val typeInfo = new RowTypeInfo(BasicTypeInfo.STRING_TYPE_INFO, > BasicTypeInfo.INT_TYPE_INFO) > import scala.collection.JavaConverters._ > val inputFormat = new CollectionInputFormat(letters.asJavaCollection, > typeInfo.createSerializer(env.getConfig)) > val source = new DataSource(env.getJavaEnv, > inputFormat, > typeInfo, > "hello.flink.Main$.main(Main.scala:20") > > val dataSet = new DataSet(source) > dataSet.print() > dataSet > .groupBy(0) > .sum(1) > .print() > } > }{code} > The call to dataSet.print() works as expected, but the final print() throws > an exception: > {noformat} > Caused by: java.lang.ClassCastException: > org.apache.flink.api.java.typeutils.runtime.RowSerializer cannot be cast to > org.apache.flink.api.java.typeutils.runtime.TupleSerializerBase > at > org.apache.flink.api.scala.operators.ScalaAggregateOperator$AggregatingUdf.open(ScalaAggregateOperator.java:260){noformat} > Changing the final print() to collect() throws the same exception. > -- This message was sent by Atlassian Jira (v8.3.4#803005)