Jacek Laskowski created SPARK-20416: ---------------------------------------
Summary: Column names inconsistent for UDFs in SQL vs Dataset Key: SPARK-20416 URL: https://issues.apache.org/jira/browse/SPARK-20416 Project: Spark Issue Type: Improvement Components: SQL Affects Versions: 2.2.0 Reporter: Jacek Laskowski Priority: Minor As you can see below, the name of the columns in SQL vs Dataset is different. {code} scala> val timesTwoUDF = spark.udf.register("timesTwo", (x: Int) => x * 2) timesTwoUDF: org.apache.spark.sql.expressions.UserDefinedFunction = UserDefinedFunction(<function1>,IntegerType,Some(List(IntegerType))) scala> spark.sql("SELECT timesTwo(1)").show +---------------+ |UDF:timesTwo(1)| +---------------+ | 2| +---------------+ scala> spark.range(1, 2).toDF("x").select(timesTwoUDF($"x")).show +------+ |UDF(x)| +------+ | 2| +------+ {code} -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org