[ https://issues.apache.org/jira/browse/SPARK-23459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Xiao Li updated SPARK-23459: ---------------------------- Description: {noformat} test("save with an unknown partition column") { withTempDir { dir => val path = dir.getCanonicalPath Seq(1L -> "a").toDF("i", "j").write .format("parquet") .partitionBy("unknownColumn") .save(path) } } {noformat} We got the following error message: {noformat} Partition column unknownColumn not found in schema StructType(StructField(i,LongType,false), StructField(j,StringType,true)); {noformat} We should not call toString, but catalogString in the function `partitionColumnsSchema` of `PartitioningUtils.scala` was: {noformat} test("save with an unknown partition column") { withTempDir { dir => val path = dir.getCanonicalPath Seq(1L -> "a").toDF("i", "j").write .format("parquet") .partitionBy("unknownColumn") .save(path) } } {noformat} We got the following error message: Partition column unknownColumn not found in schema StructType(StructField(i,LongType,false), StructField(j,StringType,true)); We should not call toString, but catalogString in the function `partitionColumnsSchema` of `PartitioningUtils.scala` > Improve the error message when unknown column is specified in partition > columns > ------------------------------------------------------------------------------- > > Key: SPARK-23459 > URL: https://issues.apache.org/jira/browse/SPARK-23459 > Project: Spark > Issue Type: Bug > Components: SQL > Affects Versions: 2.3.0 > Reporter: Xiao Li > Priority: Major > Labels: starter > > {noformat} > test("save with an unknown partition column") { > withTempDir { dir => > val path = dir.getCanonicalPath > Seq(1L -> "a").toDF("i", "j").write > .format("parquet") > .partitionBy("unknownColumn") > .save(path) > } > } > {noformat} > We got the following error message: > {noformat} > Partition column unknownColumn not found in schema > StructType(StructField(i,LongType,false), StructField(j,StringType,true)); > {noformat} > We should not call toString, but catalogString in the function > `partitionColumnsSchema` of `PartitioningUtils.scala` -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org