Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20521#discussion_r166403941

    --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSource.scala ---
    @@ -493,9 +510,23 @@ case class DataSource(
               dataSource.createRelation(
                 sparkSession.sqlContext, mode, caseInsensitiveOptions, Dataset.ofRows(sparkSession, data))
           case format: FileFormat =>
    -        sparkSession.sessionState.executePlan(planForWritingFileFormat(format, mode, data)).toRdd
    +        val cmd = planForWritingFileFormat(format, mode, data)
    +        val resolvedPartCols = cmd.partitionColumns.map { col =>
    +          // The partition columns created in `planForWritingFileFormat` should always be
    +          // `UnresolvedAttribute` with a single name part.
    +          assert(col.isInstanceOf[UnresolvedAttribute])
    +          val unresolved = col.asInstanceOf[UnresolvedAttribute]
    +          assert(unresolved.nameParts.length == 1)
    +          val name = unresolved.nameParts.head
    +          outputColumns.find(a => equality(a.name, name)).getOrElse {
    +            throw new AnalysisException(
    +              s"Unable to resolve $name given [${data.output.map(_.name).mkString(", ")}]")
    +          }
    +        }
    +        val resolved = cmd.copy(partitionColumns = resolvedPartCols, outputColumns = outputColumns)
    --- End diff --

    The previous code calls `sparkSession.sessionState.executePlan` to analyze/optimize/plan/execute this temporary `InsertIntoHadoopFsRelationCommand`, which is pretty hacky: at this point we are already executing the CTAS and already have the final physical plan. Here we manually analyze the `InsertIntoHadoopFsRelationCommand`, so that we only reuse the physical execution part.
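    For readers without the surrounding file context, below is a minimal sketch of the resolution step the new code performs. It uses simplified stand-in types: `Attribute`, `UnresolvedAttribute`, `resolvePartitionColumns`, and the `caseSensitive` flag here are illustrative placeholders, not the actual Catalyst classes or a real Spark helper.

    // Illustrative sketch (not the Spark source): resolve each partition column,
    // expected to be a single-part unresolved name, against the query's output
    // columns using case-sensitive or case-insensitive name equality.
    case class Attribute(name: String)
    case class UnresolvedAttribute(nameParts: Seq[String])

    def resolvePartitionColumns(
        partitionColumns: Seq[UnresolvedAttribute],
        outputColumns: Seq[Attribute],
        caseSensitive: Boolean): Seq[Attribute] = {
      val equality: (String, String) => Boolean =
        if (caseSensitive) (a: String, b: String) => a == b
        else (a: String, b: String) => a.equalsIgnoreCase(b)

      partitionColumns.map { unresolved =>
        // planForWritingFileFormat only ever creates single-part names, so
        // anything else would indicate a bug upstream.
        require(unresolved.nameParts.length == 1,
          s"Expected a single-part name, got ${unresolved.nameParts}")
        val name = unresolved.nameParts.head
        outputColumns.find(a => equality(a.name, name)).getOrElse {
          throw new IllegalArgumentException(
            s"Unable to resolve $name given [${outputColumns.map(_.name).mkString(", ")}]")
        }
      }
    }

    // Example: with caseSensitive = false, "P" resolves to the output column "p".
    // resolvePartitionColumns(
    //   Seq(UnresolvedAttribute(Seq("P"))),
    //   Seq(Attribute("p"), Attribute("v")),
    //   caseSensitive = false)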