[ 
https://issues.apache.org/jira/browse/SPARK-25378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16612399#comment-16612399
 ] 

Xiangrui Meng commented on SPARK-25378:
---------------------------------------

Comments from [~vomjom] at https://github.com/tensorflow/ecosystem/pull/100:

{quote}
We currently only do releases along with TensorFlow releases, and the next one 
that'll include this is TF 1.12.
{quote}

This means Spark+TF users cannot migrate to Spark 2.4 until TF 1.12 is 
released. I think we need to decide based on the impact instead of just saying 
"this is not a public API". If it is not pubic, why didn't we hide it in the 
first place? And as [~cloud_fan] mentioned, it is hard to implement data source 
without touching those "private" APIs.

> ArrayData.toArray(StringType) assume UTF8String in 2.4
> ------------------------------------------------------
>
>                 Key: SPARK-25378
>                 URL: https://issues.apache.org/jira/browse/SPARK-25378
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.4.0
>            Reporter: Xiangrui Meng
>            Priority: Critical
>
> The following code works in 2.3.1 but failed in 2.4.0-SNAPSHOT:
> {code}
> import org.apache.spark.sql.catalyst.util._
> import org.apache.spark.sql.types.StringType
> ArrayData.toArrayData(Array("a", "b")).toArray[String](StringType)
> res0: Array[String] = Array(a, b)
> {code}
> In 2.4.0-SNAPSHOT, the error is
> {code}java.lang.ClassCastException: java.lang.String cannot be cast to 
> org.apache.spark.unsafe.types.UTF8String
>   at 
> org.apache.spark.sql.catalyst.util.GenericArrayData.getUTF8String(GenericArrayData.scala:75)
>   at 
> org.apache.spark.sql.catalyst.InternalRow$$anonfun$getAccessor$8.apply(InternalRow.scala:136)
>   at 
> org.apache.spark.sql.catalyst.InternalRow$$anonfun$getAccessor$8.apply(InternalRow.scala:136)
>   at org.apache.spark.sql.catalyst.util.ArrayData.toArray(ArrayData.scala:178)
>   ... 51 elided
> {code}
> cc: [~cloud_fan] [~yogeshg]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to