HyukjinKwon commented on a change in pull request #24405: [SPARK-27506][SQL] Allow deserialization of Avro data using compatible schemas URL: https://github.com/apache/spark/pull/24405#discussion_r316956886
########## File path: external/avro/src/main/scala/org/apache/spark/sql/avro/functions.scala ########## @@ -28,39 +28,59 @@ object functions { // scalastyle:on: object.name /** - * Converts a binary column of avro format into its corresponding catalyst value. The specified - * schema must match the read data, otherwise the behavior is undefined: it may fail or return - * arbitrary result. + * Converts a binary column of avro format into its corresponding catalyst value. If a writer's + * schema is provided, a different (but compatible) schema can be used for reading. If no writer's + * schema is provided, the specified schema must match the read data, otherwise the behavior is + * undefined: it may fail or return arbitrary result. * * @param data the binary column. * @param jsonFormatSchema the avro schema in JSON string format. + * @param writerJsonFormatSchema the avro schema in JSON string format used to serialize the data. * * @since 3.0.0 */ @Experimental def from_avro( data: Column, - jsonFormatSchema: String): Column = { - new Column(AvroDataToCatalyst(data.expr, jsonFormatSchema, Map.empty)) + jsonFormatSchema: String, + writerJsonFormatSchema: Option[String]): Column = { Review comment: Is it a Java native library? or Scala's? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org