Haizhou Zhao created FLINK-27255:
------------------------------------

             Summary: Flink-avro does not support serialization and 
deserialization of avro schema longer than 65535 characters
                 Key: FLINK-27255
                 URL: https://issues.apache.org/jira/browse/FLINK-27255
             Project: Flink
          Issue Type: Bug
          Components: Formats (JSON, Avro, Parquet, ORC, SequenceFile)
    Affects Versions: 1.14.4
            Reporter: Haizhou Zhao


The underlying serialization of avro schema uses string serialization method of 
ObjectOutputStream.class, however, the default string serialization by 
ObjectOutputStream.class does not support handling string of more than 66535 
characters (64kb). As a result, constructing flink operators that input/output 
Avro Generic Record with huge schema is not possible.

 

The purposed fix is two change the serialization and deserialization method of 
these following classes so that huge string could also be handled.

 

[GenericRecordAvroTypeInfo|https://github.com/apache/flink/blob/master/flink-formats/flink-avro/src/main/java/org/apache/flink/formats/avro/typeutils/GenericRecordAvroTypeInfo.java#L107]

[SerializableAvroSchema|https://github.com/apache/flink/blob/master/flink-formats/flink-avro/src/main/java/org/apache/flink/formats/avro/typeutils/SerializableAvroSchema.java#L55]

 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to