JsonEncoder is not flushed after writing using ReflectDatumWriter
------------------------------------------------------------------
Key: AVRO-295
URL: https://issues.apache.org/jira/browse/AVRO-295
Project: Avro
Issue Type: New Feature
Components: java
Affects Versions: 1.3.0
Reporter: Jonathan Hsieh
JsonEncoder needs to be flushed otherwise data may be left in its buffers.
Ideally behavior should be the same regardless of what kind of Encoder passed
in. Here is some example code:
class A {
long timestamp;
}
public void testEventSchemaSerializeBinary() throws IOException {
A e = new A();
e.timestamp = 1234;
ReflectData reflectData = ReflectData.get();
Schema schm = reflectData.getSchema(A.class);
System.out.println(schm);
ReflectDatumWriter writer = new ReflectDatumWriter(schm);
ByteArrayOutputStream out = new ByteArrayOutputStream();
Encoder json = new BinaryEncoder(out);
writer.write(e, json); // only one calls
byte[] bs = out.toByteArray();
int len = bs.length; // length is 2, which is reasonable.
System.out.println("output size: " + len);
}
public void testSerializeJson() throws IOException {
A a = new A();
a.timestamp = 1234;
ReflectData reflectData = ReflectData.get();
Schema schm = reflectData.getSchema(A.class);
ReflectDatumWriter writer = new ReflectDatumWriter(schm);
ByteArrayOutputStream out = new ByteArrayOutputStream();
JsonEncoder json = new JsonEncoder(schm, out);
writer.write(e, json); /// only one call
// did not flush
byte[] bs = out.toByteArray();
int len = bs.length; // len == 0; this is unexpected!
System.out.println("output size: " + len);
// flushed this time.
json.flush();
bs = out.toByteArray();
len = bs.length; // len == 18; this is better!
System.out.println("output size: " + len);
}
One way to deal with it is to have either all Encoders have flush method (so
the DatumWriter can always flush it, and potentially add a flush method to
DatumWriter as well.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.