Zhihong Zhang created AVRO-1073:
-----------------------------------

             Summary: JSON binary encoding doesn't work
                 Key: AVRO-1073
                 URL: https://issues.apache.org/jira/browse/AVRO-1073
             Project: Avro
          Issue Type: Improvement
          Components: java
    Affects Versions: 1.6.3
         Environment: uname -a
Darwin zmac 10.8.0 Darwin Kernel Version 10.8.0: Tue Jun  7 16:33:36 PDT 2011; 
root:xnu-1504.15.3~1/RELEASE_I386 i386

java -version
java version "1.6.0_29"
Java(TM) SE Runtime Environment (build 1.6.0_29-b11-402-10M3527)
Java HotSpot(TM) 64-Bit Server VM (build 20.4-b02-402, mixed mode)


            Reporter: Zhihong Zhang


The JSON encoder doesn't encode binary data (bytes or ByteBuffer) properly. It 
treats the data as Latin-1 and encode it as text. But there is a bug in text 
encoding of non-Roman characters so the binary encoding doesn't work at all.

The binary should be encoded into ASCII (Base64 or 85) before it's encoded in 
JSON. The text encoding of binary data is very inefficient even it works. For 
example, the 0 is encoded as "\\u0000" in JSON (7 bytes).

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to