Zhenxiao Luo created PARQUET-74:
-----------------------------------

             Summary: Use thread local decoder cache in Binary 
toStringUsingUTF8()
                 Key: PARQUET-74
                 URL: https://issues.apache.org/jira/browse/PARQUET-74
             Project: Parquet
          Issue Type: Improvement
          Components: parquet-mr
            Reporter: Zhenxiao Luo


In Binary toStringUsingUTF8(), new String(...) is slower because it 
instantiates a new Decoder, while Charset#decode uses a thread-local decoder 
cache, it is much faster.

ByteArraySliceBackedBinary is using Charset#decode, while, 
ByteArrayBackedBinary and ByteBufferBackedBinary are still using new String()



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to