[jira] Updated: (THRIFT-765) Improved string encoding and decoding performance

Bryan Duxbury (JIRA) Fri, 30 Apr 2010 11:09:17 -0700

     [ 
https://issues.apache.org/jira/browse/THRIFT-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Bryan Duxbury updated THRIFT-765:
---------------------------------

    Attachment: thrift-765-redux-v2.patch

OK, after looking over Lucene's implementation, I've come up with an
approach that recovers the performance benefits of the original, broken version.
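
For illustration only, a hand-rolled UTF-8 encoder of the general kind discussed here might look like the sketch below. This is not the code in the attached patch and is not Lucene's implementation; the class name `Utf8Sketch` and the method `encode` are hypothetical. The choice to size the buffer at three bytes per char and to replace unpaired surrogates with `?` is an assumption made for the example.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical sketch: encode a String to standard UTF-8 by walking its chars.
final class Utf8Sketch {
    static byte[] encode(String s) {
        int len = s.length();
        // Worst case is 3 bytes per UTF-16 char (a surrogate pair is 2 chars -> 4 bytes).
        byte[] buf = new byte[len * 3];
        int pos = 0;
        for (int i = 0; i < len; i++) {
            char c = s.charAt(i);
            if (c < 0x80) {
                // 1 byte: plain ASCII
                buf[pos++] = (byte) c;
            } else if (c < 0x800) {
                // 2 bytes
                buf[pos++] = (byte) (0xC0 | (c >> 6));
                buf[pos++] = (byte) (0x80 | (c & 0x3F));
            } else if (Character.isHighSurrogate(c) && i + 1 < len
                    && Character.isLowSurrogate(s.charAt(i + 1))) {
                // 4 bytes: combine the surrogate pair into one code point
                int cp = Character.toCodePoint(c, s.charAt(++i));
                buf[pos++] = (byte) (0xF0 | (cp >> 18));
                buf[pos++] = (byte) (0x80 | ((cp >> 12) & 0x3F));
                buf[pos++] = (byte) (0x80 | ((cp >> 6) & 0x3F));
                buf[pos++] = (byte) (0x80 | (cp & 0x3F));
            } else if (Character.isSurrogate(c)) {
                // Unpaired surrogate: this sketch substitutes '?' (an assumption).
                buf[pos++] = (byte) '?';
            } else {
                // 3 bytes
                buf[pos++] = (byte) (0xE0 | (c >> 12));
                buf[pos++] = (byte) (0x80 | ((c >> 6) & 0x3F));
                buf[pos++] = (byte) (0x80 | (c & 0x3F));
            }
        }
        // Trim the oversized working buffer to the bytes actually written.
        return Arrays.copyOf(buf, pos);
    }

    public static void main(String[] args) {
        String sample = "h\u00e9llo \u20ac";
        byte[] expected = sample.getBytes(StandardCharsets.UTF_8);
        System.out.println(Arrays.equals(encode(sample), expected));
    }
}
```

Whether a loop like this actually beats `String.getBytes("UTF-8")` depends on the JVM and the input, so it should be benchmarked rather than assumed.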

Protocols that use this could probably still benefit a little from reusing
the byte and char buffers used during encoding and decoding, but I haven't
thought through the memory implications yet, so I'm leaving that out
for now.

If no one has any objections, I will commit this patch shortly.

> Improved string encoding and decoding performance
> -------------------------------------------------
>
>                 Key: THRIFT-765
>                 URL: https://issues.apache.org/jira/browse/THRIFT-765
>             Project: Thrift
>          Issue Type: Improvement
>          Components: Library (Java)
>    Affects Versions: 0.2
>            Reporter: Bryan Duxbury
>            Assignee: Bryan Duxbury
>             Fix For: 0.3
>
>         Attachments: thrift-765-redux-v2.patch, thrift-765-redux.patch, 
> thrift-765.patch
>
>
> One of the most consistently time-consuming spots in Thrift serialization and
> deserialization is string encoding. For some inscrutable reason,
> String.getBytes("UTF-8") is slow.
> However, it's recently been brought to my attention that DataOutputStream's
> writeUTF method has a faster implementation of UTF-8 encoding and decoding.
> We should use this style of encoding.
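
One caveat when reading the description above: `DataOutputStream.writeUTF` is documented to write "modified UTF-8" preceded by a two-byte length, so its output is not always byte-identical to `String.getBytes("UTF-8")`. The sketch below compares the two; the class name `WriteUtfComparison` and its helper methods are hypothetical and exist only for this comparison.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper: compare writeUTF's payload with standard UTF-8 bytes.
final class WriteUtfComparison {
    // Returns the bytes writeUTF emits, minus its 2-byte length prefix.
    static byte[] viaWriteUTF(String s) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(bytes);
            out.writeUTF(s);
            out.flush();
            byte[] all = bytes.toByteArray();
            return Arrays.copyOfRange(all, 2, all.length);
        } catch (IOException e) {
            // Not expected for an in-memory stream; rethrow unchecked.
            throw new UncheckedIOException(e);
        }
    }

    // Standard UTF-8 encoding via the JDK.
    static byte[] viaGetBytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String ascii = "hello";
        String supplementary = "\uD83D\uDE00"; // one code point outside the BMP
        System.out.println(Arrays.equals(viaWriteUTF(ascii), viaGetBytes(ascii)));
        System.out.println(viaWriteUTF(supplementary).length
                + " vs " + viaGetBytes(supplementary).length);
    }
}
```

The two agree for ASCII and other BMP text without U+0000, but differ for U+0000 and for supplementary characters, which modified UTF-8 writes as two three-byte surrogates instead of one four-byte sequence.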

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
