[
https://issues.apache.org/jira/browse/THRIFT-765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bryan Duxbury updated THRIFT-765:
---------------------------------
Attachment: thrift-765-redux-v2.patch
OK, after looking over Lucene's implementation, I've managed to make an
approach that reclaims the performance benefits of the original broken version.
Protocols that use this could probably still benefit a little bit by reusing
the byte and char buffers used during encoding and decoding, but I haven't
thought all the way through the memory implications, so I'm leaving that out
for now.
If no one has any objections, I will commit this patch shortly.
> Improved string encoding and decoding performance
> -------------------------------------------------
>
> Key: THRIFT-765
> URL: https://issues.apache.org/jira/browse/THRIFT-765
> Project: Thrift
> Issue Type: Improvement
> Components: Library (Java)
> Affects Versions: 0.2
> Reporter: Bryan Duxbury
> Assignee: Bryan Duxbury
> Fix For: 0.3
>
> Attachments: thrift-765-redux-v2.patch, thrift-765-redux.patch,
> thrift-765.patch
>
>
> One of the most consistent time-consuming spots of Thrift serialization and
> deserialization is string encoding. For some inscrutable reason,
> String.getBytes("UTF-8") is slow.
> However, it's recently been brought to my attention that DataOutputStream's
> writeUTF method has a faster implementation of UTF-8 encoding and decoding.
> We should use this style of encoding.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.