[
https://issues.apache.org/jira/browse/AVRO-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13739076#comment-13739076
]
Scott Carey commented on AVRO-1348:
-----------------------------------
About a year ago I experimented with all sorts of UTF8 to string optimizations,
using state machines and other techniques in addition to those similar to this
patch and only ever got minor (5%) improvements. It was hard to beat 'new
String(bytes, 0, length, UTF8)' safely. A fully custom state machine utf8
decoder was almost 10% faster.
> Improve Utf8 to String conversion
> ---------------------------------
>
> Key: AVRO-1348
> URL: https://issues.apache.org/jira/browse/AVRO-1348
> Project: Avro
> Issue Type: Bug
> Reporter: Mark Wagner
> Assignee: Mohammad Kamrul Islam
> Attachments: AVRO1348v1.patch
>
>
> AVRO-1241 found that the existing method of creating Strings from Utf8 byte
> arrays could be made faster. The same method is being used in the
> Utf8.toString(), and could likely be sped up by doing the same thing.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira