[ 
https://issues.apache.org/jira/browse/AVRO-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13739076#comment-13739076
 ] 

Scott Carey commented on AVRO-1348:
-----------------------------------

About a year ago I experimented with all sorts of UTF8 to string optimizations, 
using state machines and other techniques in addition to those similar to this 
patch and only ever got minor (5%) improvements.  It was hard to beat 'new 
String(bytes, 0, length, UTF8)' safely.  A fully custom state machine utf8 
decoder was almost 10% faster.  
                
> Improve Utf8 to String conversion
> ---------------------------------
>
>                 Key: AVRO-1348
>                 URL: https://issues.apache.org/jira/browse/AVRO-1348
>             Project: Avro
>          Issue Type: Bug
>            Reporter: Mark Wagner
>            Assignee: Mohammad Kamrul Islam
>         Attachments: AVRO1348v1.patch
>
>
> AVRO-1241 found that the existing method of creating Strings from Utf8 byte 
> arrays could be made faster. The same method is being used in the 
> Utf8.toString(), and could likely be sped up by doing the same thing.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to