Re: Codereview request for 7183053: Optimize DoubleByte charset for String.getBytes()/new String(byte[])

Alan Bateman Fri, 13 Jul 2012 05:20:10 -0700

On 11/07/2012 00:11, Xueming Shen wrote:

Hi,
In JDK7, the decoder and encoder implementation of most of our single-byte charsets and UTF-8 charset are optimized to implement the internal interface sun.nio.cs.ArrayDecoder/Encoder to provide a fastpath for String.getBytes(...) and new String(byte[]...) operations. I
have an old blog regarding this optimization at

https://blogs.oracle.com/xuemingshen/entry/faster_new_string_bytes_cs
This rfe, as the followup for above changes, is to implement ArrayDe/Encoder for most of the sun.nio.cs.ext.DoubleByte based double-byte charsets. Here is the webrev
http://cr.openjdk.java.net/~sherman/7183053/webrev
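For readers who want to see the two operations this request targets, here is a minimal sketch of a round trip through a double-byte charset using only the public String API. It is not taken from the webrev: the class and method names are hypothetical, and the fallback to UTF-8 is an assumption made so the example still runs on a runtime that does not ship the extended charsets.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; nothing here comes from the webrev.
final class DoubleByteRoundTrip {

    // Big5 lives in the extended charsets, so fall back to UTF-8
    // (an assumption for portability) when it is not available.
    static Charset pickCharset() {
        return Charset.isSupported("Big5")
                ? Charset.forName("Big5")
                : StandardCharsets.UTF_8;
    }

    // Encodes with String.getBytes(Charset) and decodes with
    // new String(byte[], Charset), the two calls named in the subject.
    static String roundTrip(String s, Charset cs) {
        byte[] encoded = s.getBytes(cs);
        return new String(encoded, cs);
    }

    public static void main(String[] args) {
        Charset cs = pickCharset();
        String text = "abc\u4e2d\u6587"; // three ASCII chars plus two CJK chars
        System.out.println(cs + " bytes: " + text.getBytes(cs).length);
        System.out.println("round trip ok: " + text.equals(roundTrip(text, cs)));
    }
}
```

Timing a loop around roundTrip for various string lengths is roughly what a micro benchmark for this change would measure, although the actual benchmark used for the webrev is not shown here.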

I've taken a pass over this and it's great to see DoubleByte.Decoder/Encoder implementing sun.nio.cs.ArrayDecoder/Encoder. The results look good too: there are a small number of regressions (Big5 at len=32, for example), but this is a micro benchmark and I'm sure there are fluctuations. I don't see anything obviously wrong with the EBCDIC changes, though I'd need a history book to remember how the shifts between DBCS and SBCS work. I think our tests are good for this area, so I'm happy. One minor nit is the continue in both encode methods; I think it would be cleaner to use "else if (bb ..." instead.
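To illustrate the style nit only, here is a hypothetical sketch of the two loop shapes. It is not the code from the webrev: the class, the toy encodeChar mapping, the UNMAPPABLE constant and the '?' replacement byte are all assumptions made up for the example. Both methods produce the same bytes; the second replaces the early continue with an if / else if / else chain.

```java
// Hypothetical sketch; not the DoubleByte.Encoder code under review.
final class EncodeLoopSketch {
    static final int UNMAPPABLE = -1;

    // Toy mapping (an assumption): ASCII encodes to one byte, the CJK block
    // U+4E00..U+9FFF encodes to a two-byte value, everything else is unmappable.
    static int encodeChar(char c) {
        if (c < 0x80) return c;
        if (c >= 0x4e00 && c <= 0x9fff) return 0x8000 | (c - 0x4e00);
        return UNMAPPABLE;
    }

    // Shape 1: handle the unmappable case, then continue.
    static int encodeWithContinue(char[] src, byte[] dst) {
        int dp = 0;
        for (char c : src) {
            int bb = encodeChar(c);
            if (bb == UNMAPPABLE) {
                dst[dp++] = (byte) '?';
                continue;
            }
            if (bb > 0xff) {            // double-byte value
                dst[dp++] = (byte) (bb >> 8);
                dst[dp++] = (byte) bb;
            } else {                    // single-byte value
                dst[dp++] = (byte) bb;
            }
        }
        return dp;
    }

    // Shape 2: the same logic as one if / else if / else chain.
    static int encodeWithElseIf(char[] src, byte[] dst) {
        int dp = 0;
        for (char c : src) {
            int bb = encodeChar(c);
            if (bb == UNMAPPABLE) {
                dst[dp++] = (byte) '?';
            } else if (bb > 0xff) {
                dst[dp++] = (byte) (bb >> 8);
                dst[dp++] = (byte) bb;
            } else {
                dst[dp++] = (byte) bb;
            }
        }
        return dp;
    }
}
```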

I see in TestStringCoding.java that you've commented out the test that goes over the buffer limit - would I be correct to say that this isn't an issue and this happens with DB charsets today?

Ulf - you've got several patches to the double byte charsets and I wonder if you have cycles to try Sherman's patch with jdk8 to see if there is any more to be gained?


-Alan.
