[jira] Commented: (THRIFT-1035) Container types containing binary data are parameterized with ByteBuffer in the generated Java code

Bryan Duxbury (JIRA) Tue, 11 Jan 2011 14:28:18 -0800

    [ 
https://issues.apache.org/jira/browse/THRIFT-1035?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12980383#action_12980383
 ]


Bryan Duxbury commented on THRIFT-1035:
---------------------------------------

bq. ByteBuffer are not Thread Safe and thus would impose extraneous precautions 
on their use.

Neither are byte[], in the sense that you can easily clobber the contents being 
used by other threads if you're not being careful. Besides, in 99.9% of cases, 
the ByteBuffer is going to be wrapped around a byte[], so you can use it just 
as thread-safely.
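
To make the comparison concrete, here is a minimal sketch using only java.nio. The class and method names are made up for illustration and are not part of Thrift. It shows that a wrapped ByteBuffer is just a view over the byte[], so a write to the array is visible through the buffer. It also shows that duplicate() gives each reader its own position and limit while still sharing the content, which is the one extra piece of mutable state a ByteBuffer carries.

```java
import java.nio.ByteBuffer;

public class WrapSharingSketch {
    /** Returns true if a write made through the array is visible through the wrapper. */
    static boolean writeIsVisibleThroughWrapper() {
        byte[] data = {1, 2, 3};
        ByteBuffer buf = ByteBuffer.wrap(data); // no copy: buf is a view over data
        data[0] = 42;                           // stands in for another thread clobbering the array
        return buf.get(0) == 42;                // absolute get, does not move the position
    }

    /** duplicate() shares the bytes but keeps an independent position and limit. */
    static int[] positionsAfterIndependentRead() {
        ByteBuffer original = ByteBuffer.wrap(new byte[] {1, 2, 3});
        ByteBuffer view = original.duplicate();
        view.get(); // relative read: advances only the duplicate's position
        return new int[] {original.position(), view.position()};
    }

    public static void main(String[] args) {
        System.out.println(writeIsVisibleThroughWrapper());
        int[] p = positionsAfterIndependentRead();
        System.out.println(p[0] + " " + p[1]);
    }
}
```

Either way, the content needs the same care as a bare byte[]. Handing each reader a duplicate() is one way to avoid sharing the position as well.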

bq. I agree that my fix introduces an inconsistency internally, but on the user 
facing side we've worked after THRIFT-830 so binary fields could be 
set/retrieved using byte[], right now collections of binary fields cannot use 
byte[], so we suffer from a lack of external consistency which, in my point of 
view, is more important to have than internal consistency.

I'm actually not talking about internal consistency. I'm talking about 
consistently giving users a way to access the ByteBuffer version if that's what 
they want. Overall, if we were creating Thrift Java from scratch today, 
*everything* would be using ByteBuffer. We're only supporting the byte[] getter 
for backwards compatibility. From our discussion, I think that the cost of 
giving a backwards compatible interface to collections is very high, and that's 
why I'm counseling that we don't do it.
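
For users who do want byte[] elements from a List<ByteBuffer>, a call-site conversion along these lines is one option. This is only a sketch: toByteArrays is a hypothetical helper, not something the generated code provides. It copies each element, so it gives up the benefit of holding ByteBuffers in the first place.

```java
import java.nio.ByteBuffer;
import java.util.ArrayList;
import java.util.List;

public class BinaryListSketch {
    /** Hypothetical helper: copies the remaining bytes of each buffer into a fresh byte[]. */
    static List<byte[]> toByteArrays(List<ByteBuffer> buffers) {
        List<byte[]> out = new ArrayList<>(buffers.size());
        for (ByteBuffer b : buffers) {
            ByteBuffer view = b.duplicate();          // leave the caller's position untouched
            byte[] copy = new byte[view.remaining()]; // one allocation per element
            view.get(copy);                           // bulk copy from position to limit
            out.add(copy);
        }
        return out;
    }
}
```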

bq. On the performance side, could you pinpoint cases where using byte[] in 
collections would be less efficient than the ByteBuffers, since the retrieved 
arrays will be those backing the ByteBuffers when deserializing and the 
ByteBuffer passed to the serialization code are simply wrapping the byte[] in 
the collections.

If you're putting a collection *into* a Thrift struct, then you're correct - 
there is no positive performance impact of using ByteBuffers over byte[] 
(unless you already have ByteBuffers from NIO operations...). However, on the 
read side, having to read into a byte[] when you could just tag the underlying 
buffer with a ByteBuffer is a performance loss. That's what I'm referring to. 
If we make all collections use byte[] instead of ByteBuffer, you never get the 
ByteBuffer read benefits.
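
As a rough illustration of the read-side difference, the sketch below contrasts the two approaches. It is not the actual Thrift protocol code: transportBuf, offset and len are assumed to describe where a binary value sits inside a buffer the transport has already filled, and the method names are invented for this example.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

public class ReadSideSketch {
    /** Copying read: allocates a new byte[] and copies len bytes out of the transport buffer. */
    static byte[] readBinaryAsArray(byte[] transportBuf, int offset, int len) {
        return Arrays.copyOfRange(transportBuf, offset, offset + len);
    }

    /** Zero-copy read: a ByteBuffer whose position and limit mark the value in place. */
    static ByteBuffer readBinaryAsBuffer(byte[] transportBuf, int offset, int len) {
        // No allocation of payload bytes; the result is backed by transportBuf itself.
        return ByteBuffer.wrap(transportBuf, offset, len);
    }
}
```

The trade-off is that the zero-copy result stays valid only as long as the underlying buffer is not reused or overwritten.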

> Container types containing binary data are parameterized with ByteBuffer in 
> the generated Java code
> ---------------------------------------------------------------------------------------------------
>
>                 Key: THRIFT-1035
>                 URL: https://issues.apache.org/jira/browse/THRIFT-1035
>             Project: Thrift
>          Issue Type: Bug
>          Components: Java - Compiler, Java - Library
>    Affects Versions: 0.4, 0.5, 0.6, 0.7
>         Environment: All
>            Reporter: Mathias Herberts
>         Attachments: THRIFT-1035-2.patch, THRIFT-1035.patch
>
>
> Since THRIFT-830, binary fields are internally handled using ByteBuffer.
> Release 0.4.0 was the first to expose the ByteBuffer to the outside world 
> (replacing previous methods returning/accepting byte[]).
> THRIFT-882 led to the methods accepting/returning byte[] being available 
> again, as it was deemed more reasonable not to expose the ByteBuffer too much 
> as their use could be cumbersome. This led to 0.5.0 being backward 
> compatible with 0.3.0 on the binary fields front.
> During that time, nobody noticed that container types that contained binary 
> data had their generated Java code changed to collections parameterized with 
> ByteBuffer instead of byte[].
> list<binary> -> List<ByteBuffer>
> set<binary> -> Set<ByteBuffer>
> map<binary,...> -> Map<ByteBuffer,...>
> map<...,binary> -> Map<...,ByteBuffer>
> This introduces confusion in the API and still exposes ByteBuffer when 
> discussion on THRIFT-882 concluded this should be avoided.
> We need to provide a way to offer the original parameterization with byte[] 
> as this will simplify working with that type of collection and thus will 
> increase the odds of Thrift's adoption.

