[DISCUSS] DrillBuf

Vlad Rozov Wed, 04 Apr 2018 10:34:56 -0700

I have several questions and concerns regarding DrillBuf usage, design and implementation. There is limited documentation available on the subject (Javadoc, https://github.com/apache/drill/blob/master/exec/memory/base/src/main/java/org/apache/drill/exec/memory/README.md and https://github.com/paul-rogers/drill/wiki/Memory-Management), and I hope that a few members of the community may have more information.

What are the design goals behind DrillBuf? It seems that it is supposed to be Drill's access gate for direct byte buffers. How is it different (for that goal) from UnsafeDirectLittleEndian? Both use the wrapper/delegation pattern, with DrillBuf delegating to UnsafeDirectLittleEndian (not always) and UnsafeDirectLittleEndian delegating to the ByteBuf it wraps. Is it necessary to have both? Are there any out-of-the-box netty classes that already provide the required functionality? I guess that the answer to the last question was "no" back when DrillBuf and UnsafeDirectLittleEndian were introduced into Drill. Is it still "no" for the latest netty release? What extra functionality do DrillBuf (and UnsafeDirectLittleEndian) provide on top of the existing netty classes?

As far as I can see from the source code, DrillBuf changes the validation (boundary and reference count checks) mechanism, making it optional (compared to the always-enabled boundary checks inside netty) for get/set Byte/Char/Short/Long/Float/Double. Is this the proper place to make validation optional, or must the validation (or a portion of it) be always on or always off (there are different opinions, see https://issues.apache.org/jira/browse/DRILL-6004, https://issues.apache.org/jira/browse/DRILL-6202, https://github.com/apache/drill/pull/1060 and https://github.com/apache/drill/pull/1144)? Are there any performance benchmarks that justify or explain such behavior (if no such benchmark exists, are there any volunteers to do one)? My experience is that the reference count check is significantly more expensive than boundary checking, and that boundary checking adds tens of percent to a direct memory read when reading just a few bytes, so my vote is to keep validation optional, with the ability to enable it for debug purposes at run-time. Why does the same approach not apply to get/set Bytes, with those methods delegated to UnsafeDirectLittleEndian, which delegates them further?
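
To make the question about optional validation concrete, here is a minimal sketch of the idea using only java.nio. It is not DrillBuf's or netty's actual code: the class name OptionalCheckBuf, its constructor parameters and the checkIndex helper are all hypothetical, and the logical length is deliberately smaller than the backing buffer so that the effect of skipping the wrapper's own check is observable.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical illustration only; not Drill's or netty's implementation.
final class OptionalCheckBuf {
    private final ByteBuffer backing;     // direct memory, little endian
    private final int length;             // logical length exposed to callers
    private final boolean checksEnabled;  // assumed debug switch, fixed at construction

    OptionalCheckBuf(int backingCapacity, int length, boolean checksEnabled) {
        this.backing = ByteBuffer.allocateDirect(backingCapacity)
                                 .order(ByteOrder.LITTLE_ENDIAN);
        this.length = length;
        this.checksEnabled = checksEnabled;
    }

    // Boundary check against the logical length of this wrapper.
    private void checkIndex(int index, int fieldLength) {
        if (index < 0 || index > length - fieldLength) {
            throw new IndexOutOfBoundsException(
                "index: " + index + ", length: " + fieldLength
                + " (expected: range(0, " + length + "))");
        }
    }

    int getInt(int index) {
        if (checksEnabled) {
            checkIndex(index, Integer.BYTES);
        }
        // Note: ByteBuffer still performs its own capacity check here;
        // an Unsafe-based accessor would have no such safety net.
        return backing.getInt(index);
    }

    void setInt(int index, int value) {
        if (checksEnabled) {
            checkIndex(index, Integer.BYTES);
        }
        backing.putInt(index, value);
    }
}
```

With checksEnabled set to false, an access past the logical length is not rejected by the wrapper, which is the trade-off under discussion: cheaper reads of a few bytes in exchange for losing the guard.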

Why does DrillBuf reverse how AbstractByteBuf calls _get from get (and _set from set), making _get call get (and _set call set)? Why not follow the base class design pattern?
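
For readers less familiar with the two call directions, the following sketch contrasts them. It is a simplified stand-in and not the netty or Drill source: BaseBufSketch, ConventionalBuf and ReversedBuf are hypothetical names, the backing store is a plain byte array, and the checks counter exists only to make the difference visible.

```java
// Hypothetical stand-in for a base class in the style of AbstractByteBuf.
abstract class BaseBufSketch {
    protected final byte[] data;
    int checks; // counts validations, for illustration only

    BaseBufSketch(int capacity) {
        this.data = new byte[capacity];
    }

    // Base class design: the public accessor validates,
    // then delegates to the unchecked hook.
    public byte getByte(int index) {
        checkIndex(index);
        return _getByte(index);
    }

    protected abstract byte _getByte(int index);

    protected void checkIndex(int index) {
        checks++;
        if (index < 0 || index >= data.length) {
            throw new IndexOutOfBoundsException("index: " + index);
        }
    }
}

// Follows the base class pattern: only the unchecked hook is implemented.
final class ConventionalBuf extends BaseBufSketch {
    ConventionalBuf(int capacity) {
        super(capacity);
    }

    @Override
    protected byte _getByte(int index) {
        return data[index];
    }
}

// Reversed direction: the public accessor does the raw access (and decides
// on validation itself), while the hook calls back into the public accessor.
final class ReversedBuf extends BaseBufSketch {
    ReversedBuf(int capacity) {
        super(capacity);
    }

    @Override
    public byte getByte(int index) {
        return data[index]; // base class validation is bypassed here
    }

    @Override
    protected byte _getByte(int index) {
        return getByte(index);
    }
}
```

In the reversed variant, any base class method that relies on the hook ends up in the subclass's public accessor, so the base class no longer controls whether validation runs.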

Another question is the usage of the netty "io.netty.buffer" package for Drill classes. Is this absolutely necessary? I don't think that netty developers expect this or support semantic version compatibility for package-private classes/members.


Thank you,

Vlad
