Hanifi Gunes created DRILL-2147:
-----------------------------------
Summary: Refactor ValueVector design
Key: DRILL-2147
URL: https://issues.apache.org/jira/browse/DRILL-2147
Project: Apache Drill
Issue Type: Bug
Reporter: Hanifi Gunes
Assignee: Hanifi Gunes
The overall design of value vectors has become unclear and inconsistent with
additions from multiple contributors over the time. Also we need proper
documentation for the abstractions made for consistently communicating with
developers.
There are many instances that indicate possible design issues.
For instance, ValueVector implements Iterator<ValueVector>. This seems to
assume all vectors are somewhat hierarchical. This does not truly capture
scalar vectors as they have no child.
Similarly, RepeatedVector has the following interface definition:
{code:title=RepeatedVector}
interface RepeatedVector {
RepeatedFixedWidthVector.RepeatedAccessor getAccessor()
}
{code}
Yet, RepeatedFixedWidthVector implements RepeatedVector as follows
{code:title=RepeatedFixedWidthVector}
interface RepeatedFixedWidthVector extends ValueVector, RepeatedVector {
interface RepeatedAccessor extends Accessor {...}
interface RepeatedMutator extends Mutator {...}
}
{code}
A super-type that is aware of its sub-type hints a need for re-design.
Examples could be multiplied here: some method names are not self-explaining or
wrongly named or seems to be misplaced. There are couple of more places where
design is not capturing the nature of vectors such like missing abstractions
for Repeated vs Composite vectors. We should consider a design refactoring.
This is an umbrella issue for tracking ValueVector design refactoring.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)