[ 
https://issues.apache.org/jira/browse/DRILL-7096?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16895193#comment-16895193
 ] 

ASF GitHub Bot commented on DRILL-7096:
---------------------------------------

ihuzenko commented on pull request #1829: DRILL-7096: Develop vector for 
canonical Map<K,V>
URL: https://github.com/apache/drill/pull/1829#discussion_r308153779
 
 

 ##########
 File path: 
contrib/storage-hive/core/src/main/java/org/apache/drill/exec/store/hive/HiveUtilities.java
 ##########
 @@ -368,8 +368,13 @@ public static MajorType 
getMajorTypeFromHiveTypeInfo(final TypeInfo typeInfo, fi
         return typeBuilder.build();
       }
 
-      case LIST:
       case MAP:
+        return MajorType.newBuilder()
 
 Review comment:
   I think this code here is not necessary. This util method still here just 
for selection of partition columns vectors, and Hive doesn't partition data 
over complex types. I believe it would be good just to rename the method for 
clarity in the PR. Later I would like to get rid of the method and fill 
partition vectors method and use same approach (factory + Hive writers), like 
with regular columns.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


> Develop vector for canonical Map<K,V>
> -------------------------------------
>
>                 Key: DRILL-7096
>                 URL: https://issues.apache.org/jira/browse/DRILL-7096
>             Project: Apache Drill
>          Issue Type: Sub-task
>            Reporter: Igor Guzenko
>            Assignee: Bohdan Kazydub
>            Priority: Major
>             Fix For: 1.17.0
>
>
> Canonical Map<K,V> datatype can be represented using combination of three 
> value vectors:
> keysVector - vector for storing keys of each map
> valuesVector - vector for storing values of each map
> offsetsVector - vector for storing of start indexes of next each map
> So it's not very hard to create such Map vector, but there is a major issue 
> with such map representation. It's hard to search maps values by key in such 
> vector, need to investigate some advanced techniques to make such search 
> efficient. Or find other more suitable options to represent map datatype in 
> world of vectors.
> After question about maps, Apache Arrow developers responded that for Java 
> they don't have real Map vector, for now they just have logical Map type 
> definition where they define Map like: List< Struct<key:key_type, 
> value:value_type> >. So implementation of value vector would be useful for 
> Arrow too.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)

Reply via email to