[ 
https://issues.apache.org/jira/browse/HIVE-18738?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16368092#comment-16368092
 ] 

Sergey Shelukhin commented on HIVE-18738:
-----------------------------------------

A WIP patch that clarifies the semantics of various include variables and fixes 
the double-ACID-ifying and other incorrect treatment of them in the IO elevator.

It already works if one changes the code to not exclude the ACID ROW column 
from the includes and removes the added nested column magic from ORC 
genIncludedColumns; however, it then reads all nested columns of the ACID 
struct, even though it might only need a subset.
Excluding the ROW column and including a subset of it via the nested column 
magic makes it read only the part of the ROW struct that it needs, but 
everything after the reader (e.g. the decoder) is still set up incorrectly. 
After treereaderfactory is fixed, I suspect the next thing to break will be the 
ACID wrapper that blindly puts CVs from the elevator into the VRB that is passed 
to the thing that applies deletes. That might not work so well without always 
having the whole ROW struct (the payload will be null).
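
To make the two include modes above concrete, here is a minimal sketch. It is 
not code from the patch. It assumes the usual ACID ORC file layout (column 0 is 
the root struct, columns 1-5 are the ACID metadata fields, column 6 is the ROW 
struct, and its children start at 7) and a flat payload with no nested types. 
The class and method names are hypothetical. It also assumes the ROW struct's 
own id stays marked so that a reader can reach its children; "excluding ROW" in 
the comment above refers to the logical includes, which this sketch does not 
model.

```java
import java.util.Arrays;

// Illustrative only: hypothetical helper, not part of Hive or ORC.
final class AcidIncludesSketch {
    // Assumed layout: 0 = root struct, 1..5 = ACID metadata, 6 = ROW struct.
    static final int ACID_META_COLUMNS = 5;
    static final int ROW_COLUMN_ID = ACID_META_COLUMNS + 1;

    /**
     * Builds a file-level include array for a flat payload.
     *
     * @param rowFieldCount   number of fields inside the ROW struct
     * @param neededRowFields zero-based indexes of the ROW fields to read,
     *                        or null to read the whole ROW struct
     */
    static boolean[] fileIncludes(int rowFieldCount, int[] neededRowFields) {
        boolean[] include = new boolean[ROW_COLUMN_ID + 1 + rowFieldCount];
        // Root, ACID metadata columns, and the ROW struct id itself.
        Arrays.fill(include, 0, ROW_COLUMN_ID + 1, true);
        if (neededRowFields == null) {
            // Mode 1: ROW is not excluded, so every nested column is read.
            Arrays.fill(include, ROW_COLUMN_ID + 1, include.length, true);
        } else {
            // Mode 2: only the requested subset of ROW children is read.
            for (int field : neededRowFields) {
                if (field < 0 || field >= rowFieldCount) {
                    throw new IllegalArgumentException("bad ROW field index: " + field);
                }
                include[ROW_COLUMN_ID + 1 + field] = true;
            }
        }
        return include;
    }

    public static void main(String[] args) {
        // Three payload fields; first read everything, then only field 1.
        System.out.println(Arrays.toString(fileIncludes(3, null)));
        System.out.println(Arrays.toString(fileIncludes(3, new int[] {1})));
    }
}
```

In the second mode the unread ROW children have no data, which is why 
downstream consumers that expect the whole ROW struct could see nulls.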

Also, all the debug logging needs to be removed later.


> LLAP IO ACID - includes handling is broken
> ------------------------------------------
>
>                 Key: HIVE-18738
>                 URL: https://issues.apache.org/jira/browse/HIVE-18738
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Deepesh Khandelwal
>            Assignee: Sergey Shelukhin
>            Priority: Major
>         Attachments: HIVE-18738.patch
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
