[ 
https://issues.apache.org/jira/browse/HIVE-12995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15197014#comment-15197014
 ] 

Lefty Leverenz commented on HIVE-12995:
---------------------------------------

Doc note:  This adds configuration parameter 
*hive.orc.splits.allow.synthetic.fileid* to HiveConf.java, so it will need to 
be documented in the ORC section of Configuration Properties for release 2.1.0.

* [Configuration Properties -- ORC File Format | 
https://cwiki.apache.org/confluence/display/Hive/Configuration+Properties#ConfigurationProperties-ORCFileFormat]

Should it also be mentioned in the llap documentation?  (The parameter 
description doesn't say anything about llap.)

* [LLAP design document | 
https://issues.apache.org/jira/secure/attachment/12665704/LLAPdesigndocument.pdf]
 attached to HIVE-7926

> LLAP: Synthetic file ids need collision checks
> ----------------------------------------------
>
>                 Key: HIVE-12995
>                 URL: https://issues.apache.org/jira/browse/HIVE-12995
>             Project: Hive
>          Issue Type: Bug
>          Components: llap
>    Affects Versions: 2.1.0
>            Reporter: Gopal V
>            Assignee: Sergey Shelukhin
>              Labels: TODOC2.1
>             Fix For: 2.1.0
>
>         Attachments: HIVE-12995.01.patch, HIVE-12995.02.patch, 
> HIVE-12995.03.patch, HIVE-12995.04.patch, HIVE-12995.patch
>
>
> LLAP synthetic file ids do not have any way of checking whether a collision 
> occurs other than a data-error.
> Synthetic file-ids have only been used with unit tests so far - but they will 
> be needed to add cache mechanisms to non-HDFS filesystems.
> In case of Synthetic file-ids, it is recommended that we track the full-tuple 
> (path, mtime, len) in the cache so that a cache-hit for the synthetic file-id 
> can be compared against the parameters & only accepted if those match.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to