[
https://issues.apache.org/jira/browse/HIVE-8808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14207183#comment-14207183
]
Josh Elser commented on HIVE-8808:
----------------------------------
Thanks for looping me in, [~sushanth].

As far as I can recall, Accumulo's InputFormat classes are stateless and rely
on state being provided through the JobConf/InputSplits, as you described. I
know we have some "annoyances" where repeated calls to the InputFormat that
alter the JobConf are not idempotent (they typically throw an error if a value
is re-set). I work around most of that pain in the StorageHandler impl.

Nothing comes to mind that would be fundamentally broken if we get a
re-used instance of the input format. Happy to help test/evaluate this too.
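
To illustrate the non-idempotent configuration calls mentioned above, here is a
minimal sketch of one possible guard. It is not taken from the StorageHandler
code: a plain Map stands in for the JobConf, and the names setOnce,
setIfAbsent, and the key "example.instance.name" are hypothetical.

```java
import java.util.HashMap;
import java.util.Map;

final class IdempotentConfigSketch {

    // Hypothetical stand-in for a configuration call that refuses to
    // overwrite a key, mimicking "throws an error if a value is re-set".
    static void setOnce(Map<String, String> conf, String key, String value) {
        if (conf.containsKey(key)) {
            throw new IllegalStateException(key + " is already set");
        }
        conf.put(key, value);
    }

    // Guarded wrapper: swallow the "already set" failure so that calling it
    // repeatedly is harmless. Returns true only when the value was stored.
    static boolean setIfAbsent(Map<String, String> conf, String key, String value) {
        try {
            setOnce(conf, key, value);
            return true;
        } catch (IllegalStateException e) {
            // The key was configured by an earlier call; keep the first value.
            return false;
        }
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println(setIfAbsent(conf, "example.instance.name", "accumulo")); // true
        System.out.println(setIfAbsent(conf, "example.instance.name", "other"));    // false
        System.out.println(conf.get("example.instance.name"));                      // accumulo
    }
}
```

With a guard of this kind, a cached and re-used input format instance can be
configured more than once without failing, provided the first value written is
the one that should win.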
> HiveInputFormat caching cannot work with all input formats
> ----------------------------------------------------------
>
> Key: HIVE-8808
> URL: https://issues.apache.org/jira/browse/HIVE-8808
> Project: Hive
> Issue Type: Bug
> Reporter: Brock Noland
>
> In {{HiveInputFormat}} we implement instance caching (see
> {{getInputFormatFromCache}}). In HS2, this assumes that InputFormats are
> stateless, but I don't think this assumption holds, especially with regard
> to HBase.