[
https://issues.apache.org/jira/browse/TEZ-779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Bikas Saha updated TEZ-779:
---------------------------
Summary: Make Tez grouped splits logic available outside of InputFormat
(was: Allow grouping to work with pre-existing splits)
> Make Tez grouped splits logic available outside of InputFormat
> --------------------------------------------------------------
>
> Key: TEZ-779
> URL: https://issues.apache.org/jira/browse/TEZ-779
> Project: Apache Tez
> Issue Type: Improvement
> Reporter: Siddharth Seth
>
> Grouping currently fetches splits from the underlying file format.
> It'd be useful to allow grouping to accept a set of splits instead of always
> fetching them from the underlying format.
> One example of where this will be used : Bucketed Hive data - regular
> HiveInputFormat splits are generated, only splits belonging to the same
> bucket can be Grouped together.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)