[
https://issues.apache.org/jira/browse/HBASE-25566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michael Stack resolved HBASE-25566.
-----------------------------------
Fix Version/s: 2.4.2, 2.3.5, 3.0.0-alpha-1
Hadoop Flags: Reviewed
Release Note: Adds RoundRobinTableInputFormat, a subclass of
TableInputFormat, that takes the TIF#getSplits list and re-sorts it so as to
spread the InputSplits as broadly about the cluster as possible. RRTIF works
to frustrate bunching of InputSplits on RegionServers, avoiding the scenario
where a few RegionServers work hard fielding many InputSplits while
others idle, hosting few or none.
Resolution: Fixed
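
The re-sort described in the release note can be pictured with a small,
self-contained sketch. This is not the HBase implementation: the Split class,
the host names, and the roundRobin helper below are hypothetical stand-ins,
and the sketch assumes each split carries the name of the single host that
serves it. It groups splits by host, then takes one split from each host in
turn until all are consumed.

```java
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

class RoundRobinSketch {
  /** Hypothetical stand-in for an InputSplit: a split name plus its serving host. */
  static final class Split {
    final String name;
    final String host;

    Split(String name, String host) {
      this.name = name;
      this.host = host;
    }
  }

  /** Reorders splits so that consecutive entries rotate across hosts. */
  static List<Split> roundRobin(List<Split> splits) {
    // Group splits per host, keeping first-seen host order and per-host split order.
    Map<String, Deque<Split>> byHost = new LinkedHashMap<>();
    for (Split s : splits) {
      byHost.computeIfAbsent(s.host, h -> new ArrayDeque<>()).add(s);
    }
    // Take one split from each host per pass until every queue is drained.
    List<Split> result = new ArrayList<>(splits.size());
    while (result.size() < splits.size()) {
      for (Deque<Split> queue : byHost.values()) {
        Split next = queue.poll();
        if (next != null) {
          result.add(next);
        }
      }
    }
    return result;
  }

  public static void main(String[] args) {
    List<Split> input = List.of(
        new Split("r1", "rs-a"), new Split("r2", "rs-a"), new Split("r3", "rs-a"),
        new Split("r4", "rs-b"), new Split("r5", "rs-c"));
    for (Split s : roundRobin(input)) {
      System.out.println(s.host + " " + s.name);
    }
    // Prints: rs-a r1, rs-b r4, rs-c r5, rs-a r2, rs-a r3 (one per line)
  }
}
```

With the bunched input above, the first three splits handed out land on three
different hosts instead of all on rs-a; only once the other hosts run out of
splits does rs-a receive consecutive entries.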
> RoundRobinTableInputFormat
> --------------------------
>
> Key: HBASE-25566
> URL: https://issues.apache.org/jira/browse/HBASE-25566
> Project: HBase
> Issue Type: Improvement
> Components: mapreduce
> Reporter: Michael Stack
> Assignee: Sudhir Reddy
> Priority: Major
> Fix For: 3.0.0-alpha-1, 2.3.5, 2.4.2
>
>
> [~huaxiang] figured we needed a round robin on the result returned by
> TableInputFormat to better distribute tasks on a production cluster at our
> place-of-employ.
> This seems to be an old problem. See this helpful, old Flipkart blog:
> https://tech.flipkart.com/is-data-locality-always-out-of-the-box-in-hadoop-not-really-2ae9c95163cb