[ https://issues.apache.org/jira/browse/GIRAPH-552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Dionysios Logothetis resolved GIRAPH-552.
-----------------------------------------
    Resolution: Abandoned

> HBaseVertexInputFormat is ignoring region locality on input superstep
> ---------------------------------------------------------------------
>
>                 Key: GIRAPH-552
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-552
>             Project: Giraph
>          Issue Type: Bug
>          Components: graph
>    Affects Versions: 1.0.0
>            Reporter: Brian Femiano
>            Priority: Major
>
> During the input superstep, you can see the data for different regions being
> needlessly transferred across the network, instead of giving preference to
> machine-local regions if available.
> On modest to large size graphs (5mil V 10mil E) we've noticed this causing
> resource contention, Zookeeper timeouts, and other issues that often freeze
> the input superstep until manually killed on the task tracker hosts.
> This doesn't happen for TextVertexInputFormat subclasses. Perhaps it has to
> do with each instance of the HBaseVertexInputFormat subclass delegating to a
> private TableInputFormat instance.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)