[ 
https://issues.apache.org/jira/browse/HBASE-2535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12866438#action_12866438
 ] 

John Sichi commented on HBASE-2535:
-----------------------------------

Tweaking TableInputFormatBase.getSplits to return the regionLocation with a 
trailing dot works around this, but the real fix is to derive the hostname 
with the same calls the tasktracker uses for its own hostname (I'm guessing 
that the presence of the trailing dot there is configuration-dependent).
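
To illustrate the mismatch, here is a minimal Java sketch using only the JDK. 
The class HostnameNormalizer and its methods are hypothetical helpers, not 
part of HBase or Hadoop, and the sketch does not reproduce how the job tracker 
actually matches split locations. ensureTrailingDot mirrors the workaround 
described above, while sameHost shows an alternative that tolerates either 
form on both sides.

```java
// Hypothetical helper for illustration only; not an HBase or Hadoop API.
class HostnameNormalizer {

    // Mirrors the workaround: make the split location carry a trailing dot,
    // matching the fully qualified form the tasktracker was seen to report.
    static String ensureTrailingDot(String host) {
        if (host == null || host.isEmpty() || host.endsWith(".")) {
            return host;
        }
        return host + ".";
    }

    // Alternative: drop a single trailing dot so both forms compare equal.
    static String stripTrailingDot(String host) {
        if (host != null && host.endsWith(".")) {
            return host.substring(0, host.length() - 1);
        }
        return host;
    }

    // Compares two hostnames, ignoring case and an optional trailing dot.
    static boolean sameHost(String a, String b) {
        if (a == null || b == null) {
            return false;
        }
        return stripTrailingDot(a).equalsIgnoreCase(stripTrailingDot(b));
    }

    public static void main(String[] args) {
        String trackerHost = "h.s.f.com.";  // as reported by the task tracker
        String splitHost = "h.s.f.com";     // as reported by the input split

        // A plain string comparison treats these as different hosts.
        System.out.println(trackerHost.equals(splitHost));
        // Normalizing either way makes them match.
        System.out.println(ensureTrailingDot(splitHost).equals(trackerHost));
        System.out.println(sameHost(trackerHost, splitHost));
    }
}
```

Hard-coding the dot is fragile if the tasktracker's hostname format depends on 
configuration, which is why deriving both names through the same resolution 
path is the preferable fix.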


> split hostname format should be consistent with tasktracker for locality
> ------------------------------------------------------------------------
>
>                 Key: HBASE-2535
>                 URL: https://issues.apache.org/jira/browse/HBASE-2535
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: mapred, mapreduce
>    Affects Versions: 0.20.4
>            Reporter: John Sichi
>
> I was running a mapreduce job (via Hive) against HBase, and noticed that I 
> wasn't getting any locality (the input split location and the task tracker 
> machine in the job tracker UI were always different, and "Rack-local map 
> tasks" in the job counters was 0).
> I tracked this down to a discrepancy in the way hostnames were being compared.
> The task tracker detail had a Host like
> /f/s/1.2.3.4/h.s.f.com.
> (with trailing dot)
> But the Input Split Location had
> /f/s/1.2.3.4/h.s.f.com
> (without trailing dot)

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
