The web UI doesn't show any failures.
It does, however, show several reduce tasks with progress = 475.70%. Any clues?
On Sun, Dec 27, 2009 at 12:20 PM, wrote:
> Task failed?
>
> Original message ----
> From: Doopah Shaf
> Sent:
> To: common-user@hadoop.apache.org
> Subject: hado
Running jobs sometimes display progress as follows:
map 100% reduce 10%
and then
map 100% reduce 0%
What causes this? Is this speculative execution restarting tasks?
Cheers,
DS
like Cloudera.
>
> Hope this helps - you didn't give much background into your specific
> situation, so I can only answer in very general terms.
>
> Brian
>
> On Dec 22, 2009, at 10:24 AM, Doopah Shaf wrote:
>
> > Does anyone have any recommendations for / against using a NAS / SAN system
> > as the underlying physical storage for a hadoop cluster, instead of local
> > data node DAS?
>
>
Does anyone have any recommendations for / against using a NAS / SAN system
as the underlying physical storage for a hadoop cluster, instead of local
data node DAS?
Trying to figure out how hadoop actually achieves its speed. Assuming that
data locality is central to the efficiency of hadoop, how does the magic
actually happen, given that data still gets moved all over the network to
reach the reducers?
For example, if I have 1 GB of logs spread across 10 data