Re: hadoop job progress going back

2009-12-27 Thread Doopah Shaf
The web UI doesn't show any failures. And it does show several reduce tasks with progress = 475.70%... any clues? On Sun, Dec 27, 2009 at 12:20 PM, wrote: > Task failed? > > 原始信息 ---- > 发自: Doopah Shaf > 发送时间: > 收件人: common-user@hadoop.apache.org > 主题: hado

hadoop job progress going back

2009-12-27 Thread Doopah Shaf
Running jobs sometimes displays progress as follows: map 100% reduce 10% and then map 100% reduce 0% What causes this? Is this speculative execution restarting tasks? Cheers, DS

Re: file system

2009-12-22 Thread Doopah Shaf
like Cloudera. > > Hope this helps - you didn't give much background into your specific > situation, so I can only answer in very general terms. > > Brian > > On Dec 22, 2009, at 10:24 AM, Doopah Shaf wrote: > > > Does anyone have any recommendations for / against using a NAS / SAN > system > > as the underlying physical storage for a hadoop cluster, instead of local > > data node DAS? > >

file system

2009-12-22 Thread Doopah Shaf
Does anyone have any recommendations for / against using a NAS / SAN system as the underlying physical storage for a hadoop cluster, instead of local data node DAS?

how does hadoop work?

2009-12-20 Thread Doopah Shaf
Trying to figure out how hadoop actually achieves its speed. Assuming that data locality is central to the efficiency of hadoop, how does the magic actually happen, given that data still gets moved all over the network to reach the reducers? For example, if I have 1gb of logs spread across 10 data