On Wed, Jul 30, 2008 at 11:24 PM, Naama Kraus <[EMAIL PROTECTED]> wrote:
> Hi,
>
> I am a bit confused of how the framework determines the number of mappers of
> a job and the number of input splits.
> Could anyone summarize ?

Take a look at http://wiki.apache.org/hadoop/HowManyMapsAndReduces

Things become clearer when you think about Hadoop-size datasets. What you
usually care about tuning is the number of tasks running simultaneously on a
single machine (one per core? one per hard drive? one per <whatever>?); the
total number of maps is just "many."

-- 
James Moore | [EMAIL PROTECTED]
Ruby and Ruby on Rails consulting
blog.restphone.com
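
To make the question concrete, here is a rough sketch of how the number of input splits (and so map tasks) falls out for plain file input. It assumes the split-size rule used by the old-API FileInputFormat, max(minSize, min(goalSize, blockSize)), and leaves out details such as the slack allowed on the last split and unsplittable (e.g. compressed) files. The function and parameter names are made up for illustration and are not Hadoop APIs.

```python
import math


def compute_split_size(goal_size, min_size, block_size):
    # Assumed rule: a split is never larger than a block unless the
    # minimum split size forces it, and never smaller than min_size.
    return max(min_size, min(goal_size, block_size))


def estimate_num_splits(file_sizes, block_size, requested_maps=1, min_size=1):
    """Estimate the number of input splits for a list of file sizes in bytes.

    requested_maps plays the role of the "number of maps" hint: it only
    sets a goal size and does not force the final count.
    """
    total = sum(file_sizes)
    goal_size = total // max(requested_maps, 1)
    split_size = compute_split_size(goal_size, min_size, block_size)
    # Each file is split on its own; splits do not span files.
    return sum(math.ceil(size / split_size) for size in file_sizes if size > 0)


MIB = 1024 * 1024
GIB = 1024 * MIB

# One 10 GiB file with 64 MiB blocks: one split per block.
default_splits = estimate_num_splits([10 * GIB], block_size=64 * MIB)

# Asking for more maps than there are blocks shrinks the split size.
more_splits = estimate_num_splits([10 * GIB], block_size=64 * MIB,
                                  requested_maps=320)

# Raising the minimum split size above the block size gives fewer maps.
fewer_splits = estimate_num_splits([10 * GIB], block_size=64 * MIB,
                                   min_size=128 * MIB)
```

Under these assumptions the total number of maps is mostly a property of the input (its size and block size), which is why it ends up as just "many" on large datasets. How many of those tasks run at once on each machine is a separate, per-node setting.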