RE: Hadoop research

2008-02-25 Thread Daming Wang
27;s advantages compared with hadoop's architecture. -Original Message- From: Ahmad Humayun [mailto:[EMAIL PROTECTED] Sent: Tuesday, February 26, 2008 1:31 PM To: core-dev@hadoop.apache.org Subject: Re: Hadoop research Hello there everyone, Great discussion going on here :) I was

Re: Hadoop research

2008-02-25 Thread Ahmad Humayun
improvement for the whole > architecture, how to adopt decentralized strategy maybe a direction and help > you to publish papers. :) > > > > -Original Message- > From: Jaideep Dhok [mailto:[EMAIL PROTECTED] > Sent: Monday, February 25, 2008 8:53 PM > To: core-dev@hado

RE: Hadoop research

2008-02-25 Thread Daming Wang
To: core-dev@hadoop.apache.org Subject: Re: Hadoop research Hi, First of all thank you for your responses. "One interesting direction for research would be more sophisticated scheduling policies for the JobTracker to help improve locality and overall cluster utilization." This

Re: Hadoop research

2008-02-25 Thread Jaideep Dhok
Hi, First of all thank you for your responses. "One interesting direction for research would be more sophisticated scheduling policies for the JobTracker to help improve locality and overall cluster utilization." This is a very interesting area. In fact I was trying a simple Round Robin scheduler,

RE: Hadoop research

2008-02-24 Thread Daming Wang
Message- From: Jaideep Dhok [mailto:[EMAIL PROTECTED] Sent: Monday, February 25, 2008 1:42 AM To: core-dev@hadoop.apache.org Cc: [EMAIL PROTECTED] Subject: Hadoop research Hello, I am a graduate research student in CS at the Search and Information Extraction Lab, in IIIT Hyderabad, India

RE: Hadoop research

2008-02-24 Thread dhruba Borthakur
it helps, dhruba -Original Message- From: Jaideep Dhok [mailto:[EMAIL PROTECTED] Sent: Sunday, February 24, 2008 9:42 AM To: core-dev@hadoop.apache.org Cc: [EMAIL PROTECTED] Subject: Hadoop research Hello, I am a graduate research student in CS at the Search and Information Extraction

Re: Hadoop research

2008-02-24 Thread Chad Walters
+1 to Jeff's suggestions, especially on locality. I'd love to see some rigorous work done so that the scheduler could prefer distributing tasks to the nodes that are already hosting the appropriate data. Generalizing this further so that a full vertical integration of HDFS, Hbase, and Map/Reduce

Re: Hadoop research

2008-02-24 Thread Jeff Hammerbacher
Hey Jaideep, One interesting direction for research would be more sophisticated scheduling policies for the JobTracker to help improve locality and overall cluster utilization. The introduction of speculative execution is a step in this direction; you could perhaps investigate the implications of

Hadoop research

2008-02-24 Thread Jaideep Dhok
Hello, I am a graduate research student in CS at the Search and Information Extraction Lab, in IIIT Hyderabad, India (http://search.iiit.ac.in). I have been working on Nutch and Hadoop for the past couple of months, basically to get an understanding of the platform, and to discover possible resear