NO DataNode Daemon on a single node Hadoop 2.2 installation

2013-12-14 Thread Karim Awara
Hi, After I run start-dfs.sh, I dont get a datanode daemon. And in the log file it generates this exception: java.io.IOException: Failed on local exception: java.io.EOFException; Host Details : local host is: DELL1/127.0.1.1; destination host is: localhost:9000; This is my /etc/hosts file:

Re: Hadoop setup

2013-12-14 Thread Adam Kawa
In general, it is very open question and there are many possibilities depending on your workload (e.g. CPU-bound, IO-bound etc). If it is your first Hadoop cluster, and you do not know too much about what types of jobs you will be running, I would recommend just to collect any available machines

Re: hadoop fs -text OutOfMemoryError

2013-12-14 Thread Adam Kawa
Since snappy is non-splittable file (so that to decompress snappy file, you need to read it from the beginning to the end), does the *append* operation handle it well on a plain text file? I guess, that it might be problematic. Snappy is recommended to use with a container format, like Sequence

Hadoop setup doubts

2013-12-14 Thread Indranil Majumder (imajumde)
I stared with Hadoop few days ago, I do have few doubts on the setup, 1. For name node I do format the name directory, is it recommended to do the same for the data node directories too. 2. How does log aggregation work? 3. Does resource manager run on every node (both Name

Re: Hadoop setup

2013-12-14 Thread kishore alajangi
what makes difference in H/W selection, when we choosed yarn to install, and is necessary ? On 12/14/13, Adam Kawa kawa.a...@gmail.com wrote: In general, it is very open question and there are many possibilities depending on your workload (e.g. CPU-bound, IO-bound etc). If it is your first

Hadoop Jobtracker job and UI hangs - Deadlock detection

2013-12-14 Thread Viswanathan J
Hi, JT memory reaches 6.68/8.89 GB and not able to submit the job and UI is not loading at all. But didn't see any JT OOM exceptions. Have taken the thread dump of Jobtracker, and the JT thread dump as follows: Deadlock Detection: Can't print deadlocks:null Thread 25817: (state = BLOCKED)

Re: MiniDFSCluster setup

2013-12-14 Thread Ted Yu
If you search under hadoop-hdfs-project/hadoop-hdfs/src/test, you would see a lot of tests which use MiniDFSCluster e.g. cluster = new MiniDFSCluster.Builder(conf).numDataNodes(3).build(); hadoop-hdfs-project/hadoop-hdfs/src/test//java/org/apache/hadoop/hdfs/TestWriteRead.java Cheers On

Re: Hadoop setup

2013-12-14 Thread Adam Kawa
Again, hardware may depend what types of frameworks and applications, you aim to run on the YARN cluster. If you mostly run MapReduce jobs, then there should not be any significant difference. In our case, we simply migrated our cluster from MRv1 to YARN on the same hardware. Installing YARN is

manipulate DFSInputStream in FSDataInputStream?

2013-12-14 Thread Nan Zhu
Hi, all I’m modifying FSDataInputStream for some project, I would like to directly manipulate “in object in my implementation as in the constructor a DFSInputStream is passed, so I convert “in” from InputStream to DFSInputStream with import org.apache.hadoop.hdfs.DFSClient;

Re: MiniDFSCluster setup

2013-12-14 Thread Karim Awara
Running the test from Maven through commandline works fine. But I am using eclipse. And it generates problem if I try to run the test as Junit, as if eclipse is not aware of any of the conf parameters or args. Can someone point to me a detailed source where it explains how to run Junit through

Re: manipulate DFSInputStream in FSDataInputStream?

2013-12-14 Thread Nan Zhu
Solved by declare an empty somemethod() in FSInputStream and override it in DFSInputStream -- Nan Zhu School of Computer Science, McGill University On Saturday, December 14, 2013 at 7:53 PM, Nan Zhu wrote: Hi, all I’m modifying FSDataInputStream for some project, I would

Re: MiniDFSCluster setup

2013-12-14 Thread Ted Yu
You can use the following command to generate .project files for Eclipse (at the root of your workspace): mvn clean package -DskipTests eclipse:eclipse When you import hadoop, call sub-projects would be imported. I was able to run TestWriteRead in Eclipse successfully. Cheers On Sat, Dec 14,

Re: NO DataNode Daemon on a single node Hadoop 2.2 installation

2013-12-14 Thread Chris Mawata
What do you have in your masters and slaves files? Chris On 12/14/2013 5:05 AM, Karim Awara wrote: Hi, After I run start-dfs.sh, I dont get a datanode daemon. And in the log file it generates this exception: java.io.IOException: Failed on local exception: java.io.EOFException; Host Details

Site-specific dfs.client.local.interfaces setting not respected for Yarn MR container

2013-12-14 Thread Jeff Stuckman
Hello, I have set up a two-node Hadoop cluster on Ubuntu 12.04 running streaming jobs with Hadoop 2.2.0. I am having problems with running tasks on a NM which is on a different host than the RM, and I believe that this is happening because the NM host's dfs.client.local.interfaces property