hadoop questions for a begginer

2017-09-22 Thread Demian Kurejwowski
hi, i am learning hadoop and currently doing python map reduce tutorial.  i am trying to understand the difference of having a map and reduce  files. i am assumingwhen we lunch the scripts.The mapper.py script goes to all the machines at the same time and all start printing at the same time, and

Re: Hadoop "managed" setup basic question (Ambari, CDH?)

2017-09-22 Thread Sanel Zukan
Hi, For this amount of nodes, I'd go with automation tools like Ansible[1]/Puppet[2]/Rex[3]. They can install necessary packages, setup /etc/hosts and make per-node settings. Ansibles has a nice playbook (https://github.com/analytically/hadoop-ansible) you can start with and Puppet isn't short

Inter-cluster Communication

2017-09-22 Thread Rishikesh Gawade
Hello. I have been working on Hadoop for a while. As of now I have implemented 2 small-scale clusters, each consisting of 3 nodes. I am looking for a way to achieve communication between these 2 clusters for coordination purposes. Is there any valuable resource/weblink for the same? If so, I would

NodeManager exit without spesific log messages.

2017-09-22 Thread Nur Kholis Majid
Hi, one of my NM nodes periodically exit with this error log https://paste.ee/p/hc104 Anyone have idea about this? Thank you. - To unsubscribe, e-mail: user-unsubscr...@hadoop.apache.org For additional commands, e-mail: