Hadoop 2.4.0 to Hadoop 2.7.1

2016-04-28 Thread kumar, Senthil(AWF)
Hi Everyone , We are Planning to upgrade our Clusters to Hadoop 2.7.1 Stable Release .. Can someone help me to get List of Deprecated API's or Configurations ? Hopefully the Jobs which are running against 2.4.0 should run in 2.7.1 smoothly.. --Senthil

Re: NameNode Crashing with "flush failed for required journal" exception

2016-04-28 Thread Shaik M
Thank you for your suggestions. I found in logs "WARN security.Groups (Groups.java:fetchGroupList(244)) - Potential performance problem: getGroups(user=hdfs) took 15915 milliseconds. First I'll deploy "nscd" service on all three journal nodes and will update you accordingly. Thanks, Shaik On 2

Yarn container use huge virtual memory with JVM option -XX:+PrintGCDetails added.

2016-04-28 Thread JR Wang
Hi All, I’m currently running Hadoop 2.7.2 on my three node cluster, all node equiped with 32-core CPU and 64 GB memory, Ubuntu 14.04.3 LTS. Yarn’s configuration kept the same with default one, except yarn.nodemanager.resource.memory-mb set as 16384(16G). Everything worked fine, but when I try

[HDFS-inotify] "IOException: The client is stopped" after reading file

2016-04-28 Thread Cazen Lee
Good day this is Cazen Could I kindly ask about something weird situation when reading file in hdfs with inotify polling - Env : MacOS, EMR, Linux(standalone) - same problem - Version : Hadoop 2.7.2 1. I would like to write down a code that read file under particular location when it created(with

YARN queues become unusable and jobs are stuck in ACCEPTED state

2016-04-28 Thread Matt Cheah
Hi, I¹ve been sporadically seeing an issue when using Hadoop YARN. I¹m using Hadoop 2.5.0, CDH5.3.3. When I¹ve configured the stack to use the fair scheduler protocol, after some period of time of the cluster being alive and running jobs, I¹m noticing that when I submit a job, the job will be

Re: Multiples dfs.cluster.administrators

2016-04-28 Thread Chris Nauroth
Hello Tallenl, This property supports multiple users and groups. The documentation for this property describes the syntax, but that documentation is not yet live on hadoop.apache.org. Here is a copy of that documentation content from trunk: dfs.cluster.administrators ACL for the ad

Re: NameNode Crashing with "flush failed for required journal" exception

2016-04-28 Thread Chris Nauroth
A problem I've seen a few times is that slow lookups of the hdfs user's groups at the JournalNode introduce delays in handling the edit logging RPC, which then times out at the NameNode side, ultimately causing an abort and an HA failover. If your environment is experiencing this, then you'll see

Multiples dfs.cluster.administrators

2016-04-28 Thread Tallanel Siriel
Hello ! I send you this mail because I need to set multiple users with the same powers than hdfs in my hadoop cluster. Do you know if it is possible with the property dfs.cluster.administrators ? Do we put user or group in this property ? If I can put multiple items in this property, do you know

Re: NameNode Crashing with "flush failed for required journal" exception

2016-04-28 Thread Gagan Brahmi
Hi Shaik, The error basically indicates that namenode crashed waiting for the write and sync to happen on the quorum of JournalNodes. In your case atleast 2 journal nodes should complete the write and sync without the timeout period of 20 seconds which does not seems to be the case. I will advice

Re: HDFS Federation

2016-04-28 Thread Kun Ren
Thanks a lot, Akira. When the Federation is disabled, I can view the namenode from http://localhost:50070 Do you know where is the 50070 come form or where to set it? If I configure the Federation in one node, I probably will change this, right? Thanks a lot. On Thu, Apr 28, 2016 at 1:55 AM, Ak

S3 Hadoop FileSystems

2016-04-28 Thread Elliot West
Hello, I'm working on a project that moves data from HDFS file systems into S3 for analysis with Hive on EMR. Recently I've become quite confused with the state of play regarding the different FileSystems: s3, s3n, and s3a. For my use case I require the following: - Support for the transfer of

NameNode Crashing with "flush failed for required journal" exception

2016-04-28 Thread Shaik M
Hi All, I am running 8 node HDP 2.3 Hadoop Cluster (3 Master+5 DataNodes) with Kerberos security. NameNode having HA and it is crashing at least once in a day with "flush failed for required journal " exception. don't have any network issues between the nodes. I have tried to find the causing t