[jira] [Created] (HDFS-15464) ViewFsOverloadScheme should work when -fs option pointing to remote cluster without mount links

2020-07-09 Thread Uma Maheswara Rao G (Jira)
Uma Maheswara Rao G created HDFS-15464:
--

 Summary: ViewFsOverloadScheme should work when -fs option pointing 
to remote cluster without mount links
 Key: HDFS-15464
 URL: https://issues.apache.org/jira/browse/HDFS-15464
 Project: Hadoop HDFS
  Issue Type: Sub-task
  Components: viewfsOverloadScheme
Affects Versions: 3.2.1
Reporter: Uma Maheswara Rao G
Assignee: Uma Maheswara Rao G


When users try to connect to a remote cluster from a cluster environment where 
ViewFSOverloadScheme is enabled, fs init expects at least one mount link to be 
configured in order to succeed. 
Unfortunately, you might not have configured any mount links for that remote 
cluster in your current environment; you would have configured mount points 
only for your local clusters.
In this case fs init will fail, because no mount points are configured in the 
mount table for that remote cluster URI's authority.

One idea is that, when no mount links are configured, we should simply treat 
the target as the default cluster. This can be achieved by automatically 
considering it as the fallback option.
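
The idea above can be illustrated with a minimal, dependency-free sketch. It 
does not use the Hadoop API: the class ImplicitFallbackSketch, the method 
implicitFallback, the plain Map standing in for a mount table, and the host 
names are all hypothetical. It also assumes the implicit fallback target is the 
root of the URI passed with -fs, which the description suggests but does not 
spell out.

```java
import java.net.URI;
import java.util.Collections;
import java.util.Map;
import java.util.Optional;

public class ImplicitFallbackSketch {

    /**
     * Hypothetical helper, not part of Hadoop. mountLinks stands in for the
     * mount table of fsUri's authority (mount path -> target URI).
     * If no links are configured, the root of fsUri itself is returned as an
     * implicit fallback; otherwise no implicit fallback is proposed.
     */
    static Optional<URI> implicitFallback(URI fsUri, Map<String, URI> mountLinks) {
        if (mountLinks == null || mountLinks.isEmpty()) {
            // Assumption: the fallback is the remote cluster's root.
            return Optional.of(
                URI.create(fsUri.getScheme() + "://" + fsUri.getAuthority() + "/"));
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        URI remote = URI.create("hdfs://remote-nn:8020/user/data");
        Map<String, URI> noLinks = Collections.emptyMap();
        Map<String, URI> localLinks =
            Collections.singletonMap("/data", URI.create("hdfs://local-nn:8020/data"));

        // Remote authority without mount links: fall back to the cluster itself.
        System.out.println(implicitFallback(remote, noLinks));
        // Authority with mount links: no implicit fallback is needed.
        System.out.println(implicitFallback(remote, localLinks));
    }
}
```

With this shape, initialization for an authority that has an empty mount table 
would proceed against the returned fallback instead of failing.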




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org



[jira] [Created] (HDFS-15463) Add a tool to validate FsImage

2020-07-09 Thread Tsz-wo Sze (Jira)
Tsz-wo Sze created HDFS-15463:
-

 Summary: Add a tool to validate FsImage
 Key: HDFS-15463
 URL: https://issues.apache.org/jira/browse/HDFS-15463
 Project: Hadoop HDFS
  Issue Type: New Feature
  Components: namenode
Reporter: Tsz-wo Sze
Assignee: Tsz-wo Sze









Apache Hadoop qbt Report: trunk+JDK8 on Linux/x86_64

2020-07-09 Thread Apache Jenkins Server
For more details, see 
https://ci-hadoop.apache.org/job/hadoop-qbt-trunk-java8-linux-x86_64/198/

[Jul 8, 2020 7:13:20 AM] (pjoseph) YARN-8047. RMWebApp make external class 
pluggable.
[Jul 8, 2020 3:03:15 PM] (noreply) HADOOP-17117 Fix typos in hadoop-aws 
documentation (#2127)


[Error replacing 'FILE' - Workspace is not accessible]


Re: [VOTE] Release Apache Hadoop 3.1.4 (RC2)

2020-07-09 Thread Masatake Iwasaki

Hi Gabor Bota,

I committed the fix of YARN-10347 to branch-3.1.
I think this should be a blocker for 3.1.4.
Could you cherry-pick it to branch-3.1.4 and cut a new RC?

Thanks,
Masatake Iwasaki

On 2020/07/08 23:31, Masatake Iwasaki wrote:

Thanks Steve and Prabhu for the information.

The cause turned out to be locking in CapacityScheduler#reinitialize.
I think the method is called after transitioning to the active state if 
RM-HA is enabled.


I filed YARN-10347 and created PR.


Masatake Iwasaki


On 2020/07/08 16:33, Prabhu Joseph wrote:

Hi Masatake,

  The thread is waiting for a ReadLock; we need to check what the other
thread holding the WriteLock is blocked on.
Can you get three consecutive complete jstack dumps of the ResourceManager
during the issue?


I got no issue if RM-HA is disabled.

It looks like the RM is not able to access the ZooKeeper State Store. Can you
check whether there is any connectivity issue between the RM and ZooKeeper?

Thanks,
Prabhu Joseph
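
The situation described above, a reader parked on a ReentrantReadWriteLock 
while another thread holds the write lock, can be reproduced in isolation. The 
following is only a minimal sketch using the JDK lock classes; it is not the 
CapacityScheduler code, and the class and method names are hypothetical. A 
latch stands in for whatever the write-lock holder is blocked on.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;

public class ReadLockBlockedSketch {

    /**
     * Returns true if the calling thread could take the read lock within
     * timeoutMs. If writerHoldsLock is true, a second thread holds the write
     * lock for the whole attempt, like a writer that is itself stuck.
     */
    static boolean readerGetsLock(boolean writerHoldsLock, long timeoutMs) {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
        CountDownLatch writerReady = new CountDownLatch(1);
        CountDownLatch releaseWriter = new CountDownLatch(1);

        Thread writer = new Thread(() -> {
            if (writerHoldsLock) {
                lock.writeLock().lock();
            }
            try {
                writerReady.countDown();
                releaseWriter.await(); // stands in for "blocked on something else"
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                if (writerHoldsLock) {
                    lock.writeLock().unlock();
                }
            }
        }, "writer");
        writer.start();

        try {
            writerReady.await();
            // A plain lock() would park here indefinitely, as in the jstack output;
            // tryLock with a timeout lets the sketch terminate.
            boolean acquired = lock.readLock().tryLock(timeoutMs, TimeUnit.MILLISECONDS);
            if (acquired) {
                lock.readLock().unlock();
            }
            return acquired;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } finally {
            releaseWriter.countDown();
            try {
                writer.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
    }

    public static void main(String[] args) {
        System.out.println("writer holds lock, reader acquired: " + readerGetsLock(true, 200));
        System.out.println("no writer, reader acquired: " + readerGetsLock(false, 200));
    }
}
```

This is why the stack of the write-lock holder matters: the reader's stack 
only shows where it is parked, not what keeps the lock from being released.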


On Mon, Jul 6, 2020 at 2:44 AM Masatake Iwasaki wrote:


Thanks for putting this up, Gabor Bota.

I'm testing RC2 on a 3-node Docker cluster with NN-HA and RM-HA enabled.
ResourceManager reproducibly blocks on submitApplication while launching
example MR jobs.
Does anyone run into the same issue?

The same configuration worked for 3.1.3.
I got no issue if RM-HA is disabled.


"IPC Server handler 1 on default port 8032" #167 daemon prio=5 os_prio=0 tid=0x7fe91821ec50 nid=0x3b9 waiting on condition [0x7fe901bac000]
   java.lang.Thread.State: WAITING (parking)
  at sun.misc.Unsafe.park(Native Method)
  - parking to wait for  <0x85d37a40> (a java.util.concurrent.locks.ReentrantReadWriteLock$NonfairSync)
  at java.util.concurrent.locks.LockSupport.park(LockSupport.java:175)
  at java.util.concurrent.locks.AbstractQueuedSynchronizer.parkAndCheckInterrupt(AbstractQueuedSynchronizer.java:836)
  at java.util.concurrent.locks.AbstractQueuedSynchronizer.doAcquireShared(AbstractQueuedSynchronizer.java:967)
  at java.util.concurrent.locks.AbstractQueuedSynchronizer.acquireShared(AbstractQueuedSynchronizer.java:1283)
  at java.util.concurrent.locks.ReentrantReadWriteLock$ReadLock.lock(ReentrantReadWriteLock.java:727)
  at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.checkAndGetApplicationPriority(CapacityScheduler.java:2521)
  at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.createAndPopulateNewRMApp(RMAppManager.java:417)
  at org.apache.hadoop.yarn.server.resourcemanager.RMAppManager.submitApplication(RMAppManager.java:342)
  at org.apache.hadoop.yarn.server.resourcemanager.ClientRMService.submitApplication(ClientRMService.java:678)
  at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationClientProtocolPBServiceImpl.submitApplication(ApplicationClientProtocolPBServiceImpl.java:277)
  at org.apache.hadoop.yarn.proto.ApplicationClientProtocol$ApplicationClientProtocolService$2.callBlockingMethod(ApplicationClientProtocol.java:563)
  at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:527)
  at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:1036)
  at org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:1015)
  at org.apache.hadoop.ipc.Server$RpcCall.run(Server.java:943)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:422)
  at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1729)
  at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2943)


Masatake Iwasaki

On 2020/06/26 22:51, Gabor Bota wrote:

Hi folks,

I have put together a release candidate (RC2) for Hadoop 3.1.4.

The RC is available at:

http://people.apache.org/~gabota/hadoop-3.1.4-RC2/

The RC tag in git is here:
https://github.com/apache/hadoop/releases/tag/release-3.1.4-RC2
The maven artifacts are staged at
https://repository.apache.org/content/repositories/orgapachehadoop-1269/ 



You can find my public key at:
https://dist.apache.org/repos/dist/release/hadoop/common/KEYS
and http://keys.gnupg.net/pks/lookup?op=get=0xB86249D83539B38C

Please try the release and vote. The vote will run for 5 weekdays,
until July 6. 2020. 23:00 CET.

The release includes the revert of HDFS-14941, as it caused
HDFS-15421. IBR leak causes standby NN to be stuck in safe mode.
(https://issues.apache.org/jira/browse/HDFS-15421)
The release includes HDFS-15323, as requested.
(https://issues.apache.org/jira/browse/HDFS-15323)

Thanks,
Gabor

-
To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-dev-h...@hadoop.apache.org