[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560251#comment-14560251 ]

Siddharth Seth commented on TEZ-2475:
-------------------------------------

Don't think it's related. The message shows up for pretty much all tasks - should investigate what it is, but I don't think it's causing the job to hang.

Tez local mode hanging in big testsuite
---------------------------------------

Key: TEZ-2475
URL: https://issues.apache.org/jira/browse/TEZ-2475
Project: Apache Tez
Issue Type: Bug
Affects Versions: 0.7.0, 0.6.1
Reporter: André Kelpe
Attachments: 2015-05-21_15-55-20_buildLog.log.gz

We have a big test suite for Lingual, our SQL layer for Cascading. We are trying very hard to make it work correctly on Tez, but I am stuck. The setup is a huge suite of SQL-based tests (6000+), which are executed in order in local mode. At certain moments the whole process just stops and nothing gets executed any longer. This does not happen every time, but quite often. Note that it does not happen at the same line of code; it appears random, which makes it quite complex to debug.

What I am seeing is this kind of stack trace in the middle of the run:

{noformat}
2015-05-21 16:07:42,413 ERROR [TaskHeartbeatThread] task.TezTaskRunner (TezTaskRunner.java:reportError(333)) - TaskReporter reported error
java.lang.InterruptedException
	at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.reportInterruptAfterWait(AbstractQueuedSynchronizer.java:2017)
	at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:2188)
	at org.apache.tez.runtime.task.TaskReporter$HeartbeatCallable.call(TaskReporter.java:187)
	at org.apache.tez.runtime.task.TaskReporter$HeartbeatCallable.call(TaskReporter.java:118)
	at java.util.concurrent.FutureTask.run(FutureTask.java:262)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
	at java.lang.Thread.run(Thread.java:745)
{noformat}

This looks like it could be related to the hang, but the hang does not happen immediately afterwards; it happens some time later. I have gone through quite a few JIRAs and saw that there were problems with locks and hanging threads before, which should be fixed, but it still happens. I have tried 0.6.1 and 0.7.0; both show the same behaviour. This gist contains a thread dump of a hanging build: https://gist.github.com/fs111/1ee44469bf5cc31e5a52

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-1883 PreCommit Build #744
Jira: https://issues.apache.org/jira/browse/TEZ-1883
Build: https://builds.apache.org/job/PreCommit-TEZ-Build/744/

###
## LAST 60 LINES OF THE CONSOLE
###

[...truncated 2545 lines...]

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12735492/TEZ-1883.5.txt
against master revision 9dabf94.

{color:green}+1 @author{color}. The patch does not contain any @author tags.

{color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files.

{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.

{color:green}+1 javadoc{color}. There were no new javadoc warning messages.

{color:red}-1 findbugs{color}. The patch appears to introduce 1 new Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.

{color:red}-1 core tests{color}. The following test timeouts occurred in: org.apache.tez.dag.app.dag.impl.TestVertexImpl

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/744//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-TEZ-Build/744//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/744//console

This message is automatically generated.

======
Adding comment to Jira.
======
Comment added.
848d5cf082251406a0dc1162af54cf959a4d58e2 logged out
======
Finished build.
======

Build step 'Execute shell' marked build as failure
Archiving artifacts
Sending artifact delta relative to PreCommit-TEZ-Build #737
Archived 47 artifacts
Archive block size is 32768
Received 22 blocks and 2167622 bytes
Compression is 25.0%
Took 1.1 sec
[description-setter] Could not determine description.
Recording test results
Email was triggered for: Failure
Sending email for trigger: Failure

###
## FAILED TESTS (if any)
###
All tests passed
[jira] [Comment Edited] (TEZ-2490) TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
[ https://issues.apache.org/jira/browse/TEZ-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560295#comment-14560295 ]

Rajesh Balamohan edited comment on TEZ-2490 at 5/27/15 2:43 AM:
----------------------------------------------------------------

[~sseth], [~hitesh], [~pramachandran] - Please review. Tested with 2.2, 2.4, 2.6. (related hadoop jira HADOOP-11243)

was (Author: rajesh.balamohan):
[~sseth], [~hitesh] - Please review. Tested with 2.2, 2.4, 2.6. (related hadoop jira HADOOP-11243)

TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
------------------------------------------------

Key: TEZ-2490
URL: https://issues.apache.org/jira/browse/TEZ-2490
Project: Apache Tez
Issue Type: Bug
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
Attachments: TEZ-2490.1.patch

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Siddharth Seth updated TEZ-1883:
--------------------------------
Attachment: TEZ-1883.4.txt

Added excludes for the DAGAM Inconsistent sync warnings. [~hitesh] - please review.

Change findbugs version to 3.x
------------------------------

Key: TEZ-1883
URL: https://issues.apache.org/jira/browse/TEZ-1883
Project: Apache Tez
Issue Type: Bug
Reporter: Hitesh Shah
Assignee: Siddharth Seth
Priority: Minor
Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt, TEZ-1883.4.txt

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-1954) Multiple instances of Inconsistent synchronization in org.apache.tez.dag.app.DAGAppMaster.
[ https://issues.apache.org/jira/browse/TEZ-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560154#comment-14560154 ]

Siddharth Seth commented on TEZ-1954:
-------------------------------------

Some more after findbugs3:

{noformat}
Code	Warning
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.containers; locked 80% of time
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.currentRecoveryDataDir; locked 66% of time
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.execService; locked 75% of time
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.historyEventHandler; locked 91% of time
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.nodes; locked 80% of time
IS	Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.recoveryEnabled; locked 66% of time
{noformat}

Multiple instances of Inconsistent synchronization in org.apache.tez.dag.app.DAGAppMaster.
------------------------------------------------------------------------------------------

Key: TEZ-1954
URL: https://issues.apache.org/jira/browse/TEZ-1954
Project: Apache Tez
Issue Type: Sub-task
Reporter: Hitesh Shah

Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.amTokens; locked 50% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.appMasterUgi; locked 66% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.context; locked 65% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.currentDAG; locked 72% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.state; locked 80% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.taskSchedulerEventHandler; locked 78% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.versionMismatch; locked 83% of time
Inconsistent synchronization of org.apache.tez.dag.app.DAGAppMaster.versionMismatchDiagnostics; locked 80% of time

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
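The IS warnings above flag fields that are written while holding the DAGAppMaster lock but read without it. A minimal standalone sketch (illustrative only, not Tez code) of the pattern FindBugs reports, together with the usual fix of synchronizing the reader (or making the field volatile):

```java
// Minimal illustration of the FindBugs "IS: Inconsistent synchronization" pattern:
// a field locked on most accesses but read unsynchronized on at least one path.
public class IsWarningDemo {
    private int state; // written under the monitor, sometimes read without it

    public synchronized void setState(int s) { state = s; }

    // Unsynchronized read: this is what makes FindBugs report IS, since the
    // field is "locked N% of the time" rather than always.
    public int getStateUnsafe() { return state; }

    // Typical fix: synchronize the reader too, so every access holds the lock.
    public synchronized int getStateSafe() { return state; }

    public static void main(String[] args) {
        IsWarningDemo d = new IsWarningDemo();
        d.setState(42);
        System.out.println(d.getStateSafe());
    }
}
```

Alternatively, declaring the field `volatile` removes the warning for simple reads and writes, at the cost of not covering compound check-then-act updates.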
[jira] [Commented] (TEZ-1954) Multiple instances of Inconsistent synchronization in org.apache.tez.dag.app.DAGAppMaster.
[ https://issues.apache.org/jira/browse/TEZ-1954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560185#comment-14560185 ]

Jeff Zhang commented on TEZ-1954:
---------------------------------

I believe things will change after TEZ-1273.

Multiple instances of Inconsistent synchronization in org.apache.tez.dag.app.DAGAppMaster.
------------------------------------------------------------------------------------------

Key: TEZ-1954
URL: https://issues.apache.org/jira/browse/TEZ-1954

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2304) InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery
[ https://issues.apache.org/jira/browse/TEZ-2304?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560187#comment-14560187 ]

Jeff Zhang commented on TEZ-2304:
---------------------------------

bq. Maybe createAttempt could be changed to use the last seen attempt id instead?

That should also solve this issue. But I think it would be better to recover the task attempt even if it has not started (log TaskAttemptFinishedEvent even if there's no TaskAttemptStartedEvent); otherwise we may get a wrong killedTaskAttemptCount, although that is not critical. And I believe recovery should restore the AM to the same state as the last application attempt.

InvalidStateTransitonException TA_SCHEDULE at START_WAIT during recovery
------------------------------------------------------------------------

Key: TEZ-2304
URL: https://issues.apache.org/jira/browse/TEZ-2304
Project: Apache Tez
Issue Type: Bug
Affects Versions: 0.6.0
Reporter: Jason Lowe
Labels: Recovery
Attachments: 168563_recovery.gz

I saw a Tez AM throw a few InvalidStateTransitonException (sic) instances during recovery complaining about TA_SCHEDULE arriving at the START_WAIT state.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560274#comment-14560274 ]

TezQA commented on TEZ-1883:
----------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12735492/TEZ-1883.5.txt
against master revision 9dabf94.

{color:green}+1 @author{color}. The patch does not contain any @author tags.

{color:green}+1 tests included{color}. The patch appears to include 1 new or modified test files.

{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.

{color:green}+1 javadoc{color}. There were no new javadoc warning messages.

{color:red}-1 findbugs{color}. The patch appears to introduce 1 new Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings.

{color:red}-1 core tests{color}. The following test timeouts occurred in: org.apache.tez.dag.app.dag.impl.TestVertexImpl

Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/744//testReport/
Findbugs warnings: https://builds.apache.org/job/PreCommit-TEZ-Build/744//artifact/patchprocess/newPatchFindbugsWarningstez-runtime-library.html
Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/744//console

This message is automatically generated.

Change findbugs version to 3.x
------------------------------

Key: TEZ-1883
URL: https://issues.apache.org/jira/browse/TEZ-1883
Project: Apache Tez
Issue Type: Bug
Reporter: Hitesh Shah
Assignee: Siddharth Seth
Priority: Minor
Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt, TEZ-1883.4.txt, TEZ-1883.5.txt

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560312#comment-14560312 ]

Siddharth Seth commented on TEZ-2475:
-------------------------------------

My best guess here is a RuntimeException in the LocalContainerLauncher-SubTaskRunner thread while creating a TezChild instance. These exceptions aren't caught or logged anywhere. I'm assuming the trace and the logs on this jira are unrelated. This is the last message during TezChild creation:

{code}
2015-05-26 13:10:23,128 WARN [LocalContainerLauncher-SubTaskRunner] token.Token (Token.java:getClassForIdentifier(121)) - Cannot find class for token kind tez.job
{code}

After this, the LocalTaskExecutionThread doesn't show up at all - which leads me to believe the failure happened during TezChild construction itself. The previous container holding on to the thread (single thread pool) would have generated log messages when it tried fetching new work.

A patch to at least log exceptions when the sub-task-runner is about to die should be simple, and should help diagnose this further. [~fs111] - is it possible to get instructions on how to reproduce this? Also a set of logs / a stack trace from the next time this happens.

Tez local mode hanging in big testsuite
---------------------------------------

Key: TEZ-2475
URL: https://issues.apache.org/jira/browse/TEZ-2475

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
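The fix Siddharth proposes - logging exceptions before the sub-task-runner thread dies - can be sketched generically (class and method names here are illustrative, not actual Tez code). A task submitted to an executor via submit() has its RuntimeException captured silently in the returned Future; wrapping the task logs the failure before it escapes:

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Illustrative sketch: wrap work submitted to a single-thread pool so that a
// RuntimeException (e.g. thrown while constructing a child task runner) is
// logged instead of being swallowed by the executor's FutureTask.
public class LoggingSubTaskRunner {
    static Runnable logged(Runnable task) {
        return () -> {
            try {
                task.run();
            } catch (RuntimeException e) {
                // Without this, the failure leaves no trace in the logs and the
                // pool thread simply stops picking up new work.
                System.err.println("Sub-task runner dying with exception: " + e);
                throw e;
            }
        };
    }

    public static void main(String[] args) throws InterruptedException {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        // submit() would otherwise hide this exception inside the Future.
        pool.submit(logged(() -> { throw new RuntimeException("boom"); }));
        pool.shutdown();
        pool.awaitTermination(5, TimeUnit.SECONDS);
    }
}
```

An equivalent alternative is installing a Thread.UncaughtExceptionHandler via a ThreadFactory, which covers tasks started with execute() but not those wrapped by submit().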
[jira] [Updated] (TEZ-2467) document tez-history-parser usage
[ https://issues.apache.org/jira/browse/TEZ-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Rajesh Balamohan updated TEZ-2467:
----------------------------------
Target Version/s: 0.8.0

document tez-history-parser usage
---------------------------------

Key: TEZ-2467
URL: https://issues.apache.org/jira/browse/TEZ-2467
Project: Apache Tez
Issue Type: Improvement
Reporter: Rajesh Balamohan
Assignee: Rajesh Balamohan
Attachments: TEZ-2467.1.patch

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560243#comment-14560243 ]

Jeff Zhang commented on TEZ-2488:
---------------------------------

[~hitesh] Here the DAG specifies a memory request beyond the limit of the YARN scheduler's RM_SCHEDULER_MAXIMUM_ALLOCATION_MB property. This causes SCHEDULING_SERVICE_ERROR, which causes the AM to shut down. Ideally I think this should only fail the DAG, and the AM should be able to continue to serve the next DAG. But it is hard to identify whether the SCHEDULING_SERVICE_ERROR is caused by the DAG or by other reasons, so I think shutting down the AM is reasonable here. One thing we can do is add the error to the diagnostics to propagate it to the client side. Any thoughts?

Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
------------------------------------------------------------------------------

Key: TEZ-2488
URL: https://issues.apache.org/jira/browse/TEZ-2488
Project: Apache Tez
Issue Type: Bug
Reporter: Hitesh Shah
Priority: Critical
Attachments: applogs.txt

{noformat}
2015-05-26 21:54:03,485 ERROR [AMRM Heartbeater thread] impl.AMRMClientAsyncImpl: Exception on heartbeat
org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory < 0, or requested memory > max configured, requestedMemory=682, maxMemory=512
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249)
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226)
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234)
	at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98)
	at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505)
	at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationMasterProtocolPBServiceImpl.allocate(ApplicationMasterProtocolPBServiceImpl.java:60)
	at org.apache.hadoop.yarn.proto.ApplicationMasterProtocol$ApplicationMasterProtocolService$2.callBlockingMethod(ApplicationMasterProtocol.java:99)
	at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616)
	at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049)
	at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:422)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657)
	at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2043)

	at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
	at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
	at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
	at java.lang.reflect.Constructor.newInstance(Constructor.java:422)
	at org.apache.hadoop.yarn.ipc.RPCUtil.instantiateException(RPCUtil.java:53)
	at org.apache.hadoop.yarn.ipc.RPCUtil.unwrapAndThrowException(RPCUtil.java:101)
	at org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:79)
	at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source)
	at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
2015-05-26 21:54:03,495 INFO [Dispatcher thread: Central] app.DAGAppMaster: Error in the TaskScheduler. Shutting down.
org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory < 0, or requested memory > max configured, requestedMemory=682, maxMemory=512
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249)
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226)
	at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234)
	at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98)
	at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505)
{noformat}
[jira] [Commented] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560265#comment-14560265 ]

Hitesh Shah commented on TEZ-2488:
----------------------------------

The fix would be to never go to the RM with invalid data. On registering with the RM, it sends back a RegisterApplicationMasterResponse in the registerAppMaster() call. The response object has getMaximumResourceCapability, which can be used to do basic checks on the resources being requested before making the request. By doing this check in, say, DAG initialization, we can fail the DAG before making any allocation request calls to the RM. The check would need to be done for all the vertices (and the configured task settings). If we enhance the VertexManager at some point, this check will need to be done every time the VertexManager modifies the resources needed, throwing an error back to the VM in such cases.

Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
------------------------------------------------------------------------------

Key: TEZ-2488
URL: https://issues.apache.org/jira/browse/TEZ-2488
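Hitesh's proposed check can be sketched in isolation (the types and method below are simplified stand-ins, not the actual Tez or YARN API): compare each vertex's requested resources against the maximum capability returned at AM registration, and fail the DAG locally instead of sending an invalid allocate() to the RM:

```java
import java.util.Map;

// Illustrative sketch of validating per-vertex resource requests against the
// cluster maximum (as returned by getMaximumResourceCapability()) before any
// allocation request reaches the ResourceManager.
public class ResourceCheck {
    /**
     * vertexResources maps vertex name -> {memoryMb, vcores}.
     * Throws if any vertex asks for a non-positive amount or more than the
     * RM would accept, mirroring YARN's InvalidResourceRequestException check.
     */
    static void validate(Map<String, int[]> vertexResources, int maxMemMb, int maxVcores) {
        for (Map.Entry<String, int[]> e : vertexResources.entrySet()) {
            int mem = e.getValue()[0];
            int vcores = e.getValue()[1];
            if (mem <= 0 || mem > maxMemMb || vcores <= 0 || vcores > maxVcores) {
                throw new IllegalArgumentException("Vertex " + e.getKey()
                    + " requests " + mem + "MB/" + vcores + " vcores, cluster max is "
                    + maxMemMb + "MB/" + maxVcores + " vcores");
            }
        }
    }

    public static void main(String[] args) {
        // Mirrors the log above: requestedMemory=682 against maxMemory=512 fails fast,
        // so only this DAG is rejected and the AM keeps running.
        try {
            validate(Map.of("v1", new int[]{682, 1}), 512, 8);
        } catch (IllegalArgumentException ex) {
            System.out.println("DAG rejected: " + ex.getMessage());
        }
    }
}
```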
[jira] [Commented] (TEZ-2440) Sorter should check for indexCacheList.size() in flush()
[ https://issues.apache.org/jira/browse/TEZ-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560324#comment-14560324 ]

Rajesh Balamohan commented on TEZ-2440:
---------------------------------------

Thanks [~mitdesai]. Can you please rebase the patch for the master branch? indexCacheList.isEmpty() might be an easier check.

Sorter should check for indexCacheList.size() in flush()
--------------------------------------------------------

Key: TEZ-2440
URL: https://issues.apache.org/jira/browse/TEZ-2440
Project: Apache Tez
Issue Type: Bug
Reporter: Rajesh Balamohan
Assignee: Mit Desai
Attachments: TEZ-2440-1.patch

{noformat}
2015-05-11 20:28:20,225 INFO [main] task.TezTaskRunner: Shutdown requested... returning
2015-05-11 20:28:20,225 INFO [main] task.TezChild: Got a shouldDie notification via hearbeats. Shutting down
2015-05-11 20:28:20,231 INFO [TezChild] impl.PipelinedSorter: Thread interrupted, cleaned up stale data, sorter threads shutdown=true, terminated=false
2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Joining on EventRouter
2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Ignoring interrupt while waiting for the router thread to die
2015-05-11 20:28:20,232 INFO [TezChild] task.TezTaskRunner: Encounted an error while executing task: attempt_1429683757595_0875_1_07_00_0
java.lang.ArrayIndexOutOfBoundsException: -1
	at java.util.ArrayList.elementData(ArrayList.java:418)
	at java.util.ArrayList.get(ArrayList.java:431)
	at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.flush(PipelinedSorter.java:462)
	at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.close(OrderedPartitionedKVOutput.java:183)
	at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.close(LogicalIOProcessorRuntimeTask.java:360)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:181)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:422)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167)
	at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
{noformat}

When a DAG is killed in the middle, these exceptions are sometimes thrown (e.g. q_17 in TPC-DS). Even though it is completely harmless, it would be better to fix it to avoid distraction when debugging.

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
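The suggested isEmpty() guard can be sketched on a simplified stand-in for the sorter's flush path (not the actual PipelinedSorter class): when a task is interrupted before any spill is recorded, indexing the spill-index list at size() - 1 hits index -1, and checking emptiness first avoids the ArrayIndexOutOfBoundsException:

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch of the TEZ-2440 fix: guard flush() so an interrupted
// task with no recorded spills does not index indexCacheList at -1.
public class SorterFlushGuard {
    private final List<String> indexCacheList = new ArrayList<>();

    void addSpillIndex(String spillRecord) {
        indexCacheList.add(spillRecord);
    }

    /** Returns the last spill index, or null if the task never spilled. */
    String flush() {
        if (indexCacheList.isEmpty()) {
            // Interrupted/killed before any spill: nothing to merge, so skip
            // the get(size() - 1) that would throw ArrayIndexOutOfBoundsException.
            return null;
        }
        return indexCacheList.get(indexCacheList.size() - 1);
    }

    public static void main(String[] args) {
        SorterFlushGuard sorter = new SorterFlushGuard();
        System.out.println(sorter.flush()); // no spills: returns null, no exception
        sorter.addSpillIndex("spill-0.index");
        System.out.println(sorter.flush());
    }
}
```

isEmpty() reads slightly clearer than comparing size() against zero, which is likely why it was suggested as the easier check.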
[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560220#comment-14560220 ]

Jeff Zhang commented on TEZ-2475:
---------------------------------

[~sseth] Is it related to TEZ-1802? I see the following messages at the end of the logs:

{noformat}
2015-05-26 13:10:23,128 WARN [LocalContainerLauncher-SubTaskRunner] token.Token (Token.java:getClassForIdentifier(121)) - Cannot find class for token kind tez.job
2015-05-26 13:10:23,128 WARN [LocalContainerLauncher-SubTaskRunner] token.Token (Token.java:getClassForIdentifier(121)) - Cannot find class for token kind tez.job
Kind: tez.job, Service: application_1432638619418_0001, Ident: 1e 61 70 70 6c 69 63 61 74 69 6f 6e 5f 31 34 33 32 36 33 38 36 31 39 34 31 38 5f 30 30 30 31
2015-05-26 13:12:23,155 INFO [cascading shutdown hooks] flow.Flow (BaseFlow.java:logInfo(1433)) - [20150526-131019-64BE78...] shutdown hook calling stop on flow
2015-05-26 13:12:23,155 INFO [cascading shutdown hooks] flow.Flow (BaseFlow.java:logInfo(1433)) - [20150526-131019-64BE78...] stopping all jobs
2015-05-26 13:12:23,156 INFO [cascading shutdown hooks] flow.Flow (BaseFlow.java:logInfo(1433)) - [20150526-131019-64BE78...] stopping: (1/1) ...26-131019-64BE78F366.tcsv
{noformat}

Tez local mode hanging in big testsuite
---------------------------------------

Key: TEZ-2475
URL: https://issues.apache.org/jira/browse/TEZ-2475

--
This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560217#comment-14560217 ] TezQA commented on TEZ-1883: {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735475/TEZ-1883.4.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/742//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/742//console This message is automatically generated. Change findbugs version to 3.x --- Key: TEZ-1883 URL: https://issues.apache.org/jira/browse/TEZ-1883 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Assignee: Siddharth Seth Priority: Minor Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt, TEZ-1883.4.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-1883 PreCommit Build #742
Jira: https://issues.apache.org/jira/browse/TEZ-1883 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/742/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 2538 lines...] {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735475/TEZ-1883.4.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/742//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/742//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. be222332e57a87b4e74afc091305a27a6cae1204 logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts Sending artifact delta relative to PreCommit-TEZ-Build #737 Archived 47 artifacts Archive block size is 32768 Received 28 blocks and 1986589 bytes Compression is 31.6% Took 0.88 sec [description-setter] Could not determine description. Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## All tests passed
[jira] [Updated] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated TEZ-1883: Attachment: TEZ-1883.5.txt Attempting to get the findbugs version fixed in the report. Change findbugs version to 3.x --- Key: TEZ-1883 URL: https://issues.apache.org/jira/browse/TEZ-1883 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Assignee: Siddharth Seth Priority: Minor Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt, TEZ-1883.4.txt, TEZ-1883.5.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2467) document tez-history-parser usage
[ https://issues.apache.org/jira/browse/TEZ-2467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560259#comment-14560259 ] TezQA commented on TEZ-2467: {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12734060/TEZ-2467.1.patch against master revision 9dabf94. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/743//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/743//console This message is automatically generated. document tez-history-parser usage - Key: TEZ-2467 URL: https://issues.apache.org/jira/browse/TEZ-2467 Project: Apache Tez Issue Type: Improvement Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Attachments: TEZ-2467.1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-2467 PreCommit Build #743
Jira: https://issues.apache.org/jira/browse/TEZ-2467 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/743/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 2532 lines...] {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12734060/TEZ-2467.1.patch against master revision 9dabf94. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/743//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/743//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. a15227eed89bd1e06af1c5f4b0af62dec73a6679 logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts Sending artifact delta relative to PreCommit-TEZ-Build #737 Archived 47 artifacts Archive block size is 32768 Received 8 blocks and 2626338 bytes Compression is 9.1% Took 3 sec [description-setter] Could not determine description. Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## All tests passed
[jira] [Created] (TEZ-2490) TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
Rajesh Balamohan created TEZ-2490: - Summary: TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability Key: TEZ-2490 URL: https://issues.apache.org/jira/browse/TEZ-2490 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated TEZ-2475: Attachment: TEZ-2475.debug.1.txt Adds some debug logging to the subTaskRunner. [~fs111] - could you try this out please? Patch applies on 0.6. Also, did you see any strange GC activity for this process? I won't be surprised if this were an OOM, though the client heartbeat continued on for 2 minutes. This looks like it's running in non-session mode, and I don't think tezClient.stop() is being called after each job completes. That leaves AppMaster instances hanging around. Tez local mode hanging in big testsuite --- Key: TEZ-2475 URL: https://issues.apache.org/jira/browse/TEZ-2475 Project: Apache Tez Issue Type: Bug Affects Versions: 0.7.0, 0.6.1 Reporter: André Kelpe Attachments: 2015-05-21_15-55-20_buildLog.log.gz, TEZ-2475.debug.1.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
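The missing-`stop()` point above is the general resource-leak pattern to look for: in non-session mode each client owns its own AppMaster, so every submitted job needs a matching stop. A sketch of the pairing with a stand-in class (hypothetical names, not the real `org.apache.tez.client.TezClient` API, which exposes `start()`/`submitDAG()`/`stop()`):

```java
// Stand-in for the per-job lifecycle described in the comment above
// (hypothetical class): a job that skips stop() leaves the AppMaster's
// threads alive, a plausible way for a 6000-test local-mode suite to
// accumulate stuck state over time.
public class PerJobClient {
    private boolean amRunning;

    void start()     { amRunning = true; }   // stands in for launching the AM
    void submitDag() { /* run the DAG and wait for completion */ }
    void stop()      { amRunning = false; }  // stands in for tearing the AM down

    boolean isAmRunning() { return amRunning; }

    /** Runs one job with stop() guaranteed, even if the DAG throws. */
    static PerJobClient runOneJob() {
        PerJobClient client = new PerJobClient();
        client.start();
        try {
            client.submitDag();
        } finally {
            client.stop();   // the call the comment suspects is missing
        }
        return client;
    }
}
```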
[jira] [Commented] (TEZ-2440) Sorter should check for indexCacheList.size() in flush()
[ https://issues.apache.org/jira/browse/TEZ-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560357#comment-14560357 ] Mit Desai commented on TEZ-2440: Yes. I was based on branch 0.7. I will post another patch tomorrow. Sorter should check for indexCacheList.size() in flush() Key: TEZ-2440 URL: https://issues.apache.org/jira/browse/TEZ-2440 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Mit Desai Attachments: TEZ-2440-1.patch {noformat} 015-05-11 20:28:20,225 INFO [main] task.TezTaskRunner: Shutdown requested... returning 2015-05-11 20:28:20,225 INFO [main] task.TezChild: Got a shouldDie notification via hearbeats. Shutting down 2015-05-11 20:28:20,231 INFO [TezChild] impl.PipelinedSorter: Thread interrupted, cleaned up stale data, sorter threads shutdown=true, terminated=false 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Joining on EventRouter 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Ignoring interrupt while waiting for the router thread to die 2015-05-11 20:28:20,232 INFO [TezChild] task.TezTaskRunner: Encounted an error while executing task: attempt_1429683757595_0875_1_07_00_0 java.lang.ArrayIndexOutOfBoundsException: -1 at java.util.ArrayList.elementData(ArrayList.java:418) at java.util.ArrayList.get(ArrayList.java:431) at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.flush(PipelinedSorter.java:462) at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.close(OrderedPartitionedKVOutput.java:183) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.close(LogicalIOProcessorRuntimeTask.java:360) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:181) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171) at java.security.AccessController.doPrivileged(Native Method) at 
javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167) at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) {noformat} When a DAG is killed in the middle, sometimes these exceptions are thrown (e.g q_17 in TPC-DS). Even though it is completely harmless, it would be better to fix it to avoid distraction when debugging -- This message was sent by Atlassian JIRA (v6.3.4#6332)
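The `ArrayIndexOutOfBoundsException: -1` in the trace is the signature of indexing an empty list: `get(size() - 1)` becomes `get(-1)`. A sketch of the guard the issue title asks for, using a plain `List` rather than the actual `PipelinedSorter` internals:

```java
import java.util.List;

// Sketch only, not the real PipelinedSorter code: flush() reads the last
// entry of indexCacheList, which is empty when the task is killed before
// any spill completes -- hence get(size() - 1) == get(-1) and the
// ArrayIndexOutOfBoundsException: -1 seen above. Checking for an empty
// list first makes the (harmless) failure path clean.
public class IndexCacheGuard {
    static boolean flush(List<int[]> indexCacheList) {
        if (indexCacheList.isEmpty()) {   // the missing size() check
            return false;                 // nothing spilled; nothing to merge
        }
        int[] lastSpillIndex = indexCacheList.get(indexCacheList.size() - 1);
        return lastSpillIndex != null;    // stand-in for the real merge work
    }
}
```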
[jira] [Commented] (TEZ-2490) TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
[ https://issues.apache.org/jira/browse/TEZ-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560356#comment-14560356 ] Siddharth Seth commented on TEZ-2490: - +1. Would be worth putting into a shim at a later point. TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability Key: TEZ-2490 URL: https://issues.apache.org/jira/browse/TEZ-2490 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Attachments: TEZ-2490.1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (TEZ-2490) TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
[ https://issues.apache.org/jira/browse/TEZ-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560363#comment-14560363 ] Rajesh Balamohan edited comment on TEZ-2490 at 5/27/15 3:52 AM: Thanks [~sseth]. Committed to master. commit dac59a2aa71aab5daaa6fabdda9d8f48539e1bda was (Author: rajesh.balamohan): Thanks [~sseth] commit dac59a2aa71aab5daaa6fabdda9d8f48539e1bda TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability Key: TEZ-2490 URL: https://issues.apache.org/jira/browse/TEZ-2490 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Fix For: 0.8.0 Attachments: TEZ-2490.1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2483) Tez should close task if processor fail
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeff Zhang updated TEZ-2483: Attachment: TEZ-2483-3.patch Tez should close task if processor fail --- Key: TEZ-2483 URL: https://issues.apache.org/jira/browse/TEZ-2483 Project: Apache Tez Issue Type: Bug Reporter: Daniel Dai Assignee: Daniel Dai Attachments: TEZ-2483-1.patch, TEZ-2483-2.patch, TEZ-2483-3.patch The symptom is if PigProcessor fail, MRInput is not closed. On Windows, this creates a problem since Pig client cannot remove the input file. In general, if a task fail, Tez shall close all input/output handles in cleanup. MROutput is closed in MROutput.abort() which Pig invokes explicitly right now. Attach a demo patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
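The fix the description asks for is the standard close-after-failure pattern: the framework, not the processor, owns closing the input/output handles, so a throwing processor still releases them. A sketch with stand-in `AutoCloseable` handles (hypothetical method and class names, not the Tez runtime API):

```java
import java.util.List;

// Sketch of the cleanup TEZ-2483 asks for, with stand-in types: even when
// the processor throws (as PigProcessor does in the report), every handle
// is still closed, so e.g. an input's file handle is released and the
// client can delete the file on Windows.
public class CloseOnFailure {
    static int runThenClose(Runnable processor, List<AutoCloseable> handles) {
        RuntimeException failure = null;
        try {
            processor.run();               // may throw
        } catch (RuntimeException e) {
            failure = e;                   // remember it; do not skip cleanup
        }
        int closed = 0;
        for (AutoCloseable h : handles) {
            try {
                h.close();
                closed++;
            } catch (Exception e) {
                // log and keep closing the remaining handles
            }
        }
        if (failure != null) {
            // the real runtime would mark the attempt failed here
            System.out.println("processor failed after closing " + closed
                + " handle(s): " + failure.getMessage());
        }
        return closed;
    }
}
```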
[jira] [Commented] (TEZ-2490) TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability
[ https://issues.apache.org/jira/browse/TEZ-2490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560336#comment-14560336 ] TezQA commented on TEZ-2490: {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735499/TEZ-2490.1.patch against master revision 9dabf94. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/745//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/745//console This message is automatically generated. TEZ-2450 breaks Hadoop 2.2 and 2.4 compatability Key: TEZ-2490 URL: https://issues.apache.org/jira/browse/TEZ-2490 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Attachments: TEZ-2490.1.patch -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-2490 PreCommit Build #745
Jira: https://issues.apache.org/jira/browse/TEZ-2490 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/745/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 3013 lines...] {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735499/TEZ-2490.1.patch against master revision 9dabf94. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/745//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/745//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. 9b4860e2528902398c822cbeb2e8af38cbf72870 logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts Sending artifact delta relative to PreCommit-TEZ-Build #737 Archived 47 artifacts Archive block size is 32768 Received 22 blocks and 2199772 bytes Compression is 24.7% Took 0.9 sec [description-setter] Could not determine description. Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## All tests passed
[jira] [Commented] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558974#comment-14558974 ] André Kelpe commented on TEZ-2475: -- I have attached the output of a run with loglevel set to DEBUG. After a while the process just stopped and it kept in logging RpcProtobufEngine messages. Tez local mode hanging in big testsuite --- Key: TEZ-2475 URL: https://issues.apache.org/jira/browse/TEZ-2475 Project: Apache Tez Issue Type: Bug Affects Versions: 0.7.0, 0.6.1 Reporter: André Kelpe Attachments: 2015-05-21_15-55-20_buildLog.log.gz -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558974#comment-14558974 ] André Kelpe edited comment on TEZ-2475 at 5/26/15 11:15 AM: I have attached the output of a run with loglevel set to DEBUG. After a while the process just stopped and it kept in logging RpcProtobufEngine messages. Edit: The file was too big for JIRA, please use this link: https://www.dropbox.com/s/41ugvhyb3lb2d5c/2015-05-26_13-00-07_buildLog.log.gz?dl=0 was (Author: fs111): I have attached the output of a run with loglevel set to DEBUG. After a while the process just stopped and it kept in logging RpcProtobufEngine messages. Tez local mode hanging in big testsuite --- Key: TEZ-2475 URL: https://issues.apache.org/jira/browse/TEZ-2475 Project: Apache Tez Issue Type: Bug Affects Versions: 0.7.0, 0.6.1 Reporter: André Kelpe Attachments: 2015-05-21_15-55-20_buildLog.log.gz -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (TEZ-2475) Tez local mode hanging in big testsuite
[ https://issues.apache.org/jira/browse/TEZ-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14558974#comment-14558974 ] André Kelpe edited comment on TEZ-2475 at 5/26/15 11:23 AM: I have attached the output of a run with loglevel set to DEBUG. After a while the process just stopped and it kept on logging RpcProtobufEngine messages. Edit: The file was too big for JIRA, please use this link: https://www.dropbox.com/s/41ugvhyb3lb2d5c/2015-05-26_13-00-07_buildLog.log.gz?dl=0 was (Author: fs111): I have attached the output of a run with loglevel set to DEBUG. After a while the process just stopped and it kept in logging RpcProtobufEngine messages. Edit: The file was too big for JIRA, please use this link: https://www.dropbox.com/s/41ugvhyb3lb2d5c/2015-05-26_13-00-07_buildLog.log.gz?dl=0 Tez local mode hanging in big testsuite --- Key: TEZ-2475 URL: https://issues.apache.org/jira/browse/TEZ-2475 Project: Apache Tez Issue Type: Bug Affects Versions: 0.7.0, 0.6.1 Reporter: André Kelpe Attachments: 2015-05-21_15-55-20_buildLog.log.gz -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559573#comment-14559573 ] Jonathan Eagles commented on TEZ-2485: -- I'll try to post a breakdown of what is taking up the most space soon so we can start brainstorming. Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent MapReduce job. Based on the storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it would be good if Tez could reduce its pressure on the timeline server by auditing both the number of events and the size of events. Here are some observations based on my understanding of the design of timeline stores. Each timeline entity pushed explodes into many records in the database:
- 1 marker record
- 1 domain record
- 1 record per event
- 2 records per related entity
- 2 records per primary filter (2 records per primary filter in RollingLevelDBTimelineStore; plain leveldb rewrites the entire entity record per primary filter)
- 1 record per other-info entry
For example, Task Attempt Start: 1 marker + 1 domain + 1 task attempt start event + 1 related entity x 2 + 7 other info entries + 4 primary filters x 2 = 20 records written to the database. Task Attempt Finish: 1 marker + 1 domain + 1 task attempt finish event + 1 related entity x 2 + 5 other info entries + 5 primary filters x 2 = 20 records written to the database. = QUESTION: = Is there any data we are publishing to the timeline server that is not in the UI? Do we use all the entities (TEZ_CONTAINER_ID for example)? Do we use all the primary filters? Do we use all the related entities specified? Are there any fields we don't use? Are there other approaches to consider to reduce entity count/size? Is there a way to store the same information in less space? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
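The per-entity counts above reduce to one formula, and a quick arithmetic check (plain Java, no Tez or timeline-store code) confirms both attempt events land at 20 records:

```java
// Record-count arithmetic from the comment above: each entity costs
// 1 marker + 1 domain + 1 record per event + 2 per related entity
// + 2 per primary filter + 1 per other-info entry.
public class TimelineRecordCount {
    static int records(int events, int relatedEntities, int primaryFilters, int otherInfo) {
        return 1                      // marker record
             + 1                      // domain record
             + events                 // 1 record per event
             + 2 * relatedEntities    // 2 records per related entity
             + 2 * primaryFilters     // 2 records per primary filter
             + otherInfo;             // 1 record per other-info entry
    }

    public static void main(String[] args) {
        System.out.println(records(1, 1, 4, 7)); // task attempt start
        System.out.println(records(1, 1, 5, 5)); // task attempt finish
    }
}
```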
[jira] [Created] (TEZ-2485) Reduce the Resource Load on the Timeline Server
Jonathan Eagles created TEZ-2485: Summary: Reduce the Resource Load on the Timeline Server Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2391) TestVertexImpl timing out at times on jenkins builds
[ https://issues.apache.org/jira/browse/TEZ-2391?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559607#comment-14559607 ] Mit Desai commented on TEZ-2391: [~bikassaha] so do we want to increase the timeout or want to have a different approach to fix this problem? TestVertexImpl timing out at times on jenkins builds - Key: TEZ-2391 URL: https://issues.apache.org/jira/browse/TEZ-2391 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Assignee: Mit Desai Attachments: TEZ-2391.patch, TestVertexImpl-output.txt For example, https://builds.apache.org/job/Tez-Build/1028/console -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-1529) ATS and TezClient integration in secure kerberos enabled cluster
[ https://issues.apache.org/jira/browse/TEZ-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559505#comment-14559505 ] Hitesh Shah commented on TEZ-1529: -- Minor nit: {code} if (!TimelineReaderFactory.isTimelineClientSupported()) { throw new TezException("Reading from Timeline is not supported"); } {code} The exception message should be a bit more descriptive about why it may not be supported. +1 once the above is fixed. Feel free to commit after fixing the exception message. ATS and TezClient integration in secure kerberos enabled cluster - Key: TEZ-1529 URL: https://issues.apache.org/jira/browse/TEZ-1529 Project: Apache Tez Issue Type: Bug Reporter: Prakash Ramachandran Assignee: Prakash Ramachandran Priority: Blocker Attachments: TEZ-1529-branch6.2.patch, TEZ-1529.1.patch, TEZ-1529.2.patch, TEZ-1529.3.patch, TEZ-1529.4.patch, TEZ-1529.5.patch This is a follow up for TEZ-1495, which addresses ATS - TezClient integration; however, it does not enable it in a secure kerberos enabled cluster. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
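The suggested change might look like the following sketch. The message wording and the boolean stand-in for TimelineReaderFactory.isTimelineClientSupported() are illustrative, not the committed code.

```java
public class TimelineCheckDemo {

    /** The original message was just "Reading from Timeline is not supported";
     *  this wording (illustrative, not the committed text) explains why. */
    static String unsupportedMessage() {
        return "Reading from Timeline is not supported: no compatible timeline "
             + "client was found; check that the YARN timeline service is enabled "
             + "and that its client libraries are on the classpath.";
    }

    // Stand-in for the real check, which inspects the classpath/YARN version.
    static void checkTimelineSupported(boolean supported) throws Exception {
        if (!supported) {
            throw new Exception(unsupportedMessage());
        }
    }

    public static void main(String[] args) throws Exception {
        checkTimelineSupported(true); // no-op when supported
        System.out.println(unsupportedMessage());
    }
}
```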
[jira] [Commented] (TEZ-2483) Tez should close task if processor fail
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559521#comment-14559521 ] Siddharth Seth commented on TEZ-2483: - Agree with what Rajesh said. It would be better to add this to cleanup. Another thing to consider is exceptions that are thrown while invoking close on a failed Processor / Input / Output - those should be ignored so that each Input/Output still gets closed. The TEZ-2003 branch already has some of this code in place. Tez should close task if processor fail --- Key: TEZ-2483 URL: https://issues.apache.org/jira/browse/TEZ-2483 Project: Apache Tez Issue Type: Bug Reporter: Daniel Dai Fix For: 0.7.1 Attachments: TEZ-2483-1.patch, TEZ-2483-2.patch The symptom is that if PigProcessor fails, MRInput is not closed. On Windows, this creates a problem since the Pig client cannot remove the input file. In general, if a task fails, Tez should close all input/output handles in cleanup. MROutput is closed in MROutput.abort(), which Pig invokes explicitly right now. Attaching a demo patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
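The cleanup behaviour discussed in this comment - close every Input/Output on failure and swallow individual close() exceptions so one failure cannot prevent the rest from being cleaned up - can be sketched as follows. The Closeable2 interface and closeAll helper are hypothetical stand-ins, not the actual Tez runtime types.

```java
import java.util.Arrays;
import java.util.List;

public class CleanupSketch {

    // Stand-in for Tez's Input/Output close() methods, which may throw.
    interface Closeable2 {
        void close() throws Exception;
    }

    /** Closes every element, ignoring individual failures so one bad
     *  close() cannot prevent the rest from being cleaned up.
     *  Returns the number of failed closes. */
    static int closeAll(List<Closeable2> closeables) {
        int failures = 0;
        for (Closeable2 c : closeables) {
            try {
                c.close();
            } catch (Exception e) {
                failures++; // a real implementation would log and continue
            }
        }
        return failures;
    }

    public static void main(String[] args) {
        List<Closeable2> io = Arrays.asList(
            () -> { throw new Exception("input close failed"); },
            () -> { /* output closes fine */ });
        System.out.println(closeAll(io)); // 1 failure, but both closes were attempted
    }
}
```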
[jira] [Assigned] (TEZ-2440) Sorter should check for indexCacheList.size() in flush()
[ https://issues.apache.org/jira/browse/TEZ-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mit Desai reassigned TEZ-2440: -- Assignee: Mit Desai Sorter should check for indexCacheList.size() in flush() Key: TEZ-2440 URL: https://issues.apache.org/jira/browse/TEZ-2440 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Mit Desai {noformat} 015-05-11 20:28:20,225 INFO [main] task.TezTaskRunner: Shutdown requested... returning 2015-05-11 20:28:20,225 INFO [main] task.TezChild: Got a shouldDie notification via hearbeats. Shutting down 2015-05-11 20:28:20,231 INFO [TezChild] impl.PipelinedSorter: Thread interrupted, cleaned up stale data, sorter threads shutdown=true, terminated=false 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Joining on EventRouter 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Ignoring interrupt while waiting for the router thread to die 2015-05-11 20:28:20,232 INFO [TezChild] task.TezTaskRunner: Encounted an error while executing task: attempt_1429683757595_0875_1_07_00_0 java.lang.ArrayIndexOutOfBoundsException: -1 at java.util.ArrayList.elementData(ArrayList.java:418) at java.util.ArrayList.get(ArrayList.java:431) at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.flush(PipelinedSorter.java:462) at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.close(OrderedPartitionedKVOutput.java:183) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.close(LogicalIOProcessorRuntimeTask.java:360) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:181) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) at 
org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167) at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) {noformat} When a DAG is killed in the middle, sometimes these exceptions are thrown (e.g. q_17 in TPC-DS). Even though it is completely harmless, it would be better to fix it to avoid distraction when debugging. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
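A sketch of the suggested guard: PipelinedSorter.flush() indexes the spill index list with (size - 1), which on an empty list yields exactly the ArrayIndexOutOfBoundsException: -1 in the stack trace above. The names echo the report, but this is not the actual PipelinedSorter code.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class FlushGuardSketch {

    /** Returns the last spill's index entry, or null if nothing was spilled.
     *  Without the isEmpty() check, get(size - 1) on an empty list throws
     *  ArrayIndexOutOfBoundsException: -1, as seen in the report. */
    static String lastSpillIndex(List<String> indexCacheList) {
        if (indexCacheList.isEmpty()) {
            // The task was interrupted before any spill completed
            // (e.g. a DAG killed mid-run); there is nothing to flush.
            return null;
        }
        return indexCacheList.get(indexCacheList.size() - 1);
    }

    public static void main(String[] args) {
        System.out.println(lastSpillIndex(new ArrayList<>()));                   // null, no AIOOBE
        System.out.println(lastSpillIndex(Arrays.asList("spill_0", "spill_1"))); // spill_1
    }
}
```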
[jira] [Commented] (TEZ-2478) Move OneToOne routing to store events in Tasks
[ https://issues.apache.org/jira/browse/TEZ-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559622#comment-14559622 ] TezQA commented on TEZ-2478: {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735369/TEZ-2478.1.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/739//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/739//console This message is automatically generated. Move OneToOne routing to store events in Tasks -- Key: TEZ-2478 URL: https://issues.apache.org/jira/browse/TEZ-2478 Project: Apache Tez Issue Type: Improvement Reporter: Siddharth Seth Assignee: Siddharth Seth Attachments: 1-1-wip.patch, TEZ-2478.1.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-2478 PreCommit Build #739
Jira: https://issues.apache.org/jira/browse/TEZ-2478 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/739/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 2535 lines...] {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735369/TEZ-2478.1.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified test files. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/739//testReport/ Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/739//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. 80740b09fe3515f3a7d5d594281fbe4107317ef8 logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts Sending artifact delta relative to PreCommit-TEZ-Build #737 Archived 47 artifacts Archive block size is 32768 Received 25 blocks and 2057520 bytes Compression is 28.5% Took 0.99 sec [description-setter] Could not determine description. Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## All tests passed
[jira] [Commented] (TEZ-2450) support async http clients in ordered unordered inputs
[ https://issues.apache.org/jira/browse/TEZ-2450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559747#comment-14559747 ] Siddharth Seth commented on TEZ-2450: - +1. Looks good. Thanks [~rajesh.balamohan]. support async http clients in ordered unordered inputs Key: TEZ-2450 URL: https://issues.apache.org/jira/browse/TEZ-2450 Project: Apache Tez Issue Type: Improvement Reporter: Rajesh Balamohan Assignee: Rajesh Balamohan Attachments: TEZ-2450.1.patch, TEZ-2450.2.WIP.patch, TEZ-2450.2.patch, TEZ-2450.3.patch, TEZ-2450.4.patch, TEZ-2450.WIP.patch It would be helpful to be able to switch between the JDK and other async http implementations. For LLAP scenarios, it would be useful to make http clients interruptible, which is supported in async libraries. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TEZ-2487) Scheduler should be able to preempt tasks instead of containers
Siddharth Seth created TEZ-2487: --- Summary: Scheduler should be able to preempt tasks instead of containers Key: TEZ-2487 URL: https://issues.apache.org/jira/browse/TEZ-2487 Project: Apache Tez Issue Type: Improvement Reporter: Siddharth Seth Assignee: Siddharth Seth The scheduler currently preempts containers since task level preemption was not supported. There are changes in TEZ-2003 that allow tasks to be killed. Adding support in the AM would be useful so that containers can be re-used even if a running task needs to be preempted. Assigning to myself for now. If anyone wants to take it over, please ping. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Siddharth Seth updated TEZ-1883: Attachment: TEZ-1883.3.txt Updated patch to fix the simpler findbugs warnings. Will leave the sync issues for TEZ-1900. Change findbugs version to 3.x --- Key: TEZ-1883 URL: https://issues.apache.org/jira/browse/TEZ-1883 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Assignee: Siddharth Seth Priority: Minor Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-2481) Tez UI: graphical view does not render properly on IE11
[ https://issues.apache.org/jira/browse/TEZ-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559146#comment-14559146 ] Prakash Ramachandran commented on TEZ-2481: --- +1 LGTM. Checked on IE11 and chrome. Committing shortly. Tez UI: graphical view does not render properly on IE11 --- Key: TEZ-2481 URL: https://issues.apache.org/jira/browse/TEZ-2481 Project: Apache Tez Issue Type: Bug Reporter: Sreenath Somarajapuram Assignee: Sreenath Somarajapuram Attachments: Screen-Shot-2015-05-25-at-4.02.46-PM.jpg, TEZ-2481.1.patch, TEZ-2481.2.patch, TEZ-2481.3.patch The issue was caused by IE's poor/broken support for css in SVG. # IE doesn't support transform in css like other browsers. This caused the bubbles in a vertex to appear at the origin - https://connect.microsoft.com/IE/feedbackdetail/view/920928 # IE has broken support for the marker (arrow on the path). This was causing the links/paths to disappear - https://connect.microsoft.com/IE/feedback/details/801938 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TEZ-2486) Update tez website to include links based on http://www.apache.org/foundation/marks/pmcs.html#navigation
Hitesh Shah created TEZ-2486: Summary: Update tez website to include links based on http://www.apache.org/foundation/marks/pmcs.html#navigation Key: TEZ-2486 URL: https://issues.apache.org/jira/browse/TEZ-2486 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Priority: Critical -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (TEZ-1883) Change findbugs version to 3.x
[ https://issues.apache.org/jira/browse/TEZ-1883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559882#comment-14559882 ] TezQA commented on TEZ-1883: {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735400/TEZ-1883.3.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:red}-1 findbugs{color}. The patch appears to introduce 6 new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/740//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-TEZ-Build/740//artifact/patchprocess/newPatchFindbugsWarningstez-dag.html Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/740//console This message is automatically generated. Change findbugs version to 3.x --- Key: TEZ-1883 URL: https://issues.apache.org/jira/browse/TEZ-1883 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Assignee: Siddharth Seth Priority: Minor Attachments: TEZ-1883.1.patch, TEZ-1883.2.txt, TEZ-1883.3.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
Failed: TEZ-1883 PreCommit Build #740
Jira: https://issues.apache.org/jira/browse/TEZ-1883 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/740/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 2539 lines...] {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735400/TEZ-1883.3.txt against master revision 7be325e. {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. There were no new javadoc warning messages. {color:red}-1 findbugs{color}. The patch appears to introduce 6 new Findbugs (version 2.0.3) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:red}-1 core tests{color}. The following test timeouts occurred in : org.apache.tez.dag.app.dag.impl.TestVertexImpl Test results: https://builds.apache.org/job/PreCommit-TEZ-Build/740//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-TEZ-Build/740//artifact/patchprocess/newPatchFindbugsWarningstez-dag.html Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/740//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. f2b92a8c0a5fec61c70c3cd7be659d4cb2b77d9f logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts Sending artifact delta relative to PreCommit-TEZ-Build #737 Archived 47 artifacts Archive block size is 32768 Received 8 blocks and 2783929 bytes Compression is 8.6% Took 1 sec [description-setter] Could not determine description. 
Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## All tests passed
[jira] [Commented] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559919#comment-14559919 ] Hitesh Shah commented on TEZ-2485: -- Thanks for starting this [~jeagles]. \cc [~rajesh.balamohan] [~gopalv] as they will need to look at how it impacts the job analysers and [~Sreenath] [~pramachandran] for UI impact. Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent mapreduce job. Based on the storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it will be good if Tez can reduce its pressure on the timeline server by auditing both the number of events and size of events. Here are some observations based on my understanding of the design of timeline stores: Each timeline entity pushed explodes into many records in the database 1 marker record 1 domain record 1 record per event 2 records per related entity 2 records per primary filter (2 records per primary filter in RollingLevelDBTimelineStore, in leveldb it rewrites entire entity records per primary filter ) 1 record per other info For example Task Attempt Start 1 marker 1 domain 1 task attempt start event 1 related entity X 2 7 other info entries 4 primary filters X 2 20 records written in the database for task attempt start Task Attempt Finish 1 marker 1 domain 1 task attempt finish event 1 related entity X 2 5 other info entries 5 primary filters X 2 20 records written in the database for task attempt finish = QUESTION: = Is there any data we are publishing to the timeline server that is not in the UI?
Do we use all the entities (TEZ_CONTAINER_ID for example)? Do we use all the primary filters? Do we use all the related entities specified? Are there any fields we don't use? Are there other approaches to consider to reduce entity count/size? Is there a way to store the same information in less space? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Issue Comment Deleted] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2485: - Comment: was deleted (was: Thanks for starting this [~jeagles]. \cc [~rajesh.balamohan] [~gopalv] as they will need to look at how it impacts the job analysers and [~Sreenath] [~pramachandran] for UI impact. ) Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent mapreduce job. Based on the storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it will be good if Tez can reduce its pressure on the timeline server by auditing both the number of events and size of events. Here are some observations based on my understanding of the design of timeline stores: Each timeline entity pushed explodes into many records in the database 1 marker record 1 domain record 1 record per event 2 records per related entity 2 records per primary filter (2 records per primary filter in RollingLevelDBTimelineStore, in leveldb it rewrites entire entity records per primary filter ) 1 record per other info For example Task Attempt Start 1 marker 1 domain 1 task attempt start event 1 related entity X 2 7 other info entries 4 primary filters X 2 20 records written in the database for task attempt start Task Attempt Finish 1 marker 1 domain 1 task attempt finish event 1 related entity X 2 5 other info entries 5 primary filters X 2 20 records written in the database for task attempt finish = QUESTION: = Is there any data we are publishing to the timeline server that is not in the UI? Do we use all the entities (TEZ_CONTAINER_ID for example)? Do we use all the primary filters?
Do we use all the related entities specified? Are there any fields we don't use? Are there other approaches to consider to reduce entity count/size? Is there a way to store the same information in less space? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2481) Tez UI: graphical view does not render properly on IE11
[ https://issues.apache.org/jira/browse/TEZ-2481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prakash Ramachandran updated TEZ-2481: -- Summary: Tez UI: graphical view does not render properly on IE11 (was: Tez UI: IE11 - graphical view renders incorrectly) Tez UI: graphical view does not render properly on IE11 --- Key: TEZ-2481 URL: https://issues.apache.org/jira/browse/TEZ-2481 Project: Apache Tez Issue Type: Bug Reporter: Sreenath Somarajapuram Assignee: Sreenath Somarajapuram Attachments: Screen-Shot-2015-05-25-at-4.02.46-PM.jpg, TEZ-2481.1.patch, TEZ-2481.2.patch, TEZ-2481.3.patch The issue was caused by IE's poor/broken support for css in SVG. # IE doesn't support transform in css like other browsers. This caused the bubbles in a vertex to appear at the origin - https://connect.microsoft.com/IE/feedbackdetail/view/920928 # IE has broken support for the marker (arrow on the path). This was causing the links/paths to disappear - https://connect.microsoft.com/IE/feedback/details/801938 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2483) Tez should close task if processor fail
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2483: - Fix Version/s: (was: 0.7.1) Tez should close task if processor fail --- Key: TEZ-2483 URL: https://issues.apache.org/jira/browse/TEZ-2483 Project: Apache Tez Issue Type: Bug Reporter: Daniel Dai Assignee: Daniel Dai Attachments: TEZ-2483-1.patch, TEZ-2483-2.patch The symptom is that if PigProcessor fails, MRInput is not closed. On Windows, this creates a problem since the Pig client cannot remove the input file. In general, if a task fails, Tez should close all input/output handles in cleanup. MROutput is closed in MROutput.abort(), which Pig invokes explicitly right now. Attaching a demo patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2483) Tez should close task if processor fail
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2483: - Assignee: Daniel Dai Tez should close task if processor fail --- Key: TEZ-2483 URL: https://issues.apache.org/jira/browse/TEZ-2483 Project: Apache Tez Issue Type: Bug Reporter: Daniel Dai Assignee: Daniel Dai Attachments: TEZ-2483-1.patch, TEZ-2483-2.patch The symptom is that if PigProcessor fails, MRInput is not closed. On Windows, this creates a problem since the Pig client cannot remove the input file. In general, if a task fails, Tez should close all input/output handles in cleanup. MROutput is closed in MROutput.abort(), which Pig invokes explicitly right now. Attaching a demo patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2483) Tez should close task if processor fail
[ https://issues.apache.org/jira/browse/TEZ-2483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2483: - Target Version/s: 0.6.2, 0.7.1 Tez should close task if processor fail --- Key: TEZ-2483 URL: https://issues.apache.org/jira/browse/TEZ-2483 Project: Apache Tez Issue Type: Bug Reporter: Daniel Dai Assignee: Daniel Dai Attachments: TEZ-2483-1.patch, TEZ-2483-2.patch The symptom is that if PigProcessor fails, MRInput is not closed. On Windows, this creates a problem since the Pig client cannot remove the input file. In general, if a task fails, Tez should close all input/output handles in cleanup. MROutput is closed in MROutput.abort(), which Pig invokes explicitly right now. Attaching a demo patch. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
[ https://issues.apache.org/jira/browse/TEZ-2488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2488: - Attachment: applogs.txt Tez AM crashes if a submitted DAG is configured to use invalid resource sizes. --- Key: TEZ-2488 URL: https://issues.apache.org/jira/browse/TEZ-2488 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Priority: Critical Attachments: applogs.txt 2015-05-26 21:54:03,485 ERROR [AMRM Heartbeater thread] impl.AMRMClientAsyncImpl: Exception on heartbeat org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory 0, or requested memory max configured, requestedMemory=682, maxMemory=512 at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234) at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98) at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505) at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationMasterProtocolPBServiceImpl.allocate(ApplicationMasterProtocolPBServiceImpl.java:60) at org.apache.hadoop.yarn.proto.ApplicationMasterProtocol$ApplicationMasterProtocolService$2.callBlockingMethod(ApplicationMasterProtocol.java:99) at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045) at java.security.AccessController.doPrivileged(Native Method) at 
javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2043) at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) at java.lang.reflect.Constructor.newInstance(Constructor.java:422) at org.apache.hadoop.yarn.ipc.RPCUtil.instantiateException(RPCUtil.java:53) at org.apache.hadoop.yarn.ipc.RPCUtil.unwrapAndThrowException(RPCUtil.java:101) at org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:79) at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 2015-05-26 21:54:03,495 INFO [Dispatcher thread: Central] app.DAGAppMaster: Error in the TaskScheduler. Shutting down. 
org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory 0, or requested memory max configured, requestedMemory=682, maxMemory=512 at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234) at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98) at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505) at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationMasterProtocolPBServiceImpl.allocate(ApplicationMasterProtocolPBServiceImpl.java:60) at org.apache.hadoop.yarn.proto.ApplicationMasterProtocol$ApplicationMasterProtocolService$2.callBlockingMethod(ApplicationMasterProtocol.java:99) at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045) at
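The failure above boils down to a simple validity check: YARN's SchedulerUtils rejects an allocate() call whose requested memory is non-positive or exceeds yarn.scheduler.maximum-allocation-mb, and the AM currently treats that rejection as fatal. A client-side pre-check might look like the following sketch (illustrative, not Tez or YARN code):

```java
public class ResourceCheckSketch {

    /** Mirrors the check YARN's SchedulerUtils applies on allocate():
     *  requested memory must be positive and no larger than
     *  yarn.scheduler.maximum-allocation-mb. */
    static boolean isValidTaskMemory(int requestedMb, int clusterMaxMb) {
        return requestedMb > 0 && requestedMb <= clusterMaxMb;
    }

    public static void main(String[] args) {
        // The values from the log above: 682 MB requested, 512 MB maximum.
        System.out.println(isValidTaskMemory(682, 512)); // false -> RM rejects the request
        System.out.println(isValidTaskMemory(512, 512)); // true
    }
}
```

Running such a check at DAG submission time would let the client fail a single misconfigured DAG instead of crashing the whole AM.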
[jira] [Updated] (TEZ-2489) Disable warn log for Timeline error when tez.allow.disabled.timeline-domains set to true
[ https://issues.apache.org/jira/browse/TEZ-2489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2489: - Description: 15/05/26 22:57:38 WARN client.TezClient: Could not instantiate object for org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager. ACLs cannot be enforced correctly for history data in Timeline org.apache.tez.dag.api.TezUncheckedException: Unable to load class: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at org.apache.tez.common.ReflectionUtils.getClazz(ReflectionUtils.java:45) at org.apache.tez.common.ReflectionUtils.createClazzInstance(ReflectionUtils.java:88) at org.apache.tez.client.TezClient.start(TezClient.java:317) at cascading.flow.tez.planner.Hadoop2TezFlowStepJob.internalNonBlockingStart(Hadoop2TezFlowStepJob.java:137) at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:248) at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:172) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:134) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:45) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.ClassNotFoundException: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at java.net.URLClassLoader$1.run(URLClassLoader.java:366) at java.net.URLClassLoader$1.run(URLClassLoader.java:355) Reported by @chris wensel was: 15/05/26 22:57:38 WARN client.TezClient: Could not instantiate object for org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager. 
ACLs cannot be enforced correctly for history data in Timeline org.apache.tez.dag.api.TezUncheckedException: Unable to load class: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at org.apache.tez.common.ReflectionUtils.getClazz(ReflectionUtils.java:45) at org.apache.tez.common.ReflectionUtils.createClazzInstance(ReflectionUtils.java:88) at org.apache.tez.client.TezClient.start(TezClient.java:317) at cascading.flow.tez.planner.Hadoop2TezFlowStepJob.internalNonBlockingStart(Hadoop2TezFlowStepJob.java:137) at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:248) at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:172) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:134) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:45) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.ClassNotFoundException: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at java.net.URLClassLoader$1.run(URLClassLoader.java:366) at java.net.URLClassLoader$1.run(URLClassLoader.java:355) Disable warn log for Timeline error when tez.allow.disabled.timeline-domains set to true - Key: TEZ-2489 URL: https://issues.apache.org/jira/browse/TEZ-2489 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah 15/05/26 22:57:38 WARN client.TezClient: Could not instantiate object for org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager. 
ACLs cannot be enforced correctly for history data in Timeline org.apache.tez.dag.api.TezUncheckedException: Unable to load class: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at org.apache.tez.common.ReflectionUtils.getClazz(ReflectionUtils.java:45) at org.apache.tez.common.ReflectionUtils.createClazzInstance(ReflectionUtils.java:88) at org.apache.tez.client.TezClient.start(TezClient.java:317) at cascading.flow.tez.planner.Hadoop2TezFlowStepJob.internalNonBlockingStart(Hadoop2TezFlowStepJob.java:137) at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:248) at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:172) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:134) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:45) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by:
[jira] [Updated] (TEZ-2489) Disable warn log for Timeline ACL error when tez.allow.disabled.timeline-domains set to true
[ https://issues.apache.org/jira/browse/TEZ-2489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hitesh Shah updated TEZ-2489: - Summary: Disable warn log for Timeline ACL error when tez.allow.disabled.timeline-domains set to true (was: Disable warn log for Timeline error when tez.allow.disabled.timeline-domains set to true ) Disable warn log for Timeline ACL error when tez.allow.disabled.timeline-domains set to true - Key: TEZ-2489 URL: https://issues.apache.org/jira/browse/TEZ-2489 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah 15/05/26 22:57:38 WARN client.TezClient: Could not instantiate object for org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager. ACLs cannot be enforced correctly for history data in Timeline org.apache.tez.dag.api.TezUncheckedException: Unable to load class: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at org.apache.tez.common.ReflectionUtils.getClazz(ReflectionUtils.java:45) at org.apache.tez.common.ReflectionUtils.createClazzInstance(ReflectionUtils.java:88) at org.apache.tez.client.TezClient.start(TezClient.java:317) at cascading.flow.tez.planner.Hadoop2TezFlowStepJob.internalNonBlockingStart(Hadoop2TezFlowStepJob.java:137) at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:248) at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:172) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:134) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:45) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.ClassNotFoundException: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at java.net.URLClassLoader$1.run(URLClassLoader.java:366) at java.net.URLClassLoader$1.run(URLClassLoader.java:355) Reported by 
@chris wensel -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TEZ-2488) Tez AM crashes if a submitted DAG is configured to use invalid resource sizes.
Hitesh Shah created TEZ-2488: Summary: Tez AM crashes if a submitted DAG is configured to use invalid resource sizes. Key: TEZ-2488 URL: https://issues.apache.org/jira/browse/TEZ-2488 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah Priority: Critical 2015-05-26 21:54:03,485 ERROR [AMRM Heartbeater thread] impl.AMRMClientAsyncImpl: Exception on heartbeat org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory < 0, or requested memory > max configured, requestedMemory=682, maxMemory=512 at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234) at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98) at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505) at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationMasterProtocolPBServiceImpl.allocate(ApplicationMasterProtocolPBServiceImpl.java:60) at org.apache.hadoop.yarn.proto.ApplicationMasterProtocol$ApplicationMasterProtocolService$2.callBlockingMethod(ApplicationMasterProtocol.java:99) at org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) at org.apache.hadoop.ipc.Server$Handler.run(Server.java:2043)
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method) at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62) at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45) at java.lang.reflect.Constructor.newInstance(Constructor.java:422) at org.apache.hadoop.yarn.ipc.RPCUtil.instantiateException(RPCUtil.java:53) at org.apache.hadoop.yarn.ipc.RPCUtil.unwrapAndThrowException(RPCUtil.java:101) at org.apache.hadoop.yarn.api.impl.pb.client.ApplicationMasterProtocolPBClientImpl.allocate(ApplicationMasterProtocolPBClientImpl.java:79) at sun.reflect.GeneratedMethodAccessor3.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) 2015-05-26 21:54:03,495 INFO [Dispatcher thread: Central] app.DAGAppMaster: Error in the TaskScheduler. Shutting down. org.apache.hadoop.yarn.exceptions.InvalidResourceRequestException: Invalid resource request, requested memory < 0, or requested memory > max configured, requestedMemory=682, maxMemory=512 at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.validateResourceRequest(SchedulerUtils.java:249) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndValidateRequest(SchedulerUtils.java:226) at org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerUtils.normalizeAndvalidateRequest(SchedulerUtils.java:234) at org.apache.hadoop.yarn.server.resourcemanager.RMServerUtils.normalizeAndValidateRequests(RMServerUtils.java:98) at org.apache.hadoop.yarn.server.resourcemanager.ApplicationMasterService.allocate(ApplicationMasterService.java:505) at org.apache.hadoop.yarn.api.impl.pb.service.ApplicationMasterProtocolPBServiceImpl.allocate(ApplicationMasterProtocolPBServiceImpl.java:60) at org.apache.hadoop.yarn.proto.ApplicationMasterProtocol$ApplicationMasterProtocolService$2.callBlockingMethod(ApplicationMasterProtocol.java:99) at
org.apache.hadoop.ipc.ProtobufRpcEngine$Server$ProtoBufRpcInvoker.call(ProtobufRpcEngine.java:616) at org.apache.hadoop.ipc.RPC$Server.call(RPC.java:969) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2049) at org.apache.hadoop.ipc.Server$Handler$1.run(Server.java:2045) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1657) at
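The failure mode above can be restated as a simple bounds check: the ResourceManager rejects a memory request that falls outside [0, maximum allocation], and here the DAG asked for 682 MB against a 512 MB cluster maximum (yarn.scheduler.maximum-allocation-mb). A minimal sketch of that check follows; the class and method names are illustrative, not YARN's actual API.

```java
// Illustrative re-statement of the validation behind the
// InvalidResourceRequestException in the report: a memory request is
// rejected when it is < 0 or > the scheduler's configured maximum.
// Names here are hypothetical, not YARN's SchedulerUtils API.
public class ResourceRequestCheck {
    /** Returns true when the request fits the scheduler limits. */
    public static boolean isValid(long requestedMemoryMb, long maxAllocationMb) {
        return requestedMemoryMb >= 0 && requestedMemoryMb <= maxAllocationMb;
    }
}
```

In this report the 682 > 512 case fails, so every AMRM heartbeat's allocate() call throws and the AM shuts down instead of rejecting the DAG up front.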
[jira] [Updated] (TEZ-2440) Sorter should check for indexCacheList.size() in flush()
[ https://issues.apache.org/jira/browse/TEZ-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mit Desai updated TEZ-2440: --- Attachment: TEZ-2440-1.patch [~rajesh.balamohan], can you take a look at the patch? Sorter should check for indexCacheList.size() in flush() Key: TEZ-2440 URL: https://issues.apache.org/jira/browse/TEZ-2440 Project: Apache Tez Issue Type: Bug Reporter: Rajesh Balamohan Assignee: Mit Desai Attachments: TEZ-2440-1.patch {noformat} 2015-05-11 20:28:20,225 INFO [main] task.TezTaskRunner: Shutdown requested... returning 2015-05-11 20:28:20,225 INFO [main] task.TezChild: Got a shouldDie notification via hearbeats. Shutting down 2015-05-11 20:28:20,231 INFO [TezChild] impl.PipelinedSorter: Thread interrupted, cleaned up stale data, sorter threads shutdown=true, terminated=false 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Joining on EventRouter 2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Ignoring interrupt while waiting for the router thread to die 2015-05-11 20:28:20,232 INFO [TezChild] task.TezTaskRunner: Encounted an error while executing task: attempt_1429683757595_0875_1_07_00_0 java.lang.ArrayIndexOutOfBoundsException: -1 at java.util.ArrayList.elementData(ArrayList.java:418) at java.util.ArrayList.get(ArrayList.java:431) at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.flush(PipelinedSorter.java:462) at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.close(OrderedPartitionedKVOutput.java:183) at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.close(LogicalIOProcessorRuntimeTask.java:360) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:181) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171) at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167) at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) at java.util.concurrent.FutureTask.run(FutureTask.java:266) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) {noformat} When a DAG is killed in the middle, sometimes these exceptions are thrown (e.g. q_17 in TPC-DS). Even though it is completely harmless, it would be better to fix it to avoid distraction when debugging. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
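The fix the issue summary asks for is a size check before indexing: flush() indexes indexCacheList after an interrupt may have already cleared it, producing the ArrayIndexOutOfBoundsException: -1 above. A minimal sketch of the guard follows; this is an assumed shape (names and return type are illustrative), not the attached TEZ-2440-1.patch.

```java
import java.util.List;

// Sketch of the guard TEZ-2440 proposes: check indexCacheList.size()
// before indexing. When an interrupt cleans up the sorter's state before
// any spill completes, the list is empty and size() - 1 is -1, which is
// what blows up in PipelinedSorter.flush(). Names here are illustrative.
public class IndexCacheGuard {
    /** Returns the last spill's index record, or null when nothing was spilled. */
    static Integer lastSpillIndex(List<Integer> indexCacheList) {
        if (indexCacheList.isEmpty()) {
            return null; // nothing spilled; nothing to flush
        }
        return indexCacheList.get(indexCacheList.size() - 1);
    }
}
```

With the guard, a shutdown mid-DAG simply skips the flush step instead of throwing a distracting (if harmless) exception into the task log.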
[jira] [Comment Edited] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560070#comment-14560070 ] Jonathan Eagles edited comment on TEZ-2485 at 5/26/15 11:07 PM: Posted the data storage breakdown by entity type and by column type. The database in the instance was approximately 315MB on disk. LevelDB uses Snappy compression, so the expanded key/value breakdown is 508MB/710MB respectively. Another thing to consider is the key overhead per record. Keys are of the form |Entity Type|8 bytes for timestamp| Entity Id | column specific data|. To calculate the amount of space utilized by a type, multiply the type length by the count. The majority of this data was generated using Pig. was (Author: jeagles): Posted the data storage breakdown by entity type and by column type. The database in the instance was approximately 315MB on disk. LevelDB uses Snappy compression, so the expanded key/value breakdown is 508MB/710MB respectively. Another thing to consider is the key overhead per record. Keys are of the form |Entity Type|8 bytes for timestamp| Entity Id | column specific data|. To calculate the amount of space utilized by a type, multiply the type length by the count. Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent MapReduce job. Based on storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it would be good if Tez could reduce its pressure on the timeline server by auditing both the number of events and the size of events. 
Here are some observations based on my understanding of the design of timeline stores. Each timeline entity pushed explodes into many records in the database:
- 1 marker record
- 1 domain record
- 1 record per event
- 2 records per related entity
- 2 records per primary filter (in RollingLevelDBTimelineStore; plain leveldb rewrites the entire entity record per primary filter)
- 1 record per other-info entry
For example, Task Attempt Start: 1 marker + 1 domain + 1 task attempt start event + 1 related entity x 2 + 7 other info entries + 4 primary filters x 2 = 20 records written to the database per task attempt start. Task Attempt Finish: 1 marker + 1 domain + 1 task attempt finish event + 1 related entity x 2 + 5 other info entries + 5 primary filters x 2 = 20 records written to the database per task attempt finish. QUESTIONS:
- Is there any data we are publishing to the timeline server that is not in the UI?
- Do we use all the entities (TEZ_CONTAINER_ID, for example)?
- Do we use all the primary filters?
- Do we use all the related entities specified?
- Are there any fields we don't use?
- Are there other approaches to consider to reduce entity count/size?
- Is there a way to store the same information in less space?
=== Key Value Breakdown ||Count||Key Size||Value Size|| |5642512|533690380|745454867| Entity Type Breakdown ||Type||Count||Key Size||Value Size|| |TEZ_CONTAINER_ID|843850|86244392|5654341| |applicationAttemptId|544|53248|6174| |applicationId|544|44412|6174| |TEZ_TASK_ATTEMPT_ID|2471393|239523553|373637209| |TEZ_APPLICATION|1048|84312|13057630| |containerId|362443|37013813|4135845| |TEZ_VERTEX_ID|99239|10387114|1559948| |TEZ_DAG_ID|5402|387705|2910830| |TEZ_TASK_ID|1762211|146210017|344478400| |TEZ_APPLICATION_ATTEMPT|95838|13741814|8316| Column Breakdown ||Column||Count||Key Size||Value Size|| |primarykeys|1092413|118768299|0| |marker|373515|25740507|2988120| |events|578196|55148482|1156392| |domain|373515|26114022|15314115| |reverserelated|587815|73721347|0| |otherinfo|2143751|170983893|725996240| |related|493307|63213830|0| -- This message was sent by Atlassian JIRA (v6.3.4#6332)
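The "20 records per task-attempt event" arithmetic in the description above can be checked mechanically. A minimal sketch under the stated store layout (1 marker + 1 domain + 1 record per event + 2 per related entity + 1 per other-info entry + 2 per primary filter); the helper name is illustrative, not a Tez or YARN API.

```java
// Record-count arithmetic for one timeline entity write, per the store
// layout described in TEZ-2485. The class/method are illustrative only.
public class TimelineRecordCount {
    public static int records(int events, int relatedEntities,
                              int otherInfoEntries, int primaryFilters) {
        return 1               // marker record
             + 1               // domain record
             + events          // 1 record per event
             + 2 * relatedEntities
             + otherInfoEntries
             + 2 * primaryFilters;
    }
}
```

Task attempt start (1 event, 1 related entity, 7 other-info entries, 4 primary filters) and finish (1 event, 1 related entity, 5 other-info entries, 5 primary filters) each come to 20 records, matching the description.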
[jira] [Commented] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14560070#comment-14560070 ] Jonathan Eagles commented on TEZ-2485: -- Posted the data storage breakdown by entity type and by column type. The database in the instance was approximately 315MB on disk. LevelDB uses Snappy compression, so the expanded key/value breakdown is 508MB/710MB respectively. Another thing to consider is the key overhead per record. Keys are of the form |Entity Type|8 bytes for timestamp| Entity Id | column specific data|. To calculate the amount of space utilized by a type, multiply the type length by the count. Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent MapReduce job. Based on storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it would be good if Tez could reduce its pressure on the timeline server by auditing both the number of events and the size of events. 
Here are some observations based on my understanding of the design of timeline stores. Each timeline entity pushed explodes into many records in the database:
- 1 marker record
- 1 domain record
- 1 record per event
- 2 records per related entity
- 2 records per primary filter (in RollingLevelDBTimelineStore; plain leveldb rewrites the entire entity record per primary filter)
- 1 record per other-info entry
For example, Task Attempt Start: 1 marker + 1 domain + 1 task attempt start event + 1 related entity x 2 + 7 other info entries + 4 primary filters x 2 = 20 records written to the database per task attempt start. Task Attempt Finish: 1 marker + 1 domain + 1 task attempt finish event + 1 related entity x 2 + 5 other info entries + 5 primary filters x 2 = 20 records written to the database per task attempt finish. QUESTIONS:
- Is there any data we are publishing to the timeline server that is not in the UI?
- Do we use all the entities (TEZ_CONTAINER_ID, for example)?
- Do we use all the primary filters?
- Do we use all the related entities specified?
- Are there any fields we don't use?
- Are there other approaches to consider to reduce entity count/size?
- Is there a way to store the same information in less space?
=== Key Value Breakdown ||Count||Key Size||Value Size|| |5642512|533690380|745454867| Entity Type Breakdown ||Type||Count||Key Size||Value Size|| |TEZ_CONTAINER_ID|843850|86244392|5654341| |applicationAttemptId|544|53248|6174| |applicationId|544|44412|6174| |TEZ_TASK_ATTEMPT_ID|2471393|239523553|373637209| |TEZ_APPLICATION|1048|84312|13057630| |containerId|362443|37013813|4135845| |TEZ_VERTEX_ID|99239|10387114|1559948| |TEZ_DAG_ID|5402|387705|2910830| |TEZ_TASK_ID|1762211|146210017|344478400| |TEZ_APPLICATION_ATTEMPT|95838|13741814|8316| Column Breakdown ||Column||Count||Key Size||Value Size|| |primarykeys|1092413|118768299|0| |marker|373515|25740507|2988120| |events|578196|55148482|1156392| |domain|373515|26114022|15314115| |reverserelated|587815|73721347|0| |otherinfo|2143751|170983893|725996240| |related|493307|63213830|0| -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (TEZ-2489) Disable warn log for Timeline error when tez.allow.disabled.timeline-domains set to true
Hitesh Shah created TEZ-2489: Summary: Disable warn log for Timeline error when tez.allow.disabled.timeline-domains set to true Key: TEZ-2489 URL: https://issues.apache.org/jira/browse/TEZ-2489 Project: Apache Tez Issue Type: Bug Reporter: Hitesh Shah 15/05/26 22:57:38 WARN client.TezClient: Could not instantiate object for org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager. ACLs cannot be enforced correctly for history data in Timeline org.apache.tez.dag.api.TezUncheckedException: Unable to load class: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at org.apache.tez.common.ReflectionUtils.getClazz(ReflectionUtils.java:45) at org.apache.tez.common.ReflectionUtils.createClazzInstance(ReflectionUtils.java:88) at org.apache.tez.client.TezClient.start(TezClient.java:317) at cascading.flow.tez.planner.Hadoop2TezFlowStepJob.internalNonBlockingStart(Hadoop2TezFlowStepJob.java:137) at cascading.flow.planner.FlowStepJob.blockOnJob(FlowStepJob.java:248) at cascading.flow.planner.FlowStepJob.start(FlowStepJob.java:172) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:134) at cascading.flow.planner.FlowStepJob.call(FlowStepJob.java:45) at java.util.concurrent.FutureTask.run(FutureTask.java:262) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.ClassNotFoundException: org.apache.tez.dag.history.ats.acls.ATSHistoryACLPolicyManager at java.net.URLClassLoader$1.run(URLClassLoader.java:366) at java.net.URLClassLoader$1.run(URLClassLoader.java:355) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
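The behaviour this issue asks for can be sketched as a conditional on the opt-out flag: when the user has explicitly set tez.allow.disabled.timeline-domains=true, the missing ATSHistoryACLPolicyManager is expected and the WARN is just noise. The class and method below are hypothetical illustrations, not the actual Tez patch.

```java
// Hypothetical sketch of the TEZ-2489 request: suppress the WARN about a
// missing ATSHistoryACLPolicyManager when the user has opted out of
// Timeline domain enforcement. Names are illustrative, not Tez's API.
public class TimelineWarnPolicy {
    /** Warn only when ACL enforcement was actually expected. */
    public static boolean shouldWarnOnMissingAclManager(boolean allowDisabledDomains) {
        // tez.allow.disabled.timeline-domains=true means the operator
        // accepts running without Timeline ACLs, so stay quiet.
        return !allowDisabledDomains;
    }
}
```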
[jira] [Updated] (TEZ-2485) Reduce the Resource Load on the Timeline Server
[ https://issues.apache.org/jira/browse/TEZ-2485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Eagles updated TEZ-2485: - Description: The disk, network, and memory resources needed by the timeline server are many times higher than those needed for the equivalent MapReduce job. Based on storage improvements in YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it would be good if Tez could reduce its pressure on the timeline server by auditing both the number of events and the size of events. Here are some observations based on my understanding of the design of timeline stores. Each timeline entity pushed explodes into many records in the database:
- 1 marker record
- 1 domain record
- 1 record per event
- 2 records per related entity
- 2 records per primary filter (in RollingLevelDBTimelineStore; plain leveldb rewrites the entire entity record per primary filter)
- 1 record per other-info entry
For example, Task Attempt Start: 1 marker + 1 domain + 1 task attempt start event + 1 related entity x 2 + 7 other info entries + 4 primary filters x 2 = 20 records written to the database per task attempt start. Task Attempt Finish: 1 marker + 1 domain + 1 task attempt finish event + 1 related entity x 2 + 5 other info entries + 5 primary filters x 2 = 20 records written to the database per task attempt finish. QUESTIONS:
- Is there any data we are publishing to the timeline server that is not in the UI?
- Do we use all the entities (TEZ_CONTAINER_ID, for example)?
- Do we use all the primary filters?
- Do we use all the related entities specified?
- Are there any fields we don't use?
- Are there other approaches to consider to reduce entity count/size?
- Is there a way to store the same information in less space?
=== Key Value Breakdown ||Count||Key Size||Value Size|| |5642512|533690380|745454867| Entity Type Breakdown ||Type||Count||Key Size||Value Size|| |TEZ_CONTAINER_ID|843850|86244392|5654341| |applicationAttemptId|544|53248|6174| |applicationId|544|44412|6174| |TEZ_TASK_ATTEMPT_ID|2471393|239523553|373637209| |TEZ_APPLICATION|1048|84312|13057630| |containerId|362443|37013813|4135845| |TEZ_VERTEX_ID|99239|10387114|1559948| |TEZ_DAG_ID|5402|387705|2910830| |TEZ_TASK_ID|1762211|146210017|344478400| |TEZ_APPLICATION_ATTEMPT|95838|13741814|8316| Column Breakdown ||Column||Count||Key Size||Value Size|| |primarykeys|1092413|118768299|0| |marker|373515|25740507|2988120| |events|578196|55148482|1156392| |domain|373515|26114022|15314115| |reverserelated|587815|73721347|0| |otherinfo|2143751|170983893|725996240| |related|493307|63213830|0| was: The disk, network, and memory resources needed by the timeline server are are many times higher than the need for the equivalent mapreduce job. Based on storage improvents YARN-3448, the timeline server may support up to 30,000 jobs / 10,000,000 tasks a day. While I understand there is community effort on timeline server v2, it will be good if Tez can reduce its pressure on the timeline server by auditing both the number of events and size of events. 
Here are some observations based on my understanding of the design of timeline stores: Each timeline entity pushed explodes into many records in the database 1 marker record 1 domain record 1 record per event 2 records per related entity 2 records per primary filter (2 record per primary filter in RollingLevelDBTimelineStore, in leveldb it rewrites entire entity records per primary filter ) 1 record per other info For example Task Attempt Start 1 marker 1 domain 1 task attempt start event 1 related entity X 2 7 other info entries 4 primary filters X 2 20 records written in the database for task attempt start Task Attempt Finish 1 marker 1 domain 1 task attempt start event 1 related entity X 2 5 other info entries 5 primary filters X 2 20 records written in the database for task attempt finish = QUESTION: = Is there any data we are publishing to the timeline server that is not in the UI? Do we use all the entities (TEZ_CONTAINER_ID for example) Do we use all the primary filters? Do we use all the related entities specified? Are there any fields we don't use? Are there other approaches to consider to reduce entity count/size? Is there a way to store the same information in less space? Reduce the Resource Load on the Timeline Server --- Key: TEZ-2485 URL: https://issues.apache.org/jira/browse/TEZ-2485 Project: Apache Tez Issue Type: Improvement Reporter: Jonathan Eagles The disk, network, and memory resources needed by the
Failed: TEZ-2440 PreCommit Build #741
Jira: https://issues.apache.org/jira/browse/TEZ-2440 Build: https://builds.apache.org/job/PreCommit-TEZ-Build/741/ ### ## LAST 60 LINES OF THE CONSOLE ### [...truncated 19 lines...] No emails were triggered. [PreCommit-TEZ-Build] $ /bin/bash /tmp/hudson4691266686886385787.sh Running in Jenkins mode == == Testing patch for TEZ-2440. == == HEAD is now at 7be325e TEZ-2481. Tez UI: graphical view does not render properly on IE11 (Sreenath Somarajapuram via pramachandran) Switched to branch 'master' Your branch is up-to-date with 'origin/master'. Current branch master is up to date. TEZ-2440 patch is being downloaded at Tue May 26 22:22:30 UTC 2015 from http://issues.apache.org/jira/secure/attachment/12735421/TEZ-2440-1.patch The patch does not appear to apply with p0 to p2 PATCH APPLICATION FAILED {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12735421/TEZ-2440-1.patch against master revision 7be325e. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/741//console This message is automatically generated. == == Adding comment to Jira. == == Comment added. fd483760be6fe714a044e2ac25a35f6a8baa79b8 logged out == == Finished build. == == Build step 'Execute shell' marked build as failure Archiving artifacts [description-setter] Could not determine description. Recording test results Email was triggered for: Failure Sending email for trigger: Failure ### ## FAILED TESTS (if any) ## No tests ran.
[jira] [Commented] (TEZ-2440) Sorter should check for indexCacheList.size() in flush()
[ https://issues.apache.org/jira/browse/TEZ-2440?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559991#comment-14559991 ]

TezQA commented on TEZ-2440:
----------------------------

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12735421/TEZ-2440-1.patch
against master revision 7be325e.

{color:red}-1 patch{color}. The patch command could not apply the patch.

Console output: https://builds.apache.org/job/PreCommit-TEZ-Build/741//console

This message is automatically generated.

Sorter should check for indexCacheList.size() in flush()
--------------------------------------------------------

                 Key: TEZ-2440
                 URL: https://issues.apache.org/jira/browse/TEZ-2440
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Rajesh Balamohan
            Assignee: Mit Desai
         Attachments: TEZ-2440-1.patch

{noformat}
2015-05-11 20:28:20,225 INFO [main] task.TezTaskRunner: Shutdown requested... returning
2015-05-11 20:28:20,225 INFO [main] task.TezChild: Got a shouldDie notification via hearbeats. Shutting down
2015-05-11 20:28:20,231 INFO [TezChild] impl.PipelinedSorter: Thread interrupted, cleaned up stale data, sorter threads shutdown=true, terminated=false
2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Joining on EventRouter
2015-05-11 20:28:20,231 INFO [TezChild] runtime.LogicalIOProcessorRuntimeTask: Ignoring interrupt while waiting for the router thread to die
2015-05-11 20:28:20,232 INFO [TezChild] task.TezTaskRunner: Encounted an error while executing task: attempt_1429683757595_0875_1_07_00_0
java.lang.ArrayIndexOutOfBoundsException: -1
	at java.util.ArrayList.elementData(ArrayList.java:418)
	at java.util.ArrayList.get(ArrayList.java:431)
	at org.apache.tez.runtime.library.common.sort.impl.PipelinedSorter.flush(PipelinedSorter.java:462)
	at org.apache.tez.runtime.library.output.OrderedPartitionedKVOutput.close(OrderedPartitionedKVOutput.java:183)
	at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.close(LogicalIOProcessorRuntimeTask.java:360)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:181)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171)
	at java.security.AccessController.doPrivileged(Native Method)
	at javax.security.auth.Subject.doAs(Subject.java:422)
	at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171)
	at org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167)
	at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
	at java.util.concurrent.FutureTask.run(FutureTask.java:266)
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
{noformat}

When a DAG is killed in the middle of a run, these exceptions are sometimes thrown (e.g. q_17 in TPC-DS). Even though the error is completely harmless, it would be better to fix it to avoid distraction when debugging.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
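The check the ticket title asks for can be sketched as a simple empty-list guard before the last-index access in flush(). This is only a hedged illustration of the idea, not the contents of TEZ-2440-1.patch: the class below is a simplified stand-in for PipelinedSorter, and its field and return values are invented for the sketch.

```java
import java.util.ArrayList;
import java.util.List;

// Simplified stand-in for PipelinedSorter (names and shapes are illustrative only).
// The stack trace above shows flush() indexing with -1: when a DAG is killed before
// any spill is recorded, indexCacheList is empty and get(size() - 1) throws
// ArrayIndexOutOfBoundsException.
class SorterSketch {
    final List<String> indexCacheList = new ArrayList<>(); // spill index records (simplified)

    String flush() {
        // The guard the ticket proposes: nothing was spilled (e.g. the task was
        // interrupted by a shouldDie notification), so there is nothing to merge.
        if (indexCacheList.isEmpty()) {
            return "nothing-to-flush";
        }
        // Safe now: size() >= 1, so the last-index access cannot go negative.
        return indexCacheList.get(indexCacheList.size() - 1);
    }
}
```

With the guard in place, an interrupted task simply skips the merge instead of surfacing a harmless but distracting exception in the logs.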
[jira] [Comment Edited] (TEZ-2484) Tez vertex for Hive fails but Resource Manager reports job succeeded
[ https://issues.apache.org/jira/browse/TEZ-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559463#comment-14559463 ]

Hitesh Shah edited comment on TEZ-2484 at 5/26/15 5:35 PM:
-----------------------------------------------------------

This is related to how Hive uses Tez sessions. There is no 1:1 relationship between a YARN application and a Hive query (multiple queries can be run within a single YARN application), hence the application status cannot be mapped to the failure of one of the queries that ran within a given Tez application on YARN.

was (Author: hitesh): This is related to how Hive uses Tez sessions. There is no 1:1 relationship between a YARN application and a Hive query, hence the application status cannot be mapped to the failure of one of the queries that ran within a given Tez application on YARN.

Tez vertex for Hive fails but Resource Manager reports job succeeded
--------------------------------------------------------------------

                 Key: TEZ-2484
                 URL: https://issues.apache.org/jira/browse/TEZ-2484
             Project: Apache Tez
          Issue Type: Bug
    Affects Versions: 0.5.2
         Environment: HDP 2.2.4.2
            Reporter: Hari Sekhon
         Attachments: Tez_RM_misreporting_succeeded.png

When running a Hive on Tez job via the Hive CLI, the job fails with the error shown below, but the Resource Manager shows the job as Succeeded even though it has clearly failed:

{code}
Status: Running (Executing on YARN cluster with App id application_1432310690008_0103)

VERTICES      STATUS   TOTAL  COMPLETED  RUNNING  PENDING  FAILED  KILLED
Map 1         FAILED    1478          0        0     1478       1    1477

VERTICES: 00/01  [----------------------------] 0%  ELAPSED TIME: 1589.41 s

Status: Failed
Vertex failed, vertexName=Map 1, vertexId=vertex_1432310690008_0103_1_00, diagnostics=[Task failed, taskId=task_1432310690008_0103_1_00_00, diagnostics=[TaskAttempt 0 failed, info=[Container container_e122_1432310690008_0103_01_94 received a STOP_REQUEST]], Vertex failed as one or more tasks failed. failedTasks:1, Vertex vertex_1432310690008_0103_1_00 [Map 1] killed/failed due to:null]
DAG failed due to vertex failure. failedVertices:1 killedVertices:0
FAILED: Execution Error, return code 2 from org.apache.hadoop.hive.ql.exec.tez.TezTask
{code}
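The session semantics in the comment above can be illustrated with a toy model. Everything here is a hypothetical sketch, not a Tez API: one YARN application hosts many DAGs, so the application's final status reflects whether the session shut down cleanly, while per-query outcomes must be read from the individual DAGs (in real Tez, via something like DAGClient's status query rather than the RM).

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of Hive-on-Tez session mode (all names are hypothetical illustrations).
// Many queries (DAGs) run inside one long-lived YARN application, so a failed
// query does not fail the application the Resource Manager reports on.
class TezSessionModel {
    enum DagState { SUCCEEDED, FAILED }

    private final List<DagState> dags = new ArrayList<>();

    // Each Hive query becomes one DAG submitted to the shared session.
    void submitDag(DagState outcome) {
        dags.add(outcome);
    }

    // What the RM sees: only the application. A clean session shutdown reports
    // SUCCEEDED regardless of individual DAG outcomes -- exactly the mismatch
    // the reporter observed.
    String applicationFinalStatus() {
        return "SUCCEEDED";
    }

    // Per-query status lives at the DAG level, not the application level.
    DagState dagStatus(int index) {
        return dags.get(index);
    }
}
```

The model makes the "Invalid" resolution concrete: the RM screenshot and the CLI failure are describing two different objects, an application and one DAG inside it.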
[jira] [Commented] (TEZ-2484) Tez vertex for Hive fails but Resource Manager reports job succeeded
[ https://issues.apache.org/jira/browse/TEZ-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14559463#comment-14559463 ]

Hitesh Shah commented on TEZ-2484:
----------------------------------

This is related to how Hive uses Tez sessions. There is no 1:1 relationship between a YARN application and a Hive query, hence the application status cannot be mapped to the failure of one of the queries that ran within a given Tez application on YARN.
[jira] [Updated] (TEZ-2484) Tez vertex for Hive fails but Resource Manager reports job succeeded
[ https://issues.apache.org/jira/browse/TEZ-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hari Sekhon updated TEZ-2484:
-----------------------------
    Attachment: Tez_RM_misreporting_succeeded.png

Attaching a screenshot of the YARN Resource Manager line showing this Tez job being incorrectly reported as Succeeded despite the failure output in the user session.
[jira] [Resolved] (TEZ-2484) Tez vertex for Hive fails but Resource Manager reports job succeeded
[ https://issues.apache.org/jira/browse/TEZ-2484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Hitesh Shah resolved TEZ-2484.
------------------------------
    Resolution: Invalid
[jira] [Created] (TEZ-2484) Tez vertex for Hive fails but Resource Manager reports job succeeded
Hari Sekhon created TEZ-2484:
-----------------------------

             Summary: Tez vertex for Hive fails but Resource Manager reports job succeeded
                 Key: TEZ-2484
                 URL: https://issues.apache.org/jira/browse/TEZ-2484
             Project: Apache Tez
          Issue Type: Bug
    Affects Versions: 0.5.2
         Environment: HDP 2.2.4.2
            Reporter: Hari Sekhon