Hi Jacky,

Could you search for "Application Master start command:" in the debug log
and post the result and a few lines before & after that? This is not
included in the clip of attached log file.

Thank you~

Xintong Song



On Tue, May 12, 2020 at 5:33 AM Jacky D <jacky.du0...@gmail.com> wrote:

> hi, Robert
>
> Thanks so much for quick reply  , I changed the log level to debug  and
> attach the log file .
>
> Thanks
> Jacky
>
> Robert Metzger <rmetz...@apache.org> 于2020年5月11日周一 下午4:14写道:
>
>> Thanks a lot for posting the full output.
>>
>> It seems that Flink is passing an invalid list of arguments to the JVM.
>> Can you
>> - set the root log level in conf/log4j-yarn-session.properties to DEBUG
>> - then launch the YARN session
>> - share the log file of the yarn session on the mailing list?
>>
>> I'm particularly interested in the line printed here, as it shows the JVM
>> invocation.
>>
>> https://github.com/apache/flink/blob/release-1.6/flink-yarn/src/main/java/org/apache/flink/yarn/AbstractYarnClusterDescriptor.java#L1630
>>
>>
>> On Mon, May 11, 2020 at 9:56 PM Jacky D <jacky.du0...@gmail.com> wrote:
>>
>>> Hi,Robert
>>>
>>> Yes , I tried to retrieve more log info from yarn UI , the full logs
>>> showing below , this happens when I try to create a flink yarn session on
>>> emr when set up jitwatch configuration .
>>>
>>> 2020-05-11 19:06:09,552 ERROR
>>> org.apache.flink.yarn.cli.FlinkYarnSessionCli                 - Error while
>>> running the Flink Yarn session.
>>> java.lang.reflect.UndeclaredThrowableException
>>> at
>>> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1862)
>>> at
>>> org.apache.flink.runtime.security.HadoopSecurityContext.runSecured(HadoopSecurityContext.java:41)
>>> at
>>> org.apache.flink.yarn.cli.FlinkYarnSessionCli.main(FlinkYarnSessionCli.java:813)
>>> Caused by:
>>> org.apache.flink.client.deployment.ClusterDeploymentException: Couldn't
>>> deploy Yarn session cluster
>>> at
>>> org.apache.flink.yarn.AbstractYarnClusterDescriptor.deploySessionCluster(AbstractYarnClusterDescriptor.java:429)
>>> at
>>> org.apache.flink.yarn.cli.FlinkYarnSessionCli.run(FlinkYarnSessionCli.java:610)
>>> at
>>> org.apache.flink.yarn.cli.FlinkYarnSessionCli.lambda$main$2(FlinkYarnSessionCli.java:813)
>>> at java.security.AccessController.doPrivileged(Native Method)
>>> at javax.security.auth.Subject.doAs(Subject.java:422)
>>> at
>>> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1844)
>>> ... 2 more
>>> Caused by:
>>> org.apache.flink.yarn.AbstractYarnClusterDescriptor$YarnDeploymentException:
>>> The YARN application unexpectedly switched to state FAILED during
>>> deployment.
>>> Diagnostics from YARN: Application application_1584459865196_0165 failed
>>> 1 times (global limit =2; local limit is =1) due to AM Container for
>>> appattempt_1584459865196_0165_000001 exited with  exitCode: 1
>>> Failing this attempt.Diagnostics: Exception from container-launch.
>>> Container id: container_1584459865196_0165_01_000001
>>> Exit code: 1
>>> Exception message: Usage: java [-options] class [args...]
>>>            (to execute a class)
>>>    or  java [-options] -jar jarfile [args...]
>>>            (to execute a jar file)
>>> where options include:
>>>     -d32   use a 32-bit data model if available
>>>     -d64   use a 64-bit data model if available
>>>     -server   to select the "server" VM
>>>                   The default VM is server,
>>>                   because you are running on a server-class machine.
>>>
>>>
>>>     -cp <class search path of directories and zip/jar files>
>>>     -classpath <class search path of directories and zip/jar files>
>>>                   A : separated list of directories, JAR archives,
>>>                   and ZIP archives to search for class files.
>>>     -D<name>=<value>
>>>                   set a system property
>>>     -verbose:[class|gc|jni]
>>>                   enable verbose output
>>>     -version      print product version and exit
>>>     -version:<value>
>>>                   Warning: this feature is deprecated and will be removed
>>>                   in a future release.
>>>                   require the specified version to run
>>>     -showversion  print product version and continue
>>>     -jre-restrict-search | -no-jre-restrict-search
>>>                   Warning: this feature is deprecated and will be removed
>>>                   in a future release.
>>>                   include/exclude user private JREs in the version search
>>>     -? -help      print this help message
>>>     -X            print help on non-standard options
>>>     -ea[:<packagename>...|:<classname>]
>>>     -enableassertions[:<packagename>...|:<classname>]
>>>                   enable assertions with specified granularity
>>>     -da[:<packagename>...|:<classname>]
>>>     -disableassertions[:<packagename>...|:<classname>]
>>>                   disable assertions with specified granularity
>>>     -esa | -enablesystemassertions
>>>                   enable system assertions
>>>     -dsa | -disablesystemassertions
>>>                   disable system assertions
>>>     -agentlib:<libname>[=<options>]
>>>                   load native agent library <libname>, e.g.
>>> -agentlib:hprof
>>>                   see also, -agentlib:jdwp=help and -agentlib:hprof=help
>>>     -agentpath:<pathname>[=<options>]
>>>                   load native agent library by full pathname
>>>     -javaagent:<jarpath>[=<options>]
>>>                   load Java programming language agent, see
>>> java.lang.instrument
>>>     -splash:<imagepath>
>>>                   show splash screen with specified image
>>> See
>>> http://www.oracle.com/technetwork/java/javase/documentation/index.html
>>> for more details.
>>>
>>> Thanks
>>> Jacky
>>>
>>> Robert Metzger <rmetz...@apache.org> 于2020年5月11日周一 下午3:42写道:
>>>
>>>> Hey Jacky,
>>>>
>>>> The error says "The YARN application unexpectedly switched to state
>>>> FAILED during deployment.".
>>>> Have you tried retrieving the YARN application logs?
>>>> Does the YARN UI / resource manager logs reveal anything on the reason
>>>> for the deployment to fail?
>>>>
>>>> Best,
>>>> Robert
>>>>
>>>>
>>>> On Mon, May 11, 2020 at 9:34 PM Jacky D <jacky.du0...@gmail.com> wrote:
>>>>
>>>>>
>>>>>
>>>>> ---------- Forwarded message ---------
>>>>> 发件人: Jacky D <jacky.du0...@gmail.com>
>>>>> Date: 2020年5月11日周一 下午3:12
>>>>> Subject: Re: Flink Memory analyze on AWS EMR
>>>>> To: Khachatryan Roman <khachatryan.ro...@gmail.com>
>>>>>
>>>>>
>>>>> Hi, Roman
>>>>>
>>>>> Thanks for quick response , I tried without logFIle option but failed
>>>>> with same error , I'm currently using flink 1.6
>>>>> https://ci.apache.org/projects/flink/flink-docs-release-1.6/monitoring/application_profiling.html,
>>>>> so I can only use Jitwatch or JMC .  I guess those tools only available on
>>>>> Standalone cluster ? as document mentioned "Each standalone
>>>>> JobManager, TaskManager, HistoryServer, and ZooKeeper daemon redirects
>>>>> stdout and stderr to a file with a .out filename suffix and writes
>>>>> internal logging to a file with a .log suffix. Java options
>>>>> configured by the user in env.java.opts" ?
>>>>>
>>>>> Thanks
>>>>> Jacky
>>>>>
>>>>

Reply via email to