Thanks Andrey.
The environment we run [Azure HD insight cluster] only supports Flink 1.4.2 
now. So I can’t run with 1.8 in this environment.
I can run in a different environment with 1.8 [on Kubernetes not YARN though] 
and report the results.

Thanks,
-Fakrudeen
(define (sqrte n xn eph) (if (> eph (abs (- n (* xn xn)))) xn (sqrte n (/ (+ xn 
(/ n xn)) 2) eph)))


From: Andrey Zagrebin <and...@ververica.com>
Date: Monday, July 22, 2019 at 8:52 AM
To: Fakrudeen Ali Ahmed <aliah...@adobe.com>
Cc: "user@flink.apache.org" <user@flink.apache.org>
Subject: Re: Job submission timeout with no error info.

Hi Fakrudeen,

Thanks for sharing the logs. Could you also try it with Flink 1.8?

Best,
Andrey

On Sat, Jul 20, 2019 at 12:44 AM Fakrudeen Ali Ahmed 
<aliah...@adobe.com<mailto:aliah...@adobe.com>> wrote:
Hi Andrey,


Flink  version: 1.4.2
Please find the client log attached and job manager log is at: job manager 
log<https://nam04.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdrive.google.com%2Ffile%2Fd%2F1iNOs2E69jevF9pu1t7uw6Gj2XZKJoWpC%2Fview%3Fusp%3Dsharing&data=02%7C01%7Caliahmed%40adobe.com%7Ce8e339fa64f94716405308d70ebc8efd%7Cfa7b1b5a7b34438794aed2c178decee1%7C0%7C0%7C636994075340692673&sdata=MUvrLHyn355YL5aCI8Yqr4FVDoGhYW0EvW3qj3HIOpY%3D&reserved=0>.

Thanks,
-Fakrudeen
(define (sqrte n xn eph) (if (> eph (abs (- n (* xn xn)))) xn (sqrte n (/ (+ xn 
(/ n xn)) 2) eph)))


From: Andrey Zagrebin <and...@ververica.com<mailto:and...@ververica.com>>
Date: Friday, July 19, 2019 at 10:36 AM
To: Fakrudeen Ali Ahmed <aliah...@adobe.com<mailto:aliah...@adobe.com>>
Cc: "user@flink.apache.org<mailto:user@flink.apache.org>" 
<user@flink.apache.org<mailto:user@flink.apache.org>>
Subject: Re: Job submission timeout with no error info.

Hi Fakrudeen,

which Flink version do you use? could you share full client and job manager 
logs?

Best,
Andrey

On Fri, Jul 19, 2019 at 7:00 PM Fakrudeen Ali Ahmed 
<aliah...@adobe.com<mailto:aliah...@adobe.com>> wrote:
Hi,

We are submitting a Flink topology [YARN] and it fails during upload of the jar 
with no error info.


[main] INFO org.apache.flink.runtime.client.JobClient - Checking and uploading 
JAR files

[main] ERROR org.apache.flink.client.CliFrontend - Error while running the 
command.

org.apache.flink.client.program.ProgramInvocationException: The program 
execution failed: JobManager did not respond within 60000 ms


Flink UI says:
“Could not retrieve the redirect address of the current leader. Please try to 
refresh.”

We tried increasing job manager memory to 8GB and still it has the same issue. 
Jar size is around 190 MB but we were able to successfully run this size 
before. Zookeeper in Hadoop YARN cluster is healthy.

How do we start debugging this? Is it some dependency jar issue in our uber jar 
or something else?

Thanks,
-Fakrudeen
(define (sqrte n xn eph) (if (> eph (abs (- n (* xn xn)))) xn (sqrte n (/ (+ xn 
(/ n xn)) 2) eph)))

Reply via email to