Hello,
this problem is described in
https://issues.apache.org/jira/browse/FLINK-6689.
Basically, if you want to use the LocalFlinkMiniCluster you should use a
TestStreamEnvironment instead.
The RemoteStreamEnvironment only works with a proper Flink cluster.
Regards,
Chesnay
On 14.07.2017 15:43, Boris Lublinsky wrote:
Hi,
I am trying to upgrade my project from Flink 1.2 to 1.3 and getting
problems while trying to run Flink server from my Intellij project.
The code
// Execute on the local Flink server - to test queariable state def
executeServer() :Unit = {
// We use a mini cluster here for sake of simplicity, because I don't
want // to require a Flink installation to run this demo. Everything
should be // contained in this JAR. val port =6124 val parallelism =4 val config =new Configuration()
config.setInteger(ConfigConstants.JOB_MANAGER_IPC_PORT_KEY, port)
config.setInteger(ConfigConstants.LOCAL_NUMBER_TASK_MANAGER, 1)
config.setInteger(ConfigConstants.TASK_MANAGER_NUM_TASK_SLOTS, parallelism)
// In a non MiniCluster setup queryable state is enabled by default.
config.setBoolean(QueryableStateOptions.SERVER_ENABLE, true)
// Create a local Flink server val flinkCluster =new
LocalFlinkMiniCluster(config, false)
try {
// Start server and create environment flinkCluster.start(true); val env =
StreamExecutionEnvironment.createRemoteEnvironment("localhost", port,
parallelism)
// Build Graph buildGraph(env)
env.execute()
val jobGraph = env.getStreamGraph.getJobGraph
// Submit to the server and wait for completion
flinkCluster.submitJobAndWait(jobGraph, false)
}catch {
case e:Exception => e.printStackTrace()
}
}
Worked on version 1.2, but on 1.3 I am getting
08:41:29,179 INFO
org.apache.flink.runtime.minicluster.FlinkMiniCluster - Starting
FlinkMiniCluster.
08:41:29,431 INFO akka.event.slf4j.Slf4jLogger
- Slf4jLogger started
08:41:29,498 INFO Remoting - Starting
remoting
08:41:29,730 INFO Remoting - Remoting
started; listening on addresses :[akka.tcp://flink@localhost:6124]
08:41:29,762 INFO org.apache.flink.runtime.blob.BlobServer - Created
BLOB server storage directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/blobStore-4e626961-9155-47e9-b1b8-f835a8435cfc
08:41:29,765 INFO org.apache.flink.runtime.blob.BlobServer - Started
BLOB server at 0.0.0.0:54319 - max concurrent requests: 50 - max
backlog: 1000
08:41:29,775 INFO org.apache.flink.runtime.metrics.MetricRegistry -
No metrics reporter configured, no metrics will be exposed/reported.
08:41:29,781 INFO org.apache.flink.runtime.jobmanager.MemoryArchivist
- Started memory archivist akka://flink/user/archive
08:41:29,786 INFO org.apache.flink.runtime.jobmanager.JobManager -
Starting JobManager at akka.tcp://flink@localhost:6124/user/jobmanager.
08:41:29,787 INFO
org.apache.flink.runtime.highavailability.nonha.embedded.EmbeddedLeaderService
- Proposing leadership to contender
org.apache.flink.runtime.jobmanager.JobManager@59cd5ef5 @
akka.tcp://flink@localhost:6124/user/jobmanager
08:41:29,796 INFO akka.event.slf4j.Slf4jLogger
- Slf4jLogger started
08:41:29,804 INFO Remoting - Starting
remoting
08:41:29,813 INFO Remoting - Remoting
started; listening on addresses :[akka.tcp://flink@localhost:54320]
08:41:29,825 INFO akka.event.slf4j.Slf4jLogger
- Slf4jLogger started
08:41:29,830 INFO Remoting - Starting
remoting
08:41:29,836 INFO Remoting - Remoting
started; listening on addresses :[akka.tcp://flink@localhost:54321]
08:41:29,846 INFO org.apache.flink.runtime.jobmanager.JobManager -
JobManager akka.tcp://flink@localhost:6124/user/jobmanager was granted
leadership with leader session ID
Some(61d3ed9b-1c24-4bbf-99ef-c2a891613473).
08:41:29,847 INFO
org.apache.flink.runtime.highavailability.nonha.embedded.EmbeddedLeaderService
- Received confirmation of leadership for leader
akka.tcp://flink@localhost:6124/user/jobmanager ,
session=61d3ed9b-1c24-4bbf-99ef-c2a891613473
08:41:29,850 INFO
org.apache.flink.runtime.taskexecutor.TaskManagerConfiguration -
Messages have a max timeout of 10000 ms
08:41:29,851 INFO
org.apache.flink.runtime.clusterframework.standalone.StandaloneResourceManager
- Received leader address but not running in leader ActorSystem.
Cancelling registration.
08:41:29,855 INFO
org.apache.flink.runtime.taskexecutor.TaskManagerServices - Temporary
file directory '/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T':
total 464 GB, usable 353 GB (76.08% usable)
08:41:30,493 INFO
org.apache.flink.runtime.io.network.buffer.NetworkBufferPool -
Allocated 363 MB for network buffer pool (number of memory segments:
11634, bytes per segment: 32768).
08:41:30,506 INFO
org.apache.flink.runtime.io.network.NetworkEnvironment - Starting
the network environment and its components.
08:41:30,508 INFO
org.apache.flink.runtime.taskexecutor.TaskManagerServices - Limiting
managed memory to 1145 MB, memory will be allocated lazily.
08:41:30,512 INFO
org.apache.flink.runtime.io.disk.iomanager.IOManager - I/O manager
uses directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/flink-io-9ec461ff-086d-45cd-b69c-e9890217d8fc
for spill files.
08:41:30,514 INFO org.apache.flink.runtime.metrics.MetricRegistry -
No metrics reporter configured, no metrics will be exposed/reported.
08:41:30,561 INFO org.apache.flink.runtime.filecache.FileCache -
User file cache uses directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/flink-dist-cache-4c4e9bcf-5a66-43e7-b2e9-244f310c3c4c
08:41:30,570 INFO org.apache.flink.runtime.filecache.FileCache -
User file cache uses directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/flink-dist-cache-687f7d57-33d7-4df3-915f-481008043fef
08:41:30,575 INFO org.apache.flink.runtime.taskmanager.TaskManager -
Starting TaskManager actor at akka://flink/user/taskmanager#-1401663761.
08:41:30,576 INFO org.apache.flink.runtime.taskmanager.TaskManager -
TaskManager data connection information:
15d5b91a66be806304e6fe15fde8c0fe @ localhost (dataPort=-1)
08:41:30,576 INFO org.apache.flink.runtime.taskmanager.TaskManager -
TaskManager has 4 task slot(s).
08:41:30,578 INFO org.apache.flink.runtime.taskmanager.TaskManager -
Memory usage stats: [HEAP: 391/838/3641 MB, NON HEAP: 25/26/-1 MB
(used/committed/max)]
08:41:30,582 INFO org.apache.flink.runtime.taskmanager.TaskManager -
Trying to register at JobManager
akka.tcp://flink@localhost:6124/user/jobmanager (attempt 1, timeout:
500 milliseconds)
08:41:30,729 INFO org.apache.flink.runtime.jobmanager.JobManager -
Task Manager Registration but not connected to ResourceManager
08:41:30,732 INFO org.apache.flink.runtime.instance.InstanceManager -
Registered TaskManager at localhost
(akka.tcp://flink@localhost:54321/user/taskmanager) as
38047c3fc643910d58ecc414e8233f78. Current number of registered hosts
is 1. Current number of alive task slots is 4.
08:41:30,741 INFO org.apache.flink.runtime.taskmanager.TaskManager -
Successful registration at JobManager
(akka.tcp://flink@localhost:6124/user/jobmanager), starting network
stack and library cache.
08:41:30,743 INFO org.apache.flink.runtime.taskmanager.TaskManager -
Determined BLOB server address to be localhost/127.0.0.1:54319.
Starting BLOB cache.
08:41:30,745 INFO org.apache.flink.runtime.blob.BlobCache - Created
BLOB cache storage directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/blobStore-122d4c13-34d1-4c01-8a50-f6dfdae8b06b
08:41:30,996 INFO
org.apache.flink.streaming.api.environment.RemoteStreamEnvironment -
Running remotely at localhost:6124
08:41:31,085 INFO
org.apache.flink.client.program.StandaloneClusterClient - Starting
client actor system.
08:41:31,087 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils
- Trying to select the network interface and address to use by
connecting to the leading JobManager.
08:41:31,087 INFO org.apache.flink.runtime.util.LeaderRetrievalUtils
- TaskManager will try to connect for 10000 milliseconds before
falling back to heuristics
08:41:31,088 INFO org.apache.flink.runtime.net.ConnectionUtils -
Retrieved new target address localhost/127.0.0.1:6124.
08:41:31,100 INFO akka.event.slf4j.Slf4jLogger
- Slf4jLogger started
08:41:31,103 INFO Remoting - Starting
remoting
08:41:31,108 INFO Remoting - Remoting
started; listening on addresses :[akka.tcp://flink@localhost:54324]
08:41:31,108 INFO
org.apache.flink.client.program.StandaloneClusterClient - Submitting
job with JobID: 74abb7674b9522ad3a204a1315cf609e. Waiting for job
completion.
Submitting job with JobID: 74abb7674b9522ad3a204a1315cf609e. Waiting
for job completion.
08:41:31,113 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor -
Disconnect from JobManager null.
08:41:31,116 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Received
SubmitJobAndWait(JobGraph(jobId: 74abb7674b9522ad3a204a1315cf609e))
but there is no connection to a JobManager yet.
08:41:31,116 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Received
job Flink Streaming Job (74abb7674b9522ad3a204a1315cf609e).
08:41:31,125 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Connect
to JobManager
Actor[akka.tcp://flink@localhost:6124/user/jobmanager#-297192771].
08:41:31,126 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Connected
to JobManager at
Actor[akka.tcp://flink@localhost:6124/user/jobmanager#-297192771] with
leader session id 00000000-0000-0000-0000-000000000000.
Connected to JobManager at
Actor[akka.tcp://flink@localhost:6124/user/jobmanager#-297192771] with
leader session id 00000000-0000-0000-0000-000000000000.
08:41:31,126 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Sending
message to JobManager akka.tcp://flink@localhost:6124/user/jobmanager
to submit job Flink Streaming Job (74abb7674b9522ad3a204a1315cf609e)
and wait for progress
08:41:31,128 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Upload
jar files to job manager akka.tcp://flink@localhost:6124/user/jobmanager.
08:41:31,129 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Submit
job to the job manager akka.tcp://flink@localhost:6124/user/jobmanager.
08:41:31,146 WARN org.apache.flink.runtime.jobmanager.JobManager -
Discard message
LeaderSessionMessage(00000000-0000-0000-0000-000000000000,SubmitJob(JobGraph(jobId:
74abb7674b9522ad3a204a1315cf609e),EXECUTION_RESULT_AND_STATE_CHANGES))
because the expected leader session ID
61d3ed9b-1c24-4bbf-99ef-c2a891613473 did not equal the received leader
session ID 00000000-0000-0000-0000-000000000000.
08:42:30,381 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor - Terminate
JobClientActor.
08:42:30,382 INFO
org.apache.flink.runtime.client.JobSubmissionClientActor -
Disconnect from JobManager
Actor[akka.tcp://flink@localhost:6124/user/jobmanager#-297192771].
08:42:30,391 INFO
akka.remote.RemoteActorRefProvider$RemotingTerminator - Shutting down
remote daemon.
08:42:30,392 INFO
akka.remote.RemoteActorRefProvider$RemotingTerminator - Remote daemon
shut down; proceeding with flushing remote transports.
08:42:30,411 INFO
akka.remote.RemoteActorRefProvider$RemotingTerminator - Remoting shut
down.
org.apache.flink.client.program.ProgramInvocationException: The
program execution failed: Couldn't retrieve the JobExecutionResult
from the JobManager.
at
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:478)
at
org.apache.flink.client.program.StandaloneClusterClient.submitJob(StandaloneClusterClient.java:105)
at
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:442)
at
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:434)
at
org.apache.flink.streaming.api.environment.RemoteStreamEnvironment.executeRemotely(RemoteStreamEnvironment.java:212)
at
org.apache.flink.streaming.api.environment.RemoteStreamEnvironment.execute(RemoteStreamEnvironment.java:176)
at
org.apache.flink.streaming.api.environment.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.java:1499)
at
org.apache.flink.streaming.api.scala.StreamExecutionEnvironment.execute(StreamExecutionEnvironment.scala:629)
at
com.lightbend.modelServer.ModelServingKeyedJob$.executeServer(ModelServingKeyedJob.scala:66)
at
com.lightbend.modelServer.ModelServingKeyedJob$.main(ModelServingKeyedJob.scala:39)
at
com.lightbend.modelServer.ModelServingKeyedJob.main(ModelServingKeyedJob.scala)
Caused by: org.apache.flink.runtime.client.JobExecutionException:
Couldn't retrieve the JobExecutionResult from the JobManager.
at
org.apache.flink.runtime.client.JobClient.awaitJobResult(JobClient.java:309)
at
org.apache.flink.runtime.client.JobClient.submitJobAndWait(JobClient.java:396)
at
org.apache.flink.client.program.ClusterClient.run(ClusterClient.java:467)
... 10 more
Caused by:
org.apache.flink.runtime.client.JobClientActorSubmissionTimeoutException:
Job submission to the JobManager timed out. You may increase
'akka.client.timeout' in case the JobManager needs more time to
configure and confirm the job submission.
at
org.apache.flink.runtime.client.JobSubmissionClientActor.handleCustomMessage(JobSubmissionClientActor.java:119)
at
org.apache.flink.runtime.client.JobClientActor.handleMessage(JobClientActor.java:251)
at
org.apache.flink.runtime.akka.FlinkUntypedActor.handleLeaderSessionID(FlinkUntypedActor.java:89)
at
org.apache.flink.runtime.akka.FlinkUntypedActor.onReceive(FlinkUntypedActor.java:68)
at
akka.actor.UntypedActor$$anonfun$receive$1.applyOrElse(UntypedActor.scala:167)
at akka.actor.Actor$class.aroundReceive(Actor.scala:467)
at akka.actor.UntypedActor.aroundReceive(UntypedActor.scala:97)
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
at akka.actor.ActorCell.invoke(ActorCell.scala:487)
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:238)
at akka.dispatch.Mailbox.run(Mailbox.scala:220)
at
akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
at
scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
at
scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
at
scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)
08:42:30,424 INFO org.apache.flink.runtime.blob.BlobCache - Shutting
down BlobCache
08:42:30,433 INFO org.apache.flink.runtime.blob.BlobServer - Stopped
BLOB server at 0.0.0.0:54319
08:42:30,434 INFO
org.apache.flink.runtime.io.disk.iomanager.IOManager - I/O manager
removed spill file directory
/var/folders/3m/52z04fgs3hq88mzft9l0fsrm0000gn/T/flink-io-9ec461ff-086d-45cd-b69c-e9890217d8fc
Process finished with exit code 0
Any help will be appreciated
Boris Lublinsky
FDP Architect
boris.lublin...@lightbend.com <mailto:boris.lublin...@lightbend.com>
https://www.lightbend.com/