[jira] [Created] (FLINK-7965) Docker-Flink: TaskManagers can't find JobManager when in different nodes in Docker Swarm

2017-11-02 Thread Thalita Vergilio (JIRA)
Thalita Vergilio created FLINK-7965:
---

 Summary: Docker-Flink: TaskManagers can't find JobManager when in 
different nodes in Docker Swarm
 Key: FLINK-7965
 URL: https://issues.apache.org/jira/browse/FLINK-7965
 Project: Flink
  Issue Type: Bug
  Components: Docker
Affects Versions: 1.3.2
 Environment: node: ubuntu-swarm-master
Azure VM Standard D4s v3 (4 vcpus, 16 GB memory)
Docker version 17.03.1-ce, build c6d412e

node: azure-swarm-worker-1
Azure VM Standard D2 v2 Promo (2 vcpus, 7 GB memory)
Docker version 17.09.0-ce, build afdb6d4

Flink: using image 1.3.2-hadoop2-scala_2.10
Reporter: Thalita Vergilio
Priority: Major


This happens even when the nodes are in the same subnet.

I am using the Docker-Flink project in: 
https://github.com/apache/flink/tree/master/flink-contrib/docker-flink

I am creating the services with the following commands: 
docker network create -d overlay overlay 
docker service create --name jobmanager --env 
JOB_MANAGER_RPC_ADDRESS=jobmanager -p 8081:8081 --network overlay --constraint 
'node.hostname == ubuntu-swarm-manager' flink jobmanager 
docker service create --name taskmanager --env 
JOB_MANAGER_RPC_ADDRESS=jobmanager --network overlay --constraint 
'node.hostname != ubuntu-swarm-manager' flink taskmanager 

I wonder if there's any configuration I'm missing. This is the error I get: 
- Trying to register at JobManager akka.tcp://flink@jobmanager:6123/   
user/jobmanager (attempt 4, timeout: 4000 milliseconds) 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (FLINK-7961) Docker-Flink with Docker Swarm doesn't work when machines are in different clouds

2017-11-01 Thread Thalita Vergilio (JIRA)
Thalita Vergilio created FLINK-7961:
---

 Summary: Docker-Flink with Docker Swarm doesn't work when machines 
are in different clouds
 Key: FLINK-7961
 URL: https://issues.apache.org/jira/browse/FLINK-7961
 Project: Flink
  Issue Type: Bug
  Components: Docker
Affects Versions: 1.3.0
Reporter: Thalita Vergilio
Priority: Normal


Task Managers can't find Job Manager by name. Maybe some additional Docker 
configuration is needed?

I am running the standard setup and create-docker-swarm-service.sh script from 
the Docker Flink project:
https://github.com/apache/flink/blob/master/flink-contrib/docker-flink/create-docker-swarm-service.sh

This is the log from one of the Task Manager's containers:

{{Starting Task Manager
config file:
jobmanager.rpc.address: flink-jobmanager
jobmanager.rpc.port: 6123
jobmanager.heap.mb: 1024
taskmanager.heap.mb: 1024
taskmanager.numberOfTaskSlots: 2
taskmanager.memory.preallocate: false
parallelism.default: 1
jobmanager.web.port: 8081
blob.server.port: 6124
query.server.port: 6125
Starting taskmanager as a console application on host c42a6093f7bb.
2017-11-01 11:20:51,459 WARN  org.apache.hadoop.util.NativeCodeLoader   
- Unable to load native-hadoop library for your platform... using 
builtin-java classes where applicable
2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- 

2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Starting TaskManager (Version: 1.3.2, Rev:0399bee, 
Date:03.08.2017 @ 10:23:11 UTC)
2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Current user: flink
2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  JVM: OpenJDK 64-Bit Server VM - Oracle Corporation - 
1.8/25.141-b15
2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Maximum heap size: 1024 MiBytes
2017-11-01 11:20:51,522 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  JAVA_HOME: /docker-java-home/jre
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Hadoop version: 2.7.2
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  JVM Options:
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- -XX:+UseG1GC
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- -Xms1024M
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- -Xmx1024M
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- -XX:MaxDirectMemorySize=8388607T
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- 
-Dlog4j.configuration=file:/opt/flink/conf/log4j-console.properties
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- 
-Dlogback.configurationFile=file:/opt/flink/conf/logback-console.xml
2017-11-01 11:20:51,526 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Program Arguments:
2017-11-01 11:20:51,527 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- --configDir
2017-11-01 11:20:51,527 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- /opt/flink/conf
2017-11-01 11:20:51,527 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
-  Classpath: 
/opt/flink/lib/flink-python_2.11-1.3.2.jar:/opt/flink/lib/flink-shaded-hadoop2-uber-1.3.2.jar:/opt/flink/lib/log4j-1.2.17.jar:/opt/flink/lib/slf4j-log4j12-1.7.7.jar:/opt/flink/lib/flink-dist_2.11-1.3.2.jar:::
2017-11-01 11:20:51,527 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- 

2017-11-01 11:20:51,528 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- Registered UNIX signal handlers for [TERM, HUP, INT]
2017-11-01 11:20:51,532 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- Maximum number of open file descriptors is 1048576
2017-11-01 11:20:51,548 INFO  org.apache.flink.runtime.taskmanager.TaskManager  
- Loading configuration from /opt/flink/conf
2017-11-01 11:20:51,551 INFO  
org.apache.flink.configuration.GlobalConfiguration- Loading 
configuration property: jobmanager.rpc.address, flink-jobmanager
2017-11-01 11:20:51,551 INFO  
org.apache.flink.configuration.GlobalConfiguration- Loading 
configuration property: jobmanager.rpc.port, 6123
2017-11-01 11:20:51,551 INFO  
org.apache.flink.configuration