[jira] [Resolved] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joshua Cohen resolved AURORA-1788. -- Resolution: Fixed Thanks for the patch! > vagrant up does not properly configure network adapters > --- > > Key: AURORA-1788 > URL: https://issues.apache.org/jira/browse/AURORA-1788 > Project: Aurora > Issue Type: Bug >Reporter: Andrew Jorgensen >Assignee: Andrew Jorgensen > > I am not sure of the specifics of why this happens but on vagrant 1.8.6 the > network interface does not come up correctly and the private_network is > attached to the eth0 nat interface rather than the host-only interface. I > tried a number of different parameters but none of them were able to > configure the network appropriately. This change manually configures the > static ip so that it is connected to the correct adapter. Without this change > I could not access the aurora web interface when running vagrant up. > I've created a patch here: https://reviews.apache.org/r/52609/ > This is what the configuration looks like when run off master: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state > DOWN group default > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > here is what it is supposed to look like: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:fe7c:4e72/64 scope link >valid_lft forever preferred_lft forever > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > Steps to reproduce: > 1. Update to vagrant 1.8.6 (unsure if previous versions are affected as well) > 2. Run `vagrant up` > 3. Try to visit http://192.168.33.7:8081 > Expected outcome: > I expect that following the steps in > http://aurora.apache.org/documentation/latest/getting-started/vagrant/ I > would be able to visit the web interface for aurora. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joshua Cohen updated AURORA-1788: - Assignee: Andrew Jorgensen > vagrant up does not properly configure network adapters > --- > > Key: AURORA-1788 > URL: https://issues.apache.org/jira/browse/AURORA-1788 > Project: Aurora > Issue Type: Bug >Reporter: Andrew Jorgensen >Assignee: Andrew Jorgensen > > I am not sure of the specifics of why this happens but on vagrant 1.8.6 the > network interface does not come up correctly and the private_network is > attached to the eth0 nat interface rather than the host-only interface. I > tried a number of different parameters but none of them were able to > configure the network appropriately. This change manually configures the > static ip so that it is connected to the correct adapter. Without this change > I could not access the aurora web interface when running vagrant up. > I've created a patch here: https://reviews.apache.org/r/52609/ > This is what the configuration looks like when run off master: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state > DOWN group default > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > here is what it is supposed to look like: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:fe7c:4e72/64 scope link >valid_lft forever preferred_lft forever > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > Steps to reproduce: > 1. Update to vagrant 1.8.6 (unsure if previous versions are affected as well) > 2. Run `vagrant up` > 3. Try to visit http://192.168.33.7:8081 > Expected outcome: > I expect that following the steps in > http://aurora.apache.org/documentation/latest/getting-started/vagrant/ I > would be able to visit the web interface for aurora. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15553739#comment-15553739 ] Andrew Jorgensen commented on AURORA-1788: -- This can be closed by: https://reviews.apache.org/r/52609/ > vagrant up does not properly configure network adapters > --- > > Key: AURORA-1788 > URL: https://issues.apache.org/jira/browse/AURORA-1788 > Project: Aurora > Issue Type: Bug >Reporter: Andrew Jorgensen > > I am not sure of the specifics of why this happens but on vagrant 1.8.6 the > network interface does not come up correctly and the private_network is > attached to the eth0 nat interface rather than the host-only interface. I > tried a number of different parameters but none of them were able to > configure the network appropriately. This change manually configures the > static ip so that it is connected to the correct adapter. Without this change > I could not access the aurora web interface when running vagrant up. > I've created a patch here: https://reviews.apache.org/r/52609/ > This is what the configuration looks like when run off master: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state > DOWN group default > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > here is what it is supposed to look like: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:fe7c:4e72/64 scope link >valid_lft forever preferred_lft forever > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > Steps to reproduce: > 1. Update to vagrant 1.8.6 (unsure if previous versions are affected as well) > 2. Run `vagrant up` > 3. Try to visit http://192.168.33.7:8081 > Expected outcome: > I expect that following the steps in > http://aurora.apache.org/documentation/latest/getting-started/vagrant/ I > would be able to visit the web interface for aurora. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (AURORA-1789) namespaces/pid isolator causes lost process
[ https://issues.apache.org/jira/browse/AURORA-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15553561#comment-15553561 ] Zameer Manji commented on AURORA-1789: -- Here is my gut feeling on the first cut of these logs and what I see in the code. I'm focusing on {{apache/thermos/core/process.py}}. The coordinator appears to be forked and running. Once it is forked, it is supposed to execute the target process, and then write a checkpoint indicating the process is RUNNING and with the pid. That never happens here. Another thing going on here is that this task is running using the unified containerizer and has a task filesystem. To me there appears to be a bug inside the {{execute()}} method, perhaps in trying to execute the target process via the mesos-containerizer. > namespaces/pid isolator causes lost process > --- > > Key: AURORA-1789 > URL: https://issues.apache.org/jira/browse/AURORA-1789 > Project: Aurora > Issue Type: Bug > Components: Executor >Affects Versions: 0.16.0 >Reporter: Justin Pinkul >Assignee: Zameer Manji > > When using the Mesos containerizer with namespaces/pid isolator and a Docker > image the Thermos executor is unable to launch processes. The executor tries > to fork the process then is unable to locate the process after the fork. > {code:title=thermos_runner.INFO} > I1006 21:36:22.842595 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:37:22.929864 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=205, process=u'BigBrother start', start_time=None, > coordinator_pid=1144, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789782.842882) > I1006 21:37:22.931456 75 helper.py:153] Coordinator BigBrother start [pid: > 1144] completed. > I1006 21:37:22.931732 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:37:22.935580 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:38:23.023725 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=208, process=u'BigBrother start', start_time=None, > coordinator_pid=1157, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789842.935872) > I1006 21:38:23.025332 75 helper.py:153] Coordinator BigBrother start [pid: > 1157] completed. > I1006 21:38:23.025629 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:38:23.029414 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:39:23.117208 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=211, process=u'BigBrother start', start_time=None, > coordinator_pid=1170, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789903.029694) > I1006 21:39:23.118841 75 helper.py:153] Coordinator BigBrother start [pid: > 1170] completed. > I1006 21:39:23.119134 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:39:23.122920 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:40:23.211095 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=214, process=u'BigBrother start', start_time=None, > coordinator_pid=1183, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789963.123206) > I1006 21:40:23.212711 75 helper.py:153] Coordinator BigBrother start [pid: > 1183] completed. > I1006 21:40:23.213006 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:40:23.216810 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:41:23.305505 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=217, process=u'BigBrother start', start_time=None, > coordinator_pid=1196, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790023.21709) > I1006 21:41:23.307157 75 helper.py:153] Coordinator BigBrother start [pid: > 1196] completed. > I1006 21:41:23.307450 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:41:23.311230 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:42:23.398277 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=220, process=u'BigBrother start', start_time=None, > coordinator_pid=1209, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790083.311512) > I1006 21:42:23.399893 75 helper.py:153] Coordinator BigBrother start [pid: > 1209] completed. > I1006 21:42:23.400185 75 runner.py:133] Process BigBrother start had an > abnormal termination > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (AURORA-1789) namespaces/pid isolator causes lost process
[ https://issues.apache.org/jira/browse/AURORA-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Justin Pinkul updated AURORA-1789: -- Assignee: Zameer Manji (was: Justin Pinkul) > namespaces/pid isolator causes lost process > --- > > Key: AURORA-1789 > URL: https://issues.apache.org/jira/browse/AURORA-1789 > Project: Aurora > Issue Type: Bug > Components: Executor >Affects Versions: 0.16.0 >Reporter: Justin Pinkul >Assignee: Zameer Manji > > When using the Mesos containerizer with namespaces/pid isolator and a Docker > image the Thermos executor is unable to launch processes. The executor tries > to fork the process then is unable to locate the process after the fork. > {code:title=thermos_runner.INFO} > I1006 21:36:22.842595 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:37:22.929864 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=205, process=u'BigBrother start', start_time=None, > coordinator_pid=1144, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789782.842882) > I1006 21:37:22.931456 75 helper.py:153] Coordinator BigBrother start [pid: > 1144] completed. > I1006 21:37:22.931732 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:37:22.935580 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:38:23.023725 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=208, process=u'BigBrother start', start_time=None, > coordinator_pid=1157, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789842.935872) > I1006 21:38:23.025332 75 helper.py:153] Coordinator BigBrother start [pid: > 1157] completed. > I1006 21:38:23.025629 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:38:23.029414 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:39:23.117208 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=211, process=u'BigBrother start', start_time=None, > coordinator_pid=1170, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789903.029694) > I1006 21:39:23.118841 75 helper.py:153] Coordinator BigBrother start [pid: > 1170] completed. > I1006 21:39:23.119134 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:39:23.122920 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:40:23.211095 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=214, process=u'BigBrother start', start_time=None, > coordinator_pid=1183, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789963.123206) > I1006 21:40:23.212711 75 helper.py:153] Coordinator BigBrother start [pid: > 1183] completed. > I1006 21:40:23.213006 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:40:23.216810 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:41:23.305505 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=217, process=u'BigBrother start', start_time=None, > coordinator_pid=1196, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790023.21709) > I1006 21:41:23.307157 75 helper.py:153] Coordinator BigBrother start [pid: > 1196] completed. > I1006 21:41:23.307450 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:41:23.311230 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:42:23.398277 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=220, process=u'BigBrother start', start_time=None, > coordinator_pid=1209, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790083.311512) > I1006 21:42:23.399893 75 helper.py:153] Coordinator BigBrother start [pid: > 1209] completed. > I1006 21:42:23.400185 75 runner.py:133] Process BigBrother start had an > abnormal termination > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Assigned] (AURORA-1789) namespaces/pid isolator causes lost process
[ https://issues.apache.org/jira/browse/AURORA-1789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Justin Pinkul reassigned AURORA-1789: - Assignee: Justin Pinkul > namespaces/pid isolator causes lost process > --- > > Key: AURORA-1789 > URL: https://issues.apache.org/jira/browse/AURORA-1789 > Project: Aurora > Issue Type: Bug > Components: Executor >Affects Versions: 0.16.0 >Reporter: Justin Pinkul >Assignee: Justin Pinkul > > When using the Mesos containerizer with namespaces/pid isolator and a Docker > image the Thermos executor is unable to launch processes. The executor tries > to fork the process then is unable to locate the process after the fork. > {code:title=thermos_runner.INFO} > I1006 21:36:22.842595 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:37:22.929864 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=205, process=u'BigBrother start', start_time=None, > coordinator_pid=1144, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789782.842882) > I1006 21:37:22.931456 75 helper.py:153] Coordinator BigBrother start [pid: > 1144] completed. > I1006 21:37:22.931732 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:37:22.935580 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:38:23.023725 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=208, process=u'BigBrother start', start_time=None, > coordinator_pid=1157, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789842.935872) > I1006 21:38:23.025332 75 helper.py:153] Coordinator BigBrother start [pid: > 1157] completed. > I1006 21:38:23.025629 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:38:23.029414 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:39:23.117208 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=211, process=u'BigBrother start', start_time=None, > coordinator_pid=1170, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789903.029694) > I1006 21:39:23.118841 75 helper.py:153] Coordinator BigBrother start [pid: > 1170] completed. > I1006 21:39:23.119134 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:39:23.122920 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:40:23.211095 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=214, process=u'BigBrother start', start_time=None, > coordinator_pid=1183, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475789963.123206) > I1006 21:40:23.212711 75 helper.py:153] Coordinator BigBrother start [pid: > 1183] completed. > I1006 21:40:23.213006 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:40:23.216810 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:41:23.305505 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=217, process=u'BigBrother start', start_time=None, > coordinator_pid=1196, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790023.21709) > I1006 21:41:23.307157 75 helper.py:153] Coordinator BigBrother start [pid: > 1196] completed. > I1006 21:41:23.307450 75 runner.py:133] Process BigBrother start had an > abnormal termination > I1006 21:41:23.311230 75 runner.py:865] Forking Process(BigBrother start) > I1006 21:42:23.398277 75 runner.py:825] Detected a LOST task: > ProcessStatus(seq=220, process=u'BigBrother start', start_time=None, > coordinator_pid=1209, pid=None, return_code=None, state=1, stop_time=None, > fork_time=1475790083.311512) > I1006 21:42:23.399893 75 helper.py:153] Coordinator BigBrother start [pid: > 1209] completed. > I1006 21:42:23.400185 75 runner.py:133] Process BigBrother start had an > abnormal termination > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (AURORA-1789) namespaces/pid isolator causes lost process
Justin Pinkul created AURORA-1789: - Summary: namespaces/pid isolator causes lost process Key: AURORA-1789 URL: https://issues.apache.org/jira/browse/AURORA-1789 Project: Aurora Issue Type: Bug Components: Executor Affects Versions: 0.16.0 Reporter: Justin Pinkul When using the Mesos containerizer with namespaces/pid isolator and a Docker image the Thermos executor is unable to launch processes. The executor tries to fork the process then is unable to locate the process after the fork. {code:title=thermos_runner.INFO} I1006 21:36:22.842595 75 runner.py:865] Forking Process(BigBrother start) I1006 21:37:22.929864 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=205, process=u'BigBrother start', start_time=None, coordinator_pid=1144, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475789782.842882) I1006 21:37:22.931456 75 helper.py:153] Coordinator BigBrother start [pid: 1144] completed. I1006 21:37:22.931732 75 runner.py:133] Process BigBrother start had an abnormal termination I1006 21:37:22.935580 75 runner.py:865] Forking Process(BigBrother start) I1006 21:38:23.023725 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=208, process=u'BigBrother start', start_time=None, coordinator_pid=1157, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475789842.935872) I1006 21:38:23.025332 75 helper.py:153] Coordinator BigBrother start [pid: 1157] completed. I1006 21:38:23.025629 75 runner.py:133] Process BigBrother start had an abnormal termination I1006 21:38:23.029414 75 runner.py:865] Forking Process(BigBrother start) I1006 21:39:23.117208 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=211, process=u'BigBrother start', start_time=None, coordinator_pid=1170, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475789903.029694) I1006 21:39:23.118841 75 helper.py:153] Coordinator BigBrother start [pid: 1170] completed. I1006 21:39:23.119134 75 runner.py:133] Process BigBrother start had an abnormal termination I1006 21:39:23.122920 75 runner.py:865] Forking Process(BigBrother start) I1006 21:40:23.211095 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=214, process=u'BigBrother start', start_time=None, coordinator_pid=1183, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475789963.123206) I1006 21:40:23.212711 75 helper.py:153] Coordinator BigBrother start [pid: 1183] completed. I1006 21:40:23.213006 75 runner.py:133] Process BigBrother start had an abnormal termination I1006 21:40:23.216810 75 runner.py:865] Forking Process(BigBrother start) I1006 21:41:23.305505 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=217, process=u'BigBrother start', start_time=None, coordinator_pid=1196, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475790023.21709) I1006 21:41:23.307157 75 helper.py:153] Coordinator BigBrother start [pid: 1196] completed. I1006 21:41:23.307450 75 runner.py:133] Process BigBrother start had an abnormal termination I1006 21:41:23.311230 75 runner.py:865] Forking Process(BigBrother start) I1006 21:42:23.398277 75 runner.py:825] Detected a LOST task: ProcessStatus(seq=220, process=u'BigBrother start', start_time=None, coordinator_pid=1209, pid=None, return_code=None, state=1, stop_time=None, fork_time=1475790083.311512) I1006 21:42:23.399893 75 helper.py:153] Coordinator BigBrother start [pid: 1209] completed. I1006 21:42:23.400185 75 runner.py:133] Process BigBrother start had an abnormal termination {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Jorgensen updated AURORA-1788: - Description: I am not sure of the specifics of why this happens but on vagrant 1.8.6 the network interface does not come up correctly and the private_network is attached to the eth0 nat interface rather than the host-only interface. I tried a number of different parameters but none of them were able to configure the network appropriately. This change manually configures the static ip so that it is connected to the correct adapter. Without this change I could not access the aurora web interface when running vagrant up. I've created a patch here: https://reviews.apache.org/r/52609/ This is what the configuration looks like when run off master: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state DOWN group default link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} here is what it is supposed to look like: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:fe7c:4e72/64 scope link valid_lft forever preferred_lft forever 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} Steps to reproduce: 1. Update to vagrant 1.8.6 (unsure if previous versions are affected as well) 2. Run `vagrant up` 3. Try to visit http://192.168.33.7:8081 Expected outcome: I expect that following the steps in http://aurora.apache.org/documentation/latest/getting-started/vagrant/ I would be able to visit the web interface for aurora. was: I am not sure of the specifics of why this happens but on vagrant 1.8.6 the network interface does not come up correctly and the private_network is attached to the eth0 nat interface rather than the host-only interface. I tried a number of different parameters but none of them were able to configure the network appropriately. This change manually configures the static ip so that it is connected to the correct adapter. Without this change I could not access the aurora web interface when running vagrant up. I've created a patch here: https://reviews.apache.org/r/52609/ This is what the configuration looks like when run off master: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state DOWN group default link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff 4: docker0: mtu 1500 qdisc noqueu
[jira] [Updated] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Jorgensen updated AURORA-1788: - Description: I am not sure of the specifics of why this happens but on vagrant 1.8.6 the network interface does not come up correctly and the private_network is attached to the eth0 nat interface rather than the host-only interface. I tried a number of different parameters but none of them were able to configure the network appropriately. This change manually configures the static ip so that it is connected to the correct adapter. Without this change I could not access the aurora web interface when running vagrant up. I've created a patch here: https://reviews.apache.org/r/52609/ This is what the configuration looks like when run off master: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state DOWN group default link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} here is what it is supposed to look like: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:fe7c:4e72/64 scope link valid_lft forever preferred_lft forever 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} was: I am not sure of the specifics of why this happens but on vagrant 1.8.6 the network interface does not come up correctly and the private_network is attached to the eth0 nat interface rather than the host-only interface. I tried a number of different parameters but none of them were able to configure the network appropriately. This change manually configures the static ip so that it is connected to the correct adapter. Without this change I could not access the aurora web interface when running vagrant up. I've created a patch here: https://reviews.apache.org/r/52609/ This is what the configuration looks like when run off master: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state DOWN group default qlen 1000 link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} here is what it is supposed to look like: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:0
[jira] [Updated] (AURORA-1788) vagrant up does not properly configure network adapters
[ https://issues.apache.org/jira/browse/AURORA-1788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Jorgensen updated AURORA-1788: - Issue Type: Bug (was: Story) > vagrant up does not properly configure network adapters > --- > > Key: AURORA-1788 > URL: https://issues.apache.org/jira/browse/AURORA-1788 > Project: Aurora > Issue Type: Bug >Reporter: Andrew Jorgensen > > I am not sure of the specifics of why this happens but on vagrant 1.8.6 the > network interface does not come up correctly and the private_network is > attached to the eth0 nat interface rather than the host-only interface. I > tried a number of different parameters but none of them were able to > configure the network appropriately. This change manually configures the > static ip so that it is connected to the correct adapter. Without this change > I could not access the aurora web interface when running vagrant up. > I've created a patch here: https://reviews.apache.org/r/52609/ > This is what the configuration looks like when run off master: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state > DOWN group default qlen 1000 > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} > here is what it is supposed to look like: > {code} > ip addr > 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group > default > link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 > inet 127.0.0.1/8 scope host lo >valid_lft forever preferred_lft forever > inet6 ::1/128 scope host >valid_lft forever preferred_lft forever > 2: eth0: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff > inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:feb3:1b30/64 scope link >valid_lft forever preferred_lft forever > 3: eth1: mtu 1500 qdisc pfifo_fast state UP > group default qlen 1000 > link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff > inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 >valid_lft forever preferred_lft forever > inet6 fe80::a00:27ff:fe7c:4e72/64 scope link >valid_lft forever preferred_lft forever > 4: docker0: mtu 1500 qdisc noqueue state > DOWN group default > link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff > inet 172.17.0.1/16 scope global docker0 >valid_lft forever preferred_lft forever > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (AURORA-1788) vagrant up does not properly configure network adapters
Andrew Jorgensen created AURORA-1788: Summary: vagrant up does not properly configure network adapters Key: AURORA-1788 URL: https://issues.apache.org/jira/browse/AURORA-1788 Project: Aurora Issue Type: Story Reporter: Andrew Jorgensen I am not sure of the specifics of why this happens but on vagrant 1.8.6 the network interface does not come up correctly and the private_network is attached to the eth0 nat interface rather than the host-only interface. I tried a number of different parameters but none of them were able to configure the network appropriately. This change manually configures the static ip so that it is connected to the correct adapter. Without this change I could not access the aurora web interface when running vagrant up. I've created a patch here: https://reviews.apache.org/r/52609/ This is what the configuration looks like when run off master: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state DOWN group default qlen 1000 link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} here is what it is supposed to look like: {code} ip addr 1: lo: mtu 65536 qdisc noqueue state UNKNOWN group default link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00 inet 127.0.0.1/8 scope host lo valid_lft forever preferred_lft forever inet6 ::1/128 scope host valid_lft forever preferred_lft forever 2: eth0: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:b3:1b:30 brd ff:ff:ff:ff:ff:ff inet 10.0.2.15/24 brd 10.0.2.255 scope global eth0 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:feb3:1b30/64 scope link valid_lft forever preferred_lft forever 3: eth1: mtu 1500 qdisc pfifo_fast state UP group default qlen 1000 link/ether 08:00:27:7c:4e:72 brd ff:ff:ff:ff:ff:ff inet 192.168.33.7/24 brd 192.168.33.255 scope global eth1 valid_lft forever preferred_lft forever inet6 fe80::a00:27ff:fe7c:4e72/64 scope link valid_lft forever preferred_lft forever 4: docker0: mtu 1500 qdisc noqueue state DOWN group default link/ether 02:42:f6:de:a3:ca brd ff:ff:ff:ff:ff:ff inet 172.17.0.1/16 scope global docker0 valid_lft forever preferred_lft forever {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)