[ https://ovirt-jira.atlassian.net/browse/OVIRT-1840?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=35637#comment-35637 ]
Evgheni Dereveanchin edited comment on OVIRT-1840 at 1/11/18 8:51 AM:
----------------------------------------------------------------------

Looking at the slave, here's the stuck part:

`-bash -c cd "/home/jenkins" && java -jar slave.jar
    `-java -jar slave.jar
        |-bash -ex /tmp/jenkins6289914333579712645.sh
        |   `-bash -ex /tmp/jenkins6289914333579712645.sh
        |       |-grep -oP .+?(?=:exported-artifacts)
        |       `-sudo -n docker images --format={{.Repository}}:{{.Tag}}
        |           `-docker-current images --format={{.Repository}}:{{.Tag}}
        |               `-5*[{docker-current}]

At the same time, I see the following in pstree output of the same node:

-sudo -n docker images --format={{.Repository}}:{{.Tag}}
    `-docker-current images --format={{.Repository}}:{{.Tag}}
        `-5*[{docker-current}]
-sudo systemctl start docker
    `-systemctl start docker
-sudo -n /bin/yum install -y docker
    `-yum /bin/yum install -y docker
        `-sh /var/tmp/rpm-tmp.gO7ceb 1
            `-systemctl try-restart docker.service
-sh -c DEAD=`docker ps -aq -f status=dead` && [ -n "$DEAD" ] && docker rm $DEAD; exit 0
    `-sh -c DEAD=`docker ps -aq -f status=dead` && [ -n "$DEAD" ] && docker rm $DEAD; exit 0
        `-docker-current ps -aq -f status=dead
            `-6*[{docker-current}]

All of these commands are stuck. They come from various stages of the job, even though docker wasn't used anywhere in it.
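
One possible mitigation, sketched here only as an illustration and not as what the job scripts currently do, is to bound each docker CLI call with a timeout so that an unresponsive daemon fails the step quickly instead of hanging it for hours. The helper name run_with_timeout, the list_docker_images wrapper and the 60-second limit are assumptions made for this example; the docker command line is the one visible in the pstree output above.

```python
import subprocess
from typing import List, Optional, Sequence

# Command copied from the pstree output in this comment.
DOCKER_IMAGES_CMD = ["sudo", "-n", "docker", "images",
                     "--format={{.Repository}}:{{.Tag}}"]


def run_with_timeout(cmd: Sequence[str],
                     timeout_s: float) -> Optional[subprocess.CompletedProcess]:
    """Run cmd and return its result, or None if it ran longer than timeout_s.

    subprocess.run() kills the direct child when the timeout expires.
    """
    try:
        return subprocess.run(cmd, stdout=subprocess.PIPE,
                              stderr=subprocess.PIPE, text=True,
                              timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return None


def list_docker_images(timeout_s: float = 60.0) -> Optional[List[str]]:
    """Hypothetical wrapper: return image names, or None if docker hung or failed."""
    result = run_with_timeout(DOCKER_IMAGES_CMD, timeout_s)
    if result is None or result.returncode != 0:
        return None
    return result.stdout.split()
```

A cleanup step could call list_docker_images() and skip the docker part when it returns None. Two caveats apply: killing sudo does not necessarily terminate the docker-current client running under it, and a process blocked in uninterruptible sleep cannot be killed at all, so this limits how long the job waits but does not fix whatever is wedging docker.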

> jobs freeze due to unresponsive docker
> --------------------------------------
>
>                 Key: OVIRT-1840
>                 URL: https://ovirt-jira.atlassian.net/browse/OVIRT-1840
>             Project: oVirt - virtualization made easy
>          Issue Type: Task
>            Reporter: Evgheni Dereveanchin
>            Assignee: infra
>
> Quite often I see jobs stuck at various stages for hours, and the hangs seem
> related to docker.
> Example:
> http://jenkins.ovirt.org/job/ovirt-engine_master_build-artifacts-fc26-x86_64/610/console
> There are multiple docker commands stuck on the slave (will post in the next
> comment), so it seems to be deadlocked. Opening this ticket to investigate which
> step exactly is causing this and possible ways of resolving it. The job in
> question doesn't even use docker, so it shouldn't suffer if this happens.

-- This message was sent by Atlassian Jira (v1001.0.0-SNAPSHOT#100075)
_______________________________________________
Infra mailing list
Infra@ovirt.org
http://lists.ovirt.org/mailman/listinfo/infra