Re: [Gluster-infra] Possible Gluster outage? (was Re: [Gluster-devel] tests/bitrot/bug-1373520.t is failing multiple times)

2017-02-01 Thread Pranith Kumar Karampuri
It just didn't work for an hour or so. I didn't look into what exactly was
the problem. The page was not even loading.

On Mon, Jan 30, 2017 at 7:48 PM, Nigel Babu  wrote:

> Hi Pranith,
>
> What problems did you see?
> * Did you have an Apache error?
> * Did you have a possible DNS issue?
>
> On Sat, Jan 28, 2017 at 3:56 PM, Pranith Kumar Karampuri <
> pkara...@redhat.com> wrote:
>
>> It is a bug in EC name heal code path. I sent a fix but
>> review.gluster.org is not accessible now to paste the link here. Will
>> send a mail again once it is accessible.
>>
>
> --
> nigelb
>



-- 
Pranith
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] [Bug 1416953] Request gerrit integration for gluster/ test github repository

2017-02-01 Thread bugzilla
https://bugzilla.redhat.com/show_bug.cgi?id=1416953



--- Comment #2 from Nigel Babu  ---
Okay so replication is setup and working. I've also cleared the cache, so now
it should mostly work. I'm fixing up apache issues now

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug 
https://bugzilla.redhat.com/token.cgi?t=sJ3VZcVaOt&a=cc_unsubscribe
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] [Bug 1416953] Request gerrit integration for gluster/ test github repository

2017-02-01 Thread bugzilla
https://bugzilla.redhat.com/show_bug.cgi?id=1416953



--- Comment #3 from Nigel Babu  ---
And Apache issues are now fixed up.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug 
https://bugzilla.redhat.com/token.cgi?t=CqrllqYWI3&a=cc_unsubscribe
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


Re: [Gluster-infra] Possible Gluster outage? (was Re: [Gluster-devel] tests/bitrot/bug-1373520.t is failing multiple times)

2017-02-01 Thread Michael Scherer
Le mercredi 01 février 2017 à 14:11 +0530, Pranith Kumar Karampuri a
écrit :
> It just didn't work for an hour or so. I didn't look into what exactly was
> the problem. The page was not even loading.

I heard of issue on another server in the same location from people in
india. What internet provider were you using ?

What time did it start and stop ?

Can you do a traceroute next time ?

> On Mon, Jan 30, 2017 at 7:48 PM, Nigel Babu  wrote:
> 
> > Hi Pranith,
> >
> > What problems did you see?
> > * Did you have an Apache error?
> > * Did you have a possible DNS issue?
> >
> > On Sat, Jan 28, 2017 at 3:56 PM, Pranith Kumar Karampuri <
> > pkara...@redhat.com> wrote:
> >
> >> It is a bug in EC name heal code path. I sent a fix but
> >> review.gluster.org is not accessible now to paste the link here. Will
> >> send a mail again once it is accessible.
> >>
> >
> > --
> > nigelb
> >
> 
> 
> 
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://lists.gluster.org/mailman/listinfo/gluster-infra

-- 
Michael Scherer
Sysadmin, Community Infrastructure and Platform, OSAS




signature.asc
Description: This is a digitally signed message part
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] [Bug 1418369] New: Need way to revert commits or mark tests bad without testing

2017-02-01 Thread bugzilla
https://bugzilla.redhat.com/show_bug.cgi?id=1418369

Bug ID: 1418369
   Summary: Need way to revert commits or mark tests bad without
testing
   Product: GlusterFS
   Version: mainline
 Component: project-infrastructure
  Assignee: b...@gluster.org
  Reporter: jda...@redhat.com
CC: b...@gluster.org, gluster-infra@gluster.org



As discussed in today's community meeting, we need a way to "dig ourselves out
of the hole" quickly when work is blocked by sporadically failing tests. 
Specifically, there should be a way to exempt patches from testing which (a)
revert a previous commit or (b) mark a test as bad.  Gerrit has permissions
that allow specified users to bypass tests by pushing to refs/heads/xxx instead
of refs/for/xxx.  This permission should be enabled for project architects (and
probably not for others).  If we can further extend or limit this ability by
recognizing reverts and test markings for what they are, that's great but not
immediately necessary.

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug 
https://bugzilla.redhat.com/token.cgi?t=Pegl66jblI&a=cc_unsubscribe
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] [Bug 1418542] New: Multiple smoke failures observed since yesterday

2017-02-01 Thread bugzilla
https://bugzilla.redhat.com/show_bug.cgi?id=1418542

Bug ID: 1418542
   Summary: Multiple smoke failures observed since yesterday
   Product: GlusterFS
   Version: mainline
 Component: project-infrastructure
  Assignee: b...@gluster.org
  Reporter: amukh...@redhat.com
CC: b...@gluster.org, gluster-infra@gluster.org



Description of problem:

https://build.gluster.org/job/devrpm-el7/3055/console

07:13:33 Triggered by Gerrit: https://review.gluster.org/16502
07:13:33 Building remotely on slave25.cloud.gluster.org (smoke_tests
rackspace_regression_2gb glusterfs-devrpms) in workspace
/home/jenkins/root/workspace/devrpm-el7
07:13:33 Wiping out workspace first.
07:13:34 java.io.IOException: remote file operation failed:
/home/jenkins/root/workspace/devrpm-el7 at
hudson.remoting.Channel@148e532:slave25.cloud.gluster.org:
java.nio.file.AccessDeniedException:
/home/jenkins/root/workspace/devrpm-el7/RPMS/el7/x86_64/build.log
07:13:34 at hudson.FilePath.act(FilePath.java:986)
07:13:34 at hudson.FilePath.act(FilePath.java:968)
07:13:34 at hudson.FilePath.deleteContents(FilePath.java:1183)
07:13:34 at
hudson.plugins.git.extensions.impl.WipeWorkspace.beforeCheckout(WipeWorkspace.java:28)
07:13:34 at hudson.plugins.git.GitSCM.checkout(GitSCM.java:1094)
07:13:34 at hudson.scm.SCM.checkout(SCM.java:485)
07:13:34 at
hudson.model.AbstractProject.checkout(AbstractProject.java:1269)
07:13:34 at
hudson.model.AbstractBuild$AbstractBuildExecution.defaultCheckout(AbstractBuild.java:607)
07:13:34 at
jenkins.scm.SCMCheckoutStrategy.checkout(SCMCheckoutStrategy.java:86)
07:13:34 at
hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:529)
07:13:34 at hudson.model.Run.execute(Run.java:1738)
07:13:34 at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
07:13:34 at
hudson.model.ResourceController.execute(ResourceController.java:98)
07:13:34 at hudson.model.Executor.run(Executor.java:410)
07:13:34 Caused by: java.nio.file.AccessDeniedException:
/home/jenkins/root/workspace/devrpm-el7/RPMS/el7/x86_64/build.log
07:13:34 at
sun.nio.fs.UnixException.translateToIOException(UnixException.java:84)
07:13:34 at
sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:102)
07:13:34 at
sun.nio.fs.UnixException.rethrowAsIOException(UnixException.java:107)
07:13:34 at
sun.nio.fs.UnixFileSystemProvider.implDelete(UnixFileSystemProvider.java:244)
07:13:34 at
sun.nio.fs.AbstractFileSystemProvider.delete(AbstractFileSystemProvider.java:103)
07:13:34 at java.nio.file.Files.delete(Files.java:1079)
07:13:34 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
07:13:34 at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
07:13:34 at java.lang.reflect.Method.invoke(Method.java:606)
07:13:34 at hudson.Util.deleteFile(Util.java:255)
07:13:34 at hudson.FilePath.deleteRecursive(FilePath.java:1203)
07:13:34 at hudson.FilePath.deleteContentsRecursive(FilePath.java:1212)
07:13:34 at hudson.FilePath.deleteRecursive(FilePath.java:1194)
07:13:34 at hudson.FilePath.deleteContentsRecursive(FilePath.java:1212)
07:13:34 at hudson.FilePath.deleteRecursive(FilePath.java:1194)
07:13:34 at hudson.FilePath.deleteContentsRecursive(FilePath.java:1212)
07:13:34 at hudson.FilePath.deleteRecursive(FilePath.java:1194)
07:13:34 at hudson.FilePath.deleteContentsRecursive(FilePath.java:1212)
07:13:34 at hudson.FilePath.access$1100(FilePath.java:190)
07:13:34 at hudson.FilePath$15.invoke(FilePath.java:1186)
07:13:34 at hudson.FilePath$15.invoke(FilePath.java:1183)
07:13:34 at hudson.FilePath$FileCallableWrapper.call(FilePath.java:2719)
07:13:34 at hudson.remoting.UserRequest.perform(UserRequest.java:120)
07:13:34 at hudson.remoting.UserRequest.perform(UserRequest.java:48)
07:13:34 at hudson.remoting.Request$2.run(Request.java:332)
07:13:34 at
hudson.remoting.InterceptingExecutorService$1.call(InterceptingExecutorService.java:68)
07:13:34 at java.util.concurrent.FutureTask.run(FutureTask.java:262)
07:13:34 at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
07:13:34 at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
07:13:34 at java.lang.Thread.run(Thread.java:745)
07:13:34 at ..remote call to slave25.cloud.gluster.org(Native Method)
07:13:34 at
hudson.remoting.Channel.attachCallSiteStackTrace(Channel.java:1416)
07:13:34 at hudson.remoting.UserResponse.retrieve(UserRequest.java:220)
07:13:34 at hudson.remoting.Channel.call(Channel.java:781)
07:13:34 at hudson.FilePath.act(FilePath.java:979)
07:13:34 ... 13 more
07:13:34 Archiving artifacts
07:13:37 Finished: FAILURE

Version-Release number of selected component (if applicable):


How reproducible:


Steps to Reproduce:
1

[Gluster-infra] Postmortem for RPM build failures

2017-02-01 Thread Nigel Babu
Hello,

Some of you may have noticed that we've been particularly plagued with RPM
build failures for the past couple of days. The failure looks like a Java error
from Jenkins[1]. I first noticed the error on Tuesday, 31st January. I narrowed
down the issue with a problem with permissions. Jenkins cleans the workspace at
the start of every run and Jenkins was unable to delete the RPM folder for some
reason. My best guess is that something changed in the rpmbuild or mock package
in the last couple of days, which lead to this. We have not deployed any change
from the infra side which should have caused this.

I immediately ran an Ansible job that deleted all the RPM workspaces so that
fresh ones would work and I added this line at the end of every Jenkins job:

sudo chown -R jenkins:jenkins ${WORKSPACE}/RPMS

This seemed to work for the moment and I'd moved onto other things. Yesterday,
we saw quite a resurgence of the errors. Atin filed a bug this morning and I've
figured out what I did wrong. The permission change runs only when the job is
successful. Not exactly my best idea :)

I've now fixed it up by making it a post-job script[2] that runs the same 
command
irrespective of whether the job failed or worked. That should fix up any
problems in the future. Apologies for the inconvenience. As always, please file
a bug when you notice any unexpected Jenkins behavior.

[1]: https://build.gluster.org/job/strfmt_errors/2789/console
[2]: 
http://git.gluster.org/cgit/build-jobs.git/commit/?id=3b77dbdac288bf21f802be41019e2cd4d4dc3e3c

--
nigelb


signature.asc
Description: PGP signature
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] [Bug 1418542] Multiple smoke failures observed since yesterday

2017-02-01 Thread bugzilla
https://bugzilla.redhat.com/show_bug.cgi?id=1418542

Nigel Babu  changed:

   What|Removed |Added

 Status|NEW |CLOSED
 CC||nig...@redhat.com
 Resolution|--- |CURRENTRELEASE
   Assignee|b...@gluster.org|nig...@redhat.com
Last Closed||2017-02-02 02:35:04



--- Comment #1 from Nigel Babu  ---
Fixed. Post-mortem here:
http://lists.gluster.org/pipermail/gluster-infra/2017-February/003153.html

-- 
You are receiving this mail because:
You are on the CC list for the bug.
Unsubscribe from this bug 
https://bugzilla.redhat.com/token.cgi?t=zdTeTbaWX9&a=cc_unsubscribe
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra