Re: [Gluster-infra] New workflow proposal for glusterfs repo

2019-06-12 Thread Raghavendra Talur
On Wed, Jun 12, 2019, 1:56 PM Atin Mukherjee  wrote:

>
>
> On Wed, 12 Jun 2019 at 18:04, Amar Tumballi Suryanarayan <
> atumb...@redhat.com> wrote:
>
>>
>> Few bullet points:
>>
>> * Let smoke job sequentially for below, and if successful, in parallel
>> for others.
>>   - Sequential:
>>   -- clang-format check
>>   -- compare-bugzilla-version-git-branch
>>   -- bugzilla-post
>>   -- comment-on-issue
>>   -- fedora-smoke (mainly don't want warning).
>>
>
> +1
>
>   - Parallel
>>-- all devrpm jobs
>>-- 32bit smoke
>>-- freebsd-smoke
>>-- smoke
>>-- strfmt_errors
>>-- python-lint, and shellcheck.
>>
>
> I’m sure there must be a reason but would like to know that why do they
> need to be parallel? Can’t we have them sequentially to have similar
> benefits of the resource utilisation like above? Or are all these
> individual jobs are time consuming such that having them sequentially will
> lead the overall smoke job to consume much longer?
>
>
>> * Remove Verified flag. No point in one more extra button which users
>> need to click, anyways CentOS regression is considered as 'Verification'.
>>
>
The requirement of verified flag by patch owner for regression to run was
added because the number of Jenkins machines we had were few and patches
being uploaded were many.


>> * In a normal flow, let CentOS regression which is running after
>> 'Verified' vote, be triggered on first 'successful' +1 reviewed vote.
>>
>
> As I believe some reviewers/maintainers (including me) would like to see
> the regression vote to put a +1/+2 in most of the patches until and unless
> they are straight forward ones. So although with this you’re reducing the
> burden of one extra click from the patch owner, but on the other way you’re
> introducing the same burden on the reviewers who would like to check the
> regression vote. IMHO, I don’t see much benefits in implementing this.
>

Agree with Atin here. Burden should be on machines before people. Reviewers
prefer to look at patches that have passed regression.

In github heketi, we have configured regression to run on all patches that
are submitted by heketi developer group. If such configuration is possible
in gerrit+Jenkins, we should definitely do it that way.

For patches that are submitted by someone outside of the developer group, a
maintainer should verify that the patch doesn't do anything harmful and
mark the regression to run.

Talur



>
>> * For those patches which got pushed to system to just 'validate'
>> behavior, to run sample tests, WIP patches, continue to support 'recheck
>> centos'  comment message, so we can run without any vote. Let it not be the
>> norm.
>>
>>
>> With this, I see that we can reduce smoke failures utilize 90% less
>> resources for a patch which would fail smoke anyways. (ie, 95% of the smoke
>> failures would be caught in first 10% of the resource, and time).
>>
>> Also we can reduce number of regression running, as review is mandatory
>> to run regression.
>>
>> These are just suggestions, happy to discuss more on these.
>>
>> -Amar
>>
>>
>>
>> ___
>> Gluster-infra mailing list
>> Gluster-infra@gluster.org
>> https://lists.gluster.org/mailman/listinfo/gluster-infra
>
> --
> - Atin (atinm)
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-infra
___
Gluster-infra mailing list
Gluster-infra@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Where to request for adding versions in Bugzilla

2017-05-14 Thread Raghavendra Talur
On Mon, May 15, 2017 at 5:49 AM, Vijay Bellur  wrote:
>
>
> On Sun, May 14, 2017 at 8:04 PM, Vijay Bellur  wrote:
>>
>> On 05/14/2017 04:02 PM, Raghavendra Talur wrote:
>>>
>>> Hi All,
>>>
>>> I need 3.10.1 and 3.10.2 versions added to Glusterfs product. How do I do
>>> it?
>>>
>>
>> You would need to file a request with Red Hat's bugzilla admin. I will
>> mail details about the process offline.
>>
>
> I have forwarded you details. However I remember a discussion on not adding
> bz versions for minor updates in 3.8.x. The expectation was that users would
> provide details about the exact version in the bug description. Do we want
> to follow a similar process for 3.10?

Thanks Vijay!
May be this is why Shyam hasn't created 3.10.1 version already? If so,
I will not send this request.

>
> Thanks,
> Vijay
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] Where to request for adding versions in Bugzilla

2017-05-14 Thread Raghavendra Talur
Hi All,

I need 3.10.1 and 3.10.2 versions added to Glusterfs product. How do I do it?

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


Re: [Gluster-infra] Is anyone else having trouble authenticating with review.gluster.org over ssh?

2017-04-16 Thread Raghavendra Talur
On Sun, Apr 16, 2017 at 9:07 PM, Raghavendra Talur  wrote:
> I am not able to login even after specifying the key file
>
> $ ssh -T -vvv -i ~/.ssh/gluster raghavendra-ta...@git.gluster.org
> OpenSSH_7.4p1, OpenSSL 1.0.2k-fips  26 Jan 2017
> debug1: Reading configuration data /home/rtalur/.ssh/config
> debug1: Reading configuration data /etc/ssh/ssh_config
> debug3: /etc/ssh/ssh_config line 56: Including file
> /etc/ssh/ssh_config.d/05-redhat.conf depth 0
> debug1: Reading configuration data /etc/ssh/ssh_config.d/05-redhat.conf
> debug1: /etc/ssh/ssh_config.d/05-redhat.conf line 2: include
> /etc/crypto-policies/back-ends/openssh.txt matched no files
> debug1: /etc/ssh/ssh_config.d/05-redhat.conf line 8: Applying options for *
> debug1: auto-mux: Trying existing master
> debug1: Control socket
> "/tmp/ssh_mux_git.gluster.org_22_raghavendra-talur" does not exist
> debug2: resolving "git.gluster.org" port 22
> debug2: ssh_connect_direct: needpriv 0
> debug1: Connecting to git.gluster.org [8.43.85.171] port 22.
> debug1: Connection established.
> debug1: identity file /home/rtalur/.ssh/gluster type 1
> debug1: key_load_public: No such file or directory
> debug1: identity file /home/rtalur/.ssh/gluster-cert type -1
> debug1: Enabling compatibility mode for protocol 2.0
> debug1: Local version string SSH-2.0-OpenSSH_7.4
> ssh_exchange_identification: Connection closed by remote host

Confirmed with Pranith that he is facing same issue.

Nigel/Misc,
Please have a look.


>
> Thanks,
> Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] Is anyone else having trouble authenticating with review.gluster.org over ssh?

2017-04-16 Thread Raghavendra Talur
I am not able to login even after specifying the key file

$ ssh -T -vvv -i ~/.ssh/gluster raghavendra-ta...@git.gluster.org
OpenSSH_7.4p1, OpenSSL 1.0.2k-fips  26 Jan 2017
debug1: Reading configuration data /home/rtalur/.ssh/config
debug1: Reading configuration data /etc/ssh/ssh_config
debug3: /etc/ssh/ssh_config line 56: Including file
/etc/ssh/ssh_config.d/05-redhat.conf depth 0
debug1: Reading configuration data /etc/ssh/ssh_config.d/05-redhat.conf
debug1: /etc/ssh/ssh_config.d/05-redhat.conf line 2: include
/etc/crypto-policies/back-ends/openssh.txt matched no files
debug1: /etc/ssh/ssh_config.d/05-redhat.conf line 8: Applying options for *
debug1: auto-mux: Trying existing master
debug1: Control socket
"/tmp/ssh_mux_git.gluster.org_22_raghavendra-talur" does not exist
debug2: resolving "git.gluster.org" port 22
debug2: ssh_connect_direct: needpriv 0
debug1: Connecting to git.gluster.org [8.43.85.171] port 22.
debug1: Connection established.
debug1: identity file /home/rtalur/.ssh/gluster type 1
debug1: key_load_public: No such file or directory
debug1: identity file /home/rtalur/.ssh/gluster-cert type -1
debug1: Enabling compatibility mode for protocol 2.0
debug1: Local version string SSH-2.0-OpenSSH_7.4
ssh_exchange_identification: Connection closed by remote host

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] what does G_BROKEN_FILENAMES=1 in console output mean?

2017-04-03 Thread Raghavendra Talur
I am seeing a new env var in console output[1] of jenkins,
G_BROKEN_FILENAMES, what does this mean?


[1] https://build.gluster.org/job/centos6-regression/3819/consoleFull


Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] Request to provide PASS flags to a patch in gerrit

2016-08-31 Thread Raghavendra Talur
Hi All,

We have a test [1] which is causing hangs in NetBSD. We have not been able
to debug the issue yet.
It could be because the bash script does not comply with posix guidelines
or that there is a bug in the brick code.

However, as we have 3.9 merge deadline tomorrow this is causing the test
pipeline to grow a lot and needing manual intervention.
I recommend we disable this test for now. I request Kaushal to provide pass
flags to the patch [2] for faster merge.


[1] ./tests/features/lock_revocation.t
[2] http://review.gluster.org/#/c/15374/


Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] tests/basic/op_errnos.t failure

2016-07-12 Thread Raghavendra Talur
Nigel/Misc,

Could you please look into this?
slave29 does not seem to have a xfs formatted backend for tests.

Thanks,
Raghavendra Talur

On Tue, Jul 12, 2016 at 6:41 PM, Avra Sengupta  wrote:

> Atin,
>
> I am not sure about the docker containers, but both the failures you
> mentioned are in slave29, which as Talur explained is missing the
> appropriate backend filesystem. Owing to this, op-errno.t is just the tip
> of the iceberg, and every other test that uses lvm will fail in this
> particular slave will fail too.
>
> Talur,
> Thanks for looking into it. It is indeed strange this. I checked the dmesg
> and the /var/log/messages in this slave and I couldn't find any relevant
> log.
>
>
> On 07/12/2016 05:29 PM, Raghavendra Talur wrote:
>
> I checked the machine.
>
> Here is the df -hT output
> [jenkins@slave29 ~]$ cat /etc/fstab
> # Accessible filesystems, by reference, are maintained under '/dev/disk'
> # See man pages fstab(5), findfs(8), mount(8) and/or blkid(8) for more info
> #
> /dev/xvda1  /   ext3
>  defaults,noatime,barrier=0 1 1
> tmpfs   /dev/shmtmpfs   defaults0 0
> devpts  /dev/ptsdevpts  gid=5,mode=620  0 0
> sysfs   /syssysfs   defaults0 0
> proc/proc   procdefaults0 0
> #/dev/xvdc1 noneswapsw  0 0
>
>
> We don't see a xfs device mounted at /d and / is of type ext3 which does
> not support fallocate. The uptime of the machine is 73 days though. I don't
> know how the /d xfs partition vanished.
>
> On Tue, Jul 12, 2016 at 4:54 PM, Atin Mukherjee < 
> amukh...@redhat.com> wrote:
>
>>
>> https://build.gluster.org/job/rackspace-regression-2GB-triggered/22156/consoleFull
>> - another failure
>>
>> On Tue, Jul 12, 2016 at 4:42 PM, Atin Mukherjee < 
>> amukh...@redhat.com> wrote:
>>
>>>
>>>
>>> On Tue, Jul 12, 2016 at 4:36 PM, Avra Sengupta < 
>>> aseng...@redhat.com> wrote:
>>>
>>>> Hi Atin,
>>>>
>>>> Please check the testcase result in the console. It clearly states the
>>>> reason of the failure. A quick search of 30815, as shown in the testcase
>>>> shows that the error that is generated is a thinp issue, and we can see
>>>> fallocate failing and lvm not properly being setup in the environment.
>>>>
>>>
>>> While this is valid for my docker containers, I am just wondering why
>>> did this happen in jenkins slave?
>>>
>>>
>>>> Regards,
>>>> Avra
>>>>
>>>> P.S Here are the logs from the console stating so.
>>>>
>>>> *02:50:34* [09:50:34] Running tests in file 
>>>> ./tests/basic/op_errnos.t*02:50:41* fallocate: 
>>>> /d/backends/patchy_snap_vhd: fallocate failed: Operation not 
>>>> supported*02:50:41* losetup: /d/backends/patchy_snap_vhd: warning: file 
>>>> smaller than 512 bytes, the loop device maybe be useless or invisible for 
>>>> system tools.*02:50:41*   Device /d/backends/patchy_snap_loop not found 
>>>> (or ignored by filtering).*02:50:41*   Device /d/backends/patchy_snap_loop 
>>>> not found (or ignored by filtering).*02:50:41*   Unable to add physical 
>>>> volume '/d/backends/patchy_snap_loop' to volume group 
>>>> 'patchy_snap_vg_1'.*02:50:41*   Volume group "patchy_snap_vg_1" not 
>>>> found*02:50:41*   Cannot process volume group patchy_snap_vg_1*02:50:42*   
>>>> Volume group "patchy_snap_vg_1" not found*02:50:42*   Cannot process 
>>>> volume group patchy_snap_vg_1*02:50:42* /dev/patchy_snap_vg_1/brick_lvm: 
>>>> No such file or directory*02:50:42* Usage: mkfs.xfs*02:50:42* /* blocksize 
>>>> */  [-b log=n|size=num]*02:50:42* /* data subvol */ [-d 
>>>> agcount=n,agsize=n,file,name=xxx,size=num,*02:50:42*   
>>>>  (sunit=value,swidth=value|su=num,sw=num),*02:50:42*   
>>>>   sectlog=n|sectsize=num*02:50:42* /* inode size */   [-i 
>>>> log=n|perblock=n|size=num,maxpct=n,attr=0|1|2,*02:50:42*   
>>>>  projid32bit=0|1]*02:50:42* /* log subvol */ [-l 
>>>> agnum=n,internal,size=num,logdev=xxx,version=n*02:50:42*   
>>>>  sunit=value|su=num,sectlog=n|sectsize=num,*02:50:42*  
>>>>   

Re: [Gluster-infra] NetBSD machine required to debug a core

2016-05-12 Thread Raghavendra Talur
On Thu, May 12, 2016 at 7:48 PM, Kaushal M  wrote:

> On Thu, May 12, 2016 at 6:02 PM, Raghavendra Talur 
> wrote:
> > slave26.cloud.gluster.org is taken down for you.
> > please update here when done.
>
> Isn't this a CentOS machine?
>

I blame it on lack of caffeine and bad headache :-/
I have brought back slave26.cloud.gluster.org and taken down
nbslave7j.cloud.gluster.org for Ravi.


> >
> > Thanks,
> > Raghavendra Talur
> >
> >
> > On Wed, May 11, 2016 at 2:49 PM, Ravishankar N 
> > wrote:
> >>
> >> Hello,
> >>
> >> I would need a NetBSD machine to debug a crash
> >>
> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/16749/consoleFull
> >>
> >> The test that is failing is a .t that I have written as a part of the
> >> patch against which the regression was triggered.
> >>
> >> Thanks,
> >>
> >> Ravi
> >>
> >> ___
> >> Gluster-infra mailing list
> >> Gluster-infra@gluster.org
> >> http://www.gluster.org/mailman/listinfo/gluster-infra
> >
> >
> >
> > ___
> > Gluster-infra mailing list
> > Gluster-infra@gluster.org
> > http://www.gluster.org/mailman/listinfo/gluster-infra
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] NetBSD machine required to debug a core

2016-05-12 Thread Raghavendra Talur
slave26.cloud.gluster.org is taken down for you.
please update here when done.

Thanks,
Raghavendra Talur

On Wed, May 11, 2016 at 2:49 PM, Ravishankar N 
wrote:

> Hello,
>
> I would need a NetBSD machine to debug a crash
> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/16749/consoleFull
>
> The test that is failing is a .t that I have written as a part of the
> patch against which the regression was triggered.
>
> Thanks,
>
> Ravi
>
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] LVM operations on some slaves taking a lot of time

2016-04-22 Thread Raghavendra Talur
I saw snapshot tests taking a lot of time of slave22.cloud.gluster.org and
on further investigation I found that it was lvm and its commands that were
taking time.

Link:
https://build.gluster.org/job/rackspace-regression-2GB-triggered/19891/consoleFull

Ignore if this is a one off case. If this repeats we need to see what is
wrong with the slave.

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Request to add more maintainers for DISTAF

2016-04-11 Thread Raghavendra Talur
Hi All,

We have now merged the first patch for Distaf libs and Distaf tests in
Gluster git repo.[1]
M S Vishwanath already has merge rights on gerrit for Gluster repo. I would
like to propose Shwetha and Jonathan as additional maintainers for Distaf
related code. If no one has any objections I request infra team to add them
to maintainers group.

[1]: http://review.gluster.org/#/c/13853/

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] nbslave7g.cloud.gluster.org assigned to anoop c s for debugging

2016-03-29 Thread Raghavendra Talur

___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] Gerrit trigger connection to jenkins not working

2016-03-15 Thread Raghavendra Talur
Thanks Kaushal!

On Wed, Mar 16, 2016 at 11:54 AM, Kaushal M  wrote:

> Fixed now. Check my reply to Kaleb's earlier mail for the details.
>
> On Wed, Mar 16, 2016 at 11:07 AM, Raghavendra Talur 
> wrote:
> > Root cause: I see the following error when I do "test connection"
> > Connection error : com.jcraft.jsch.JSchException:
> > java.net.NoRouteToHostException: No route to host
> >
> >
> > On Wed, Mar 16, 2016 at 10:57 AM, Raghavendra Talur 
> > wrote:
> >>
> >> Hi All,
> >>
> >> I see that jenkins isn't getting any triggers from gerrit. This includes
> >> patch set update trigger, so basically no patch is automatically
> tested. I
> >> am looking into it. Will update in this thread.
> >>
> >> For anything critical and urgent, please mail in gluster-devel asking
> >> jenkins maintainers to trigger build from you.
> >>
> >> Thanks,
> >> Raghavendra Talur
> >
> >
> >
> > ___
> > Gluster-devel mailing list
> > gluster-de...@gluster.org
> > http://www.gluster.org/mailman/listinfo/gluster-devel
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Gerrit trigger connection to jenkins not working

2016-03-15 Thread Raghavendra Talur
Root cause: I see the following error when I do "test connection"
Connection error : com.jcraft.jsch.JSchException:
java.net.NoRouteToHostException: No route to host


On Wed, Mar 16, 2016 at 10:57 AM, Raghavendra Talur 
wrote:

> Hi All,
>
> I see that jenkins isn't getting any triggers from gerrit. This includes
> patch set update trigger, so basically no patch is automatically tested. I
> am looking into it. Will update in this thread.
>
> For anything critical and urgent, please mail in gluster-devel asking
> jenkins maintainers to trigger build from you.
>
> Thanks,
> Raghavendra Talur
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Gerrit trigger connection to jenkins not working

2016-03-15 Thread Raghavendra Talur
Hi All,

I see that jenkins isn't getting any triggers from gerrit. This includes
patch set update trigger, so basically no patch is automatically tested. I
am looking into it. Will update in this thread.

For anything critical and urgent, please mail in gluster-devel asking
jenkins maintainers to trigger build from you.

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Code-Review+2 and Verified+1 cause multiple retriggers on Jenkins

2016-03-06 Thread Raghavendra Talur
On Fri, Mar 4, 2016 at 6:13 PM, Raghavendra Talur  wrote:

>
>
> On Thu, Feb 4, 2016 at 7:13 PM, Niels de Vos  wrote:
>
>> On Thu, Feb 04, 2016 at 04:15:16PM +0530, Raghavendra Talur wrote:
>> > On Thu, Feb 4, 2016 at 4:13 PM, Niels de Vos  wrote:
>> >
>> > > On Thu, Feb 04, 2016 at 03:34:05PM +0530, Raghavendra Talur wrote:
>> > > > Hi,
>> > > >
>> > > > We recently changed the jenkins builds to be triggered on the
>> following
>> > > > triggers.
>> > > >
>> > > > 1. Verified+1
>> > > > 2. Code-review+2
>> > > > 3. recheck (netbsd|centos|smoke)
>> > > >
>> > > > There is a bug in 1 and 2.
>> > > >
>> > > > Multiple triggers of 1 or 2 would result in re-runs even when not
>> > > intended.
>> > > >
>> > > > I would like to replace 1 and 2 with a comment "run-all-regression"
>> or
>> > > > something like that.
>> > > > Thoughts?
>> > >
>> > > Maybe starting regressions on Code-Review +1 (or +2) only?
>> > >
>> >
>> > Multiple code-reviews would do multiple triggers. Won't work.
>>
>> How can we make this to work, without the need of providing magic
>> comments?
>>
>
> I investigated but couldn't find a way to make it work. Discussed with
> Kaushal and we feel it should be ok to go with a "check all" comment for
> initial regression run and deprecate Code-Review+2 and Verified+1 triggers.
>
> I would like to go ahead and do it as the build queue is increasing again
> just because of Code-Review+2's given just before a patch is merged; they
> don't serve any purpose.
>

I have for now just removed trigger for code-review.
Trigger for verified+1 remains as is.
No new trigger on comments have been added.


>> Niels
>>
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Code-Review+2 and Verified+1 cause multiple retriggers on Jenkins

2016-03-04 Thread Raghavendra Talur
On Thu, Feb 4, 2016 at 7:13 PM, Niels de Vos  wrote:

> On Thu, Feb 04, 2016 at 04:15:16PM +0530, Raghavendra Talur wrote:
> > On Thu, Feb 4, 2016 at 4:13 PM, Niels de Vos  wrote:
> >
> > > On Thu, Feb 04, 2016 at 03:34:05PM +0530, Raghavendra Talur wrote:
> > > > Hi,
> > > >
> > > > We recently changed the jenkins builds to be triggered on the
> following
> > > > triggers.
> > > >
> > > > 1. Verified+1
> > > > 2. Code-review+2
> > > > 3. recheck (netbsd|centos|smoke)
> > > >
> > > > There is a bug in 1 and 2.
> > > >
> > > > Multiple triggers of 1 or 2 would result in re-runs even when not
> > > intended.
> > > >
> > > > I would like to replace 1 and 2 with a comment "run-all-regression"
> or
> > > > something like that.
> > > > Thoughts?
> > >
> > > Maybe starting regressions on Code-Review +1 (or +2) only?
> > >
> >
> > Multiple code-reviews would do multiple triggers. Won't work.
>
> How can we make this to work, without the need of providing magic
> comments?
>

I investigated but couldn't find a way to make it work. Discussed with
Kaushal and we feel it should be ok to go with a "check all" comment for
initial regression run and deprecate Code-Review+2 and Verified+1 triggers.

I would like to go ahead and do it as the build queue is increasing again
just because of Code-Review+2's given just before a patch is merged; they
don't serve any purpose.


>
> Niels
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] How to use jenkins cli for editing job

2016-03-04 Thread Raghavendra Talur
Hi,

I changed the worker thread count for gerrit-trigger plugin send worker to
3 from 1.
This was because there were warning in jenkins home page saying the queue
has 2400 events and suggested to increase worker threads.

As Niels had suggested before, I tried performing the same update using the
cli.

I was not able to use it though, kept getting "resource has moved" error.
Could someone else try too and also write a simple doc on how to use the
cli tool?

Thanks,
Raghavendra
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] review.gluster.org not working

2016-02-27 Thread Raghavendra Talur
On Feb 27, 2016 9:04 PM, "Amye Scavarda"  wrote:
>
> On Fri, Feb 26, 2016 at 9:24 PM, Raghavendra Talur 
wrote:
> >
> > On Feb 26, 2016 11:23 PM, "Raghavendra Talur"  wrote:
> >>
> >>
> >>
> >> On Fri, Feb 26, 2016 at 6:29 PM, Kaleb Keithley 
> >> wrote:
> >>>
> >>>
> >>>
> >>> - Original Message -
> >>> > From: "Raghavendra Talur" 
> >>> >
> >>> > Most likely it looks like disk is full. Someone please have a look
at
> >>> > it.
> >>> >
> >>>
> >>> Someone who? Misc is on holiday IIRC. Who else has access?
> >>
> >>
> >> I think it is only Vijay and Kaushal other than misc who have access.
> >
> > I spoke to Kaushal over phone. He does not have access either. Adding
misc
> > hoping to catch his attention.
> >
> >>
> >>>
> >>>
> >>> Why do we think running our own gerrit and jenkins is a good thing?
(When
> >>> we could be using the CentOS gerrit and jenkins?)
> >>
> >>
> >> That discussion did happen in Devconf and will be discussed in
> >> gluster-devel is what I understand. We should wait for people to put
across
> >> pros and cons for that.
> >> Personally,
> >> 1. I would want infra which has a large enough team to have a 24/7
> >> monitoring.(As ndevos suggested, if we form a bigger group we can do
so with
> >> our current setup.)
> >> 2. Have control over gerrit settings so that we can decide what do we
mean
> >> by trivial patches, and what patches carry over review comments etc.
> >>
> >>
> >>>
> >>>
> >>> (Yes, I am feeling snarky.)
> >>
> >>
> >> Understandable, given how easily our infra breaks. We who are in
> >> gluster-infra group have to come up with a solution too.
> >>
> >>>
> >>> --
> >>>
> >>> Kaleb
> >>> ___
> >>> Gluster-infra mailing list
> >>> Gluster-infra@gluster.org
> >>> http://www.gluster.org/mailman/listinfo/gluster-infra
> >>
> >>
> >
> >
> > ___
> > Gluster-infra mailing list
> > Gluster-infra@gluster.org
> > http://www.gluster.org/mailman/listinfo/gluster-infra
>
> Misc is indeed out of the office, but he will return on Monday.
>
> In the meantime, can we put this on the list of documented
> infrastructure pieces that more than one person needs access to - so
> that we can start forming that effective infra team? Part of the
> challenge is that the responsibilities haven't been clear, so getting
> people to volunteer to assist is not helpful.
>
> The conversation about CentOS's CI/CD is still ongoing, and in no way
> do I want to dissuade Kaleb's snarky yet helpful suggestions. Keep 'em
> coming.

+1 to everything.

Update : the site is working now. Not sure about who fixed it or what fixed
it.

> - amye
>
> --
> Amye Scavarda | a...@redhat.com | Gluster Community Lead
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] review.gluster.org not working

2016-02-26 Thread Raghavendra Talur
On Feb 26, 2016 11:23 PM, "Raghavendra Talur"  wrote:
>
>
>
> On Fri, Feb 26, 2016 at 6:29 PM, Kaleb Keithley 
wrote:
>>
>>
>>
>> - Original Message -
>> > From: "Raghavendra Talur" 
>> >
>> > Most likely it looks like disk is full. Someone please have a look at
it.
>> >
>>
>> Someone who? Misc is on holiday IIRC. Who else has access?
>
>
> I think it is only Vijay and Kaushal other than misc who have access.

I spoke to Kaushal over phone. He does not have access either. Adding misc
hoping to catch his attention.

>
>>
>>
>> Why do we think running our own gerrit and jenkins is a good thing?
(When we could be using the CentOS gerrit and jenkins?)
>
>
> That discussion did happen in Devconf and will be discussed in
gluster-devel is what I understand. We should wait for people to put across
pros and cons for that.
> Personally,
> 1. I would want infra which has a large enough team to have a 24/7
monitoring.(As ndevos suggested, if we form a bigger group we can do so
with our current setup.)
> 2. Have control over gerrit settings so that we can decide what do we
mean by trivial patches, and what patches carry over review comments etc.
>
>
>>
>>
>> (Yes, I am feeling snarky.)
>
>
> Understandable, given how easily our infra breaks. We who are in
gluster-infra group have to come up with a solution too.
>
>>
>> --
>>
>> Kaleb
>> ___
>> Gluster-infra mailing list
>> Gluster-infra@gluster.org
>> http://www.gluster.org/mailman/listinfo/gluster-infra
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] review.gluster.org not working

2016-02-26 Thread Raghavendra Talur
On Fri, Feb 26, 2016 at 6:29 PM, Kaleb Keithley  wrote:

>
>
> - Original Message -
> > From: "Raghavendra Talur" 
> >
> > Most likely it looks like disk is full. Someone please have a look at it.
> >
>
> Someone who? Misc is on holiday IIRC. Who else has access?
>

I think it is only Vijay and Kaushal other than misc who have access.


>
> Why do we think running our own gerrit and jenkins is a good thing? (When
> we could be using the CentOS gerrit and jenkins?)
>

That discussion did happen in Devconf and will be discussed in
gluster-devel is what I understand. We should wait for people to put across
pros and cons for that.
Personally,
1. I would want infra which has a large enough team to have a 24/7
monitoring.(As ndevos suggested, if we form a bigger group we can do so
with our current setup.)
2. Have control over gerrit settings so that we can decide what do we mean
by trivial patches, and what patches carry over review comments etc.



>
> (Yes, I am feeling snarky.)
>

Understandable, given how easily our infra breaks. We who are in
gluster-infra group have to come up with a solution too.


> --
>
> Kaleb
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] review.gluster.org not working

2016-02-26 Thread Raghavendra Talur
Most likely it looks like disk is full. Someone please have a look at it.
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Restricted trigger server for centos and netbsd regression to review.gluster.org

2016-02-07 Thread Raghavendra Talur
Hi,

After misc fixed ssl and cert issues on gerrit, jenkins now connects to
review.gluster.org using both the configurations.
1. old one which has name review.gluster.org
2. new one which has review.gluster.org_for_smoke_builds

2 was configured by ndevos for CI work I guess.

CentOS and NetBSD regression jobs were configured to take events from any
server and this resulted in duplicate/concurrent runs for each event from
gerrit.

I have changed the event trigger on these jobs to use only
review.gluster.org. Terminated some of the duplicate builds on jenkins for
the same reason.

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Gerrit trigger isn't working

2016-02-07 Thread Raghavendra Talur
Hi All,

The last triggered builds in jenkins show the following time for me
Feb 6, 2016 1:24 PM

Current time as per jenkins is
Feb 7, 2016 12:45:50 AM

I am guessing some change happened about 12 hours ago which caused gerrit
trigger to stop working.

I have started a few jobs using "build with parameters" option and it is
working fine.


Could someone have a look..


Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Gerrit is not responding

2016-02-05 Thread Raghavendra Talur
$subject, can someone have a look at it?
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Code-Review+2 and Verified+1 cause multiple retriggers on Jenkins

2016-02-04 Thread Raghavendra Talur
On Thu, Feb 4, 2016 at 4:13 PM, Niels de Vos  wrote:

> On Thu, Feb 04, 2016 at 03:34:05PM +0530, Raghavendra Talur wrote:
> > Hi,
> >
> > We recently changed the jenkins builds to be triggered on the following
> > triggers.
> >
> > 1. Verified+1
> > 2. Code-review+2
> > 3. recheck (netbsd|centos|smoke)
> >
> > There is a bug in 1 and 2.
> >
> > Multiple triggers of 1 or 2 would result in re-runs even when not
> intended.
> >
> > I would like to replace 1 and 2 with a comment "run-all-regression" or
> > something like that.
> > Thoughts?
>
> Maybe starting regressions on Code-Review +1 (or +2) only?
>

Multiple code-reviews would do multiple triggers. Won't work.


>
> Niels
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Code-Review+2 and Verified+1 cause multiple retriggers on Jenkins

2016-02-04 Thread Raghavendra Talur
Hi,

We recently changed the jenkins builds to be triggered on the following
triggers.

1. Verified+1
2. Code-review+2
3. recheck (netbsd|centos|smoke)

There is a bug in 1 and 2.

Multiple triggers of 1 or 2 would result in re-runs even when not intended.

I would like to replace 1 and 2 with a comment "run-all-regression" or
something like that.
Thoughts?


Thanks
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Different version of run-tests.sh in jenkin slaves?

2016-01-27 Thread Raghavendra Talur
On Thu, Jan 28, 2016 at 12:10 PM, Atin Mukherjee 
wrote:

>
>
> On 01/28/2016 12:00 PM, Raghavendra Talur wrote:
> > Ok, RCA:
> >
> > In NetBSD cores are being generated in /d/backends/*/*.core
> > run-tests.sh looks only for "/core*" when looking for cores.
> >
> > So, at the end of test run when regression.sh looks for core everywhere,
> > it finds one and errors out.
> So does that mean we never analyzed any core reported by NetBSD
> regression failure? That's strange.
>

I wonder when these changes were done.

Manu,

Where do I find config in NetBSD which decides which location to dump core
in?
Any particular reason you added /d/backends/*/*.core to list of path to
search for core?

>
> > Should think of a solution which is generic. Will update.
> >
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Different version of run-tests.sh in jenkin slaves?

2016-01-27 Thread Raghavendra Talur
Ok, RCA:

In NetBSD cores are being generated in /d/backends/*/*.core
run-tests.sh looks only for "/core*" when looking for cores.

So, at the end of test run when regression.sh looks for core everywhere, it
finds one and errors out.

Should think of a solution which is generic. Will update.


On Thu, Jan 28, 2016 at 11:37 AM, Raghavendra Talur 
wrote:

>
>
> On Thu, Jan 28, 2016 at 11:17 AM, Atin Mukherjee 
> wrote:
>
>> Are we running a different version of run-tests.sh in jenkin slaves. The
>> reason of suspection is beacuse in last couple of runs [1] & [2] in
>> NetBSD I am seeing no failures apart from bad tests but the regression
>> voted failure and I can not make out any valid reason out of it.
>>
>> [1]
>>
>> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/13756/consoleFull
>> [2]
>>
>> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/13755/consoleFull
>
>
>
> I checked the slave machine now.
> regression.sh file is different but the run-tests.sh script is same.
>
> A wild guess here, is it possible that the core generation takes time and
> when we check for a core right after a test is run it is not present yet?
> Does anyone know how to work around that?
>
>
>>
>> ~Atin
>> ___
>> Gluster-infra mailing list
>> Gluster-infra@gluster.org
>> http://www.gluster.org/mailman/listinfo/gluster-infra
>>
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Different version of run-tests.sh in jenkin slaves?

2016-01-27 Thread Raghavendra Talur
On Thu, Jan 28, 2016 at 11:17 AM, Atin Mukherjee 
wrote:

> Are we running a different version of run-tests.sh in jenkin slaves. The
> reason of suspection is beacuse in last couple of runs [1] & [2] in
> NetBSD I am seeing no failures apart from bad tests but the regression
> voted failure and I can not make out any valid reason out of it.
>
> [1]
>
> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/13756/consoleFull
> [2]
>
> https://build.gluster.org/job/rackspace-netbsd7-regression-triggered/13755/consoleFull



I checked the slave machine now.
regression.sh file is different but the run-tests.sh script is same.

A wild guess here, is it possible that the core generation takes time and
when we check for a core right after a test is run it is not present yet?
Does anyone know how to work around that?


>
> ~Atin
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Netbsd regression failing after clean run

2016-01-26 Thread Raghavendra Talur
Hi all,

Over the weekend I have seen instances of tests (mine and ndevos) where ret
is equal to 1 from regression.sh.

I checked for core on the slave and could not find any. I wasn't able to
find the root cause.  Will debug more tomorrow.

Did any configuration change on netbsd machines?

Thanks
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Removed "clean workspace" step from NetBSD regression job

2016-01-25 Thread Raghavendra Talur
Hi,

I have removed "clean workspace" from NetBSD regression job too. Vijay had
earlier removed it from CentOS regression.
Cleaning the workspace resulted in deletion of complete git repo and
re-clone takes a lot of time. Gerrit trigger/Git plugin intelligently reset
to HEAD~n when moving to next job and hence save time.

Thanks
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Enabling comment based trigger on jenkins

2016-01-25 Thread Raghavendra Talur
Hi,

Prashanth informed about a simple way to let every contributor retrigger
the regression runs that openstack uses.
Contributor has to simply post "recheck no bug" as comment on the patch set.
Read more here:
http://zqfan.github.io/openstack/2014/01/03/what-should-i-do-when-jenkins-fails/

I tweaked it for gluster, enabled and tested it. Trigger comments are:

recheck netbsd
recheck smoke
recheck centos

If no one has any objections to this method(security implications
included), then I would like to announce it on gluster-devel.


Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] Smoke tests run on the builder in RH DC (at least)

2016-01-25 Thread Raghavendra Talur
On Mon, Jan 25, 2016 at 11:29 PM, Michael Scherer 
wrote:

> Hi,
>
> so today, after fixing one last config item, the smoke test jobs run
> fine on the Centos 6 builder in the RH DC, which build things as non
> root, then start the tests, then reboot the server.
>

Awesome!


>
> Now, I am looking at the fedora one, but once this one is good, I will
> likely reinstall a few builders as a test, and go on Centos 7 builder.
>

This is what I had to do to get Fedora working. Ansible lines are shown
where applicable.

1. change ownership for python site packages: difference is in version 2.7
when compared to 2.6 of CentOS
file: path=/usr/lib/python2.7/site-packages/gluster/ state=directory
owner=jenkins group=root

2. Had to give jenkins write permission on /usr/lib/systemd/system/
for installing glusterd service file.




> I was also planning to look at jenkins job builder for the jenkins, but
> no time yet. Will be after jenkins migration to a new host (which is
> still not planned, unlike gerrit where we should be attempting to find a
> time for that)
>
>
> --
> Michael Scherer
> Sysadmin, Community Infrastructure and Platform, OSAS
>
>
>
> ___
> Gluster-devel mailing list
> gluster-de...@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-devel
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Vagrant box upload: Was: [Change in glusterfs[master]: vagrant-test: Use pre-baked box for better perf]

2016-01-25 Thread Raghavendra Talur
On Fri, Jan 22, 2016 at 10:09 PM, Raghavendra Talur 
wrote:

>
>
> On Fri, Jan 22, 2016 at 9:03 PM, Niels de Vos  wrote:
>
>> On Fri, Jan 22, 2016 at 08:17:17PM +0530, Raghavendra Talur wrote:
>> > On Tue, Jan 19, 2016 at 2:54 PM, Niels de Vos 
>> wrote:
>> >
>> > > On Mon, Jan 18, 2016 at 10:23:00PM +0530, Raghavendra Talur wrote:
>> > > > Michael has a good point that we should host this(or updated and
>> better)
>> > > > vagrant box for Gluster development under Gluster entity. It could
>> be a
>> > > > Gluster account in https://atlas.hashicorp.com or could be hosted
>> > > somewhere
>> > > > on gluster.org.
>> > > >
>> > > > Thoughts?
>> > >
>> > > Maybe start with placing it on download.gluster.org?
>> > >
>> >
>> > I have created the catalog file and uploaded the box at atlas.
>> >
>> > To get it on download.gluster.org, here is what we should do:
>> >
>> > 1. mkdir -p pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/
>> > 2. download gluster-dev-fedora.json that is attached
>> > 3. mv
>> > gluster-dev-fedora.json pub/gluster/glusterfs/vagrant/gluster-dev-fedora
>> > 4. wget
>> >
>> https://atlas.hashicorp.com/raghavendra-talur/boxes/gluster-dev-fedora/versions/0.3/providers/libvirt.box
>> > 5. verify sha1sum libvirt.box is
>> bd0724d2e346832fd159f05e96aefbc88c4b222a
>> > 6. mv
>> > libvirt.box
>> pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/gluster-dev-fedora_0.3.box
>>
>> It is not clear what Fedora version this is. Should that not be part of
>> the name, or at least description in the .json? Because it is Fedora, it
>> will need relatively frequent updates, how are you planning to get those
>> published? Would it not be easier to use CentOS for this? After all, we
>> use CentOS on our Jenkins slaves too.
>>
>
> The version of fedora is omitted because I intend to upgrade the fedora
> version in the box. Fedora releases too often to make different box
> versions for each of them. For example this is 0.3 version. When I rebase
> to F24, it will be 0.4. The description is again for the main box and not
> the version number.
>
> I was not able to use CentOS because it does not have a trusted box
> published for CentOS 6[1]. I choose to use the trusted Fedora vagrant box
> [2] instead of CentOS 7 as most developers have Fedora as their dev machine.
> I have a patch in progress for vagrant-test where one could specify a
> different base box, so it should not be a problem if someone wants a CentOS
> 7 box.
>
>
> [1] https://wiki.centos.org/Download
> [2] https://atlas.hashicorp.com/fedora/boxes/23-cloud-base
>
>
Bump.
Let me know if you have any other suggestions.
I want to do a demo video of this sometime this week once the box is
uploaded.


>
>> > I request Michael to upload the box, once done, I will update the patch.
>> >
>> >
>> >
>> >
>> > > It would surely need some documentation on gluster.readthedocs.org
>> too?
>> > >
>> >
>> > I am working on the patch. It has info given in this[
>> > http://review.gluster.org/#/c/12753/] patch with more detail.
>>
>> Ok, thanks!
>> Niels
>>
>>
>> >
>> >
>> >
>> > > Thanks,
>> > > Niels
>> > >
>> > > >
>> > > > -- Forwarded message --
>> > > > From: Michael Adam (Code Review) 
>> > > > Date: Mon, Jan 18, 2016 at 4:15 PM
>> > > > Subject: Change in glusterfs[master]: vagrant-test: Use pre-baked
>> box for
>> > > > better perf
>> > > > To: Raghavendra Talur 
>> > > > Cc: Gluster Build System , Kaushal M <
>> > > > kaus...@redhat.com>, Niels de Vos , Michael
>> Adam <
>> > > > ob...@samba.org>
>> > > >
>> > > >
>> > > > Michael Adam has posted comments on this change.
>> > > >
>> > > > Change subject: vagrant-test: Use pre-baked box for better perf
>> > > >
>> ..
>> > > >
>> > > >
>> > > > Patch Set 1:
>> > > >
>> > > > (2 comments)
>> > > >
>> > > > some comments inline
>> > > >
>> > > >
>> > >
>> http://review.gluster.org/#/c/13251/1/tests/vagrant/vagrant-template/Vagrantfile
>> > > > File tests/vagrant/vagrant-template/Vagrantfile:
>> > > >
>> > > > Ideally, at some point, there should be a gluster entity under
>> atlast.
>> > > > Alternatively we could directly specify a download url for the box
>> onder
>> > > > gluster.org
>> > >
>> > > > ___
>> > > > Gluster-infra mailing list
>> > > > Gluster-infra@gluster.org
>> > > > http://www.gluster.org/mailman/listinfo/gluster-infra
>> > >
>> > >
>>
>>
>>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Vagrant box upload: Was: [Change in glusterfs[master]: vagrant-test: Use pre-baked box for better perf]

2016-01-22 Thread Raghavendra Talur
On Fri, Jan 22, 2016 at 9:03 PM, Niels de Vos  wrote:

> On Fri, Jan 22, 2016 at 08:17:17PM +0530, Raghavendra Talur wrote:
> > On Tue, Jan 19, 2016 at 2:54 PM, Niels de Vos  wrote:
> >
> > > On Mon, Jan 18, 2016 at 10:23:00PM +0530, Raghavendra Talur wrote:
> > > > Michael has a good point that we should host this(or updated and
> better)
> > > > vagrant box for Gluster development under Gluster entity. It could
> be a
> > > > Gluster account in https://atlas.hashicorp.com or could be hosted
> > > somewhere
> > > > on gluster.org.
> > > >
> > > > Thoughts?
> > >
> > > Maybe start with placing it on download.gluster.org?
> > >
> >
> > I have created the catalog file and uploaded the box at atlas.
> >
> > To get it on download.gluster.org, here is what we should do:
> >
> > 1. mkdir -p pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/
> > 2. download gluster-dev-fedora.json that is attached
> > 3. mv
> > gluster-dev-fedora.json pub/gluster/glusterfs/vagrant/gluster-dev-fedora
> > 4. wget
> >
> https://atlas.hashicorp.com/raghavendra-talur/boxes/gluster-dev-fedora/versions/0.3/providers/libvirt.box
> > 5. verify sha1sum libvirt.box is bd0724d2e346832fd159f05e96aefbc88c4b222a
> > 6. mv
> > libvirt.box
> pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/gluster-dev-fedora_0.3.box
>
> It is not clear what Fedora version this is. Should that not be part of
> the name, or at least description in the .json? Because it is Fedora, it
> will need relatively frequent updates, how are you planning to get those
> published? Would it not be easier to use CentOS for this? After all, we
> use CentOS on our Jenkins slaves too.
>

The version of fedora is omitted because I intend to upgrade the fedora
version in the box. Fedora releases too often to make different box
versions for each of them. For example this is 0.3 version. When I rebase
to F24, it will be 0.4. The description is again for the main box and not
the version number.

I was not able to use CentOS because it does not have a trusted box
published for CentOS 6[1]. I choose to use the trusted Fedora vagrant box
[2] instead of CentOS 7 as most developers have Fedora as their dev machine.
I have a patch in progress for vagrant-test where one could specify a
different base box, so it should not be a problem if someone wants a CentOS
7 box.


[1] https://wiki.centos.org/Download
[2] https://atlas.hashicorp.com/fedora/boxes/23-cloud-base


> > I request Michael to upload the box, once done, I will update the patch.
> >
> >
> >
> >
> > > It would surely need some documentation on gluster.readthedocs.org
> too?
> > >
> >
> > I am working on the patch. It has info given in this[
> > http://review.gluster.org/#/c/12753/] patch with more detail.
>
> Ok, thanks!
> Niels
>
>
> >
> >
> >
> > > Thanks,
> > > Niels
> > >
> > > >
> > > > -- Forwarded message --
> > > > From: Michael Adam (Code Review) 
> > > > Date: Mon, Jan 18, 2016 at 4:15 PM
> > > > Subject: Change in glusterfs[master]: vagrant-test: Use pre-baked
> box for
> > > > better perf
> > > > To: Raghavendra Talur 
> > > > Cc: Gluster Build System , Kaushal M <
> > > > kaus...@redhat.com>, Niels de Vos , Michael Adam
> <
> > > > ob...@samba.org>
> > > >
> > > >
> > > > Michael Adam has posted comments on this change.
> > > >
> > > > Change subject: vagrant-test: Use pre-baked box for better perf
> > > >
> ..
> > > >
> > > >
> > > > Patch Set 1:
> > > >
> > > > (2 comments)
> > > >
> > > > some comments inline
> > > >
> > > >
> > >
> http://review.gluster.org/#/c/13251/1/tests/vagrant/vagrant-template/Vagrantfile
> > > > File tests/vagrant/vagrant-template/Vagrantfile:
> > > >
> > > > Ideally, at some point, there should be a gluster entity under
> atlast.
> > > > Alternatively we could directly specify a download url for the box
> onder
> > > > gluster.org
> > >
> > > > ___
> > > > Gluster-infra mailing list
> > > > Gluster-infra@gluster.org
> > > > http://www.gluster.org/mailman/listinfo/gluster-infra
> > >
> > >
>
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Vagrant box upload: Was: [Change in glusterfs[master]: vagrant-test: Use pre-baked box for better perf]

2016-01-22 Thread Raghavendra Talur
On Tue, Jan 19, 2016 at 2:54 PM, Niels de Vos  wrote:

> On Mon, Jan 18, 2016 at 10:23:00PM +0530, Raghavendra Talur wrote:
> > Michael has a good point that we should host this(or updated and better)
> > vagrant box for Gluster development under Gluster entity. It could be a
> > Gluster account in https://atlas.hashicorp.com or could be hosted
> somewhere
> > on gluster.org.
> >
> > Thoughts?
>
> Maybe start with placing it on download.gluster.org?
>

I have created the catalog file and uploaded the box at atlas.

To get it on download.gluster.org, here is what we should do:

1. mkdir -p pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/
2. download gluster-dev-fedora.json that is attached
3. mv
gluster-dev-fedora.json pub/gluster/glusterfs/vagrant/gluster-dev-fedora
4. wget
https://atlas.hashicorp.com/raghavendra-talur/boxes/gluster-dev-fedora/versions/0.3/providers/libvirt.box
5. verify sha1sum libvirt.box is bd0724d2e346832fd159f05e96aefbc88c4b222a
6. mv
libvirt.box 
pub/gluster/glusterfs/vagrant/gluster-dev-fedora/boxes/gluster-dev-fedora_0.3.box

I request Michael to upload the box, once done, I will update the patch.




> It would surely need some documentation on gluster.readthedocs.org too?
>

I am working on the patch. It has info given in this[
http://review.gluster.org/#/c/12753/] patch with more detail.



> Thanks,
> Niels
>
> >
> > -- Forwarded message --
> > From: Michael Adam (Code Review) 
> > Date: Mon, Jan 18, 2016 at 4:15 PM
> > Subject: Change in glusterfs[master]: vagrant-test: Use pre-baked box for
> > better perf
> > To: Raghavendra Talur 
> > Cc: Gluster Build System , Kaushal M <
> > kaus...@redhat.com>, Niels de Vos , Michael Adam <
> > ob...@samba.org>
> >
> >
> > Michael Adam has posted comments on this change.
> >
> > Change subject: vagrant-test: Use pre-baked box for better perf
> > ..
> >
> >
> > Patch Set 1:
> >
> > (2 comments)
> >
> > some comments inline
> >
> >
> http://review.gluster.org/#/c/13251/1/tests/vagrant/vagrant-template/Vagrantfile
> > File tests/vagrant/vagrant-template/Vagrantfile:
> >
> > Ideally, at some point, there should be a gluster entity under atlast.
> > Alternatively we could directly specify a download url for the box onder
> > gluster.org
>
> > ___
> > Gluster-infra mailing list
> > Gluster-infra@gluster.org
> > http://www.gluster.org/mailman/listinfo/gluster-infra
>
>


gluster-dev-fedora.json
Description: application/json
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] Jenkins accounts for all devs.

2016-01-22 Thread Raghavendra Talur
On Fri, Jan 22, 2016 at 2:41 PM, Michael Scherer 
wrote:

> Le vendredi 22 janvier 2016 à 11:31 +0530, Ravishankar N a écrit :
> > On 01/14/2016 12:16 PM, Kaushal M wrote:
> > > On Thu, Jan 14, 2016 at 10:33 AM, Raghavendra Talur 
> wrote:
> > >>
> > >> On Thu, Jan 14, 2016 at 10:32 AM, Ravishankar N <
> ravishan...@redhat.com>
> > >> wrote:
> > >>> On 01/08/2016 12:03 PM, Raghavendra Talur wrote:
> > >>>> P.S: Stop using the "universal" jenkins account to trigger jenkins
> build
> > >>>> if you are not a maintainer.
> > >>>> If you are a maintainer and don't have your own jenkins account
> then get
> > >>>> one soon!
> > >>>>
> > >>> I would request for a jenkins account for non-maintainers too, at
> least
> > >>> for the devs who are actively contributing code (as opposed to random
> > >>> one-off commits from persons). That way, if the regression failure is
> > >>> *definitely* not in my patch (or) is a spurious failure (or) is
> something
> > >>> that I need to take a netbsd slave offline to debug etc.,  I don't
> have to
> > >>> be blocked on the Maintainer. Since the accounts are anyway tied to
> an
> > >>> individual, it should be easy to spot if someone habitually
> re-trigger
> > >>> regressions without any initial debugging.
> > >>>
> > >> +1
> > > We'd like to give everyone accounts. But the way we're providing
> > > accounts now gives admin accounts to all. This is not very secure.
> > >
> > > This was one of the reasons misc setup freeipa.gluster.org, to provide
> > > controlled accounts for all. But it hasn't been used yet. We would
> > > need to integrate jenkins and the slaves with freeipa, which would
> > > give everyone easy access.
> >
> > Hi Michael,
> > Do you think it is possible to have this integration soon so that all
> > contributors can re-trigger/initiate builds by themselves?
>
> The thing that is missing is still the same, how do we consider that
> someone is a contributor. IE, do we want people just say "add me" and
> get root access to all our jenkins builder (because that's also what go
> with jenkins way of restarting a build for now) ?
>
> I did the technical stuff, but so far, no one did the organisational
> part of giving a criteria for who has access to what. Without clear
> process, I can't do much.
>


+ndevos +vijay

Something like "should have contributed 10 patches to Gluster and be
supported by at least 1 maintainer" would do?



--
> Michael Scherer
> Sysadmin, Community Infrastructure and Platform, OSAS
>
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Linux regression tests are hanging too

2016-01-19 Thread Raghavendra Talur
On Tue, Jan 19, 2016 at 5:20 PM, Pranith Kumar Karampuri <
pkara...@redhat.com> wrote:

> Result: PASS
> Build timed out (after 300 minutes). Marking the build as failed.
> Build was aborted
> Finished: FAILURE
>
>
> https://build.gluster.org/job/rackspace-regression-2GB-triggered/17664/console
>
> Pranith
>

This is due to build timeout. I had set it to 300 minute (5 hours).
Recently though, some patch has caused a major performance regression.
Tests which used to pass in 120 seconds are taking more than 200.
This has increased test run time from 200 minutes to greater than 300.

I have increased timeout value to 500 minutes now and it should not get
aborted.

Someone really needs to look into the regression though. Pick any test from
the console output above and run a .t file that took over 20 ms.

Thanks,
Raghavendra
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Trouble adding a new Gerrit configuration to Jenkins for Smoke tests

2016-01-18 Thread Raghavendra Talur
On Sat, Jan 16, 2016 at 5:16 PM, Niels de Vos  wrote:

> Hi,
>
> I'm trying to add a copy of the existing Gerrit server to Jenkins, but
> seem to be unable to get it connected. I've started with [add server]
> and [copy from existing server] to create
> "review.gluster.org_for-smoke-jobs". The [Advanced] "Gerrit Verified
> Commands" have been modified to set the Smoke label instead of the
> default Verified.
>
> Clicking the [test connection] button return success (after removing the
> ***-out password). But after saving and clicking the red [o] button to
> connect the server, it just seems to time-out.
>
> Some assistence with getting this server connected is most welcome.
> Please see these links for checking:
>
>   https://build.gluster.org/gerrit-trigger/
>
> https://build.gluster.org/gerrit-trigger/server/review.gluster.org_for-smoke-jobs/
>
>
I tried but could not figure out the reason either. Is this still required?


> Thanks,
> Niels
>
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] Vagrant box upload: Was: [Change in glusterfs[master]: vagrant-test: Use pre-baked box for better perf]

2016-01-18 Thread Raghavendra Talur
Michael has a good point that we should host this(or updated and better)
vagrant box for Gluster development under Gluster entity. It could be a
Gluster account in https://atlas.hashicorp.com or could be hosted somewhere
on gluster.org.

Thoughts?

-- Forwarded message --
From: Michael Adam (Code Review) 
Date: Mon, Jan 18, 2016 at 4:15 PM
Subject: Change in glusterfs[master]: vagrant-test: Use pre-baked box for
better perf
To: Raghavendra Talur 
Cc: Gluster Build System , Kaushal M <
kaus...@redhat.com>, Niels de Vos , Michael Adam <
ob...@samba.org>


Michael Adam has posted comments on this change.

Change subject: vagrant-test: Use pre-baked box for better perf
..


Patch Set 1:

(2 comments)

some comments inline

http://review.gluster.org/#/c/13251/1/tests/vagrant/vagrant-template/Vagrantfile
File tests/vagrant/vagrant-template/Vagrantfile:

Ideally, at some point, there should be a gluster entity under atlast.
Alternatively we could directly specify a download url for the box onder
gluster.org
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Requesting separate labels in Gerrit for better testing results

2016-01-17 Thread Raghavendra Talur
On Fri, Jan 15, 2016 at 4:22 PM, Niels de Vos  wrote:

> On Thu, Jan 14, 2016 at 10:26:46PM +0530, Kaushal M wrote:
> > I'd pushed the config to a new branch instead of updating the
> > `refs/meta/config` branch. I've corrected this now.
> >
> > The 3 new labels are,
> > - Smoke
> > - CentOS-regression
> > - NetBSD-regression
> >
> > The new labels are active now. Changes cannot be merged without all of
> > them being +1. Only the bot accounts (Gluster Build System and NetBSD
> > Build System) can set them.
>

Thanks Kaushal !


>
> It seems that Verified is also a label that is required. Because this is
> now the label for manual testing by reviewers/qa, I do not think it
> should be a requirement anymore.
>
> Could the labels that are needed for merging be setup like this?
>
>   Code-Review=+2 && (Verified=+1 || (Smoke=+1 && CentOS-regression=+1 &&
> NetBSD-regression=+1))
>

I would prefer not having Verified=+1 here. A dev should not be allowed to
override the restrictions.


>
> I managed to get http://review.gluster.org/13208 merged now, please
> check if the added tags in the commit message are ok, or need to get
> modified.
>
> Thanks,
> Niels
>
>
> >
> > On Thu, Jan 14, 2016 at 9:22 PM, Kaushal M  wrote:
> > > On Thu, Jan 14, 2016 at 5:12 PM, Niels de Vos 
> wrote:
> > >> On Thu, Jan 14, 2016 at 03:46:02PM +0530, Kaushal M wrote:
> > >>> On Thu, Jan 14, 2016 at 2:43 PM, Niels de Vos 
> wrote:
> > >>> > On Thu, Jan 14, 2016 at 11:51:15AM +0530, Raghavendra Talur wrote:
> > >>> >> On Tue, Jan 12, 2016 at 7:59 PM, Atin Mukherjee <
> atin.mukherje...@gmail.com>
> > >>> >> wrote:
> > >>> >>
> > >>> >> > -Atin
> > >>> >> > Sent from one plus one
> > >>> >> > On Jan 12, 2016 7:41 PM, "Niels de Vos" 
> wrote:
> > >>> >> > >
> > >>> >> > > On Tue, Jan 12, 2016 at 07:21:37PM +0530, Raghavendra Talur
> wrote:
> > >>> >> > > > We have now changed the gerrit-jenkins workflow as follows:
> > >>> >> > > >
> > >>> >> > > > 1. Developer works on a new feature/bug fix and tests it
> locally(run
> > >>> >> > > > run-tests.sh completely).
> > >>> >> > > > 2. Developer sends the patch to gerrit using rfc.sh.
> > >>> >> > > >
> > >>> >> > > > +++Note that no regression runs have started automatically
> for this
> > >>> >> > patch
> > >>> >> > > > at this point.+++
> > >>> >> > > >
> > >>> >> > > > 3. Developer marks the patch as +1 verified on gerrit as a
> promise of
> > >>> >> > > > having tested the patch completely. For cases where patches
> don't have
> > >>> >> > a +1
> > >>> >> > > > verified from the developer, maintainer has the following
> options
> > >>> >> > > > a. just do the code-review and award a +2 code review.
> > >>> >> > > > b. pull the patch locally and test completely and award a
> +1 verified.
> > >>> >> > > > Both the above actions would result in triggering of
> regression runs
> > >>> >> > for
> > >>> >> > > > the patch.
> > >>> >> > >
> > >>> >> > > Would it not help if anyone giving +1 code-review starts the
> regression
> > >>> >> > > tests too? When developers ask me to review, I prefer to see
> reviews
> > >>> >> > > done by others first, and any regression failures should have
> been fixed
> > >>> >> > > by the time I look at the change.
> > >>> >> > When this idea was originated (long back) I was in favour of
> having
> > >>> >> > regression triggered on a +1, however verified flag set by the
> developer
> > >>> >> > would still trigger the regression. Being a maintainer I would
> always
> > >>> >> > prefer to look at a patch when its verified  flag is +1 which
> means the
> > >>> >> > regression result would also be available.
> > >>> >> >
> > >>> >>
> > 

Re: [Gluster-infra] Requesting separate labels in Gerrit for better testing results

2016-01-14 Thread Raghavendra Talur
On Thu, Jan 14, 2016 at 2:43 PM, Niels de Vos  wrote:

> On Thu, Jan 14, 2016 at 11:51:15AM +0530, Raghavendra Talur wrote:
> > On Tue, Jan 12, 2016 at 7:59 PM, Atin Mukherjee <
> atin.mukherje...@gmail.com>
> > wrote:
> >
> > > -Atin
> > > Sent from one plus one
> > > On Jan 12, 2016 7:41 PM, "Niels de Vos"  wrote:
> > > >
> > > > On Tue, Jan 12, 2016 at 07:21:37PM +0530, Raghavendra Talur wrote:
> > > > > We have now changed the gerrit-jenkins workflow as follows:
> > > > >
> > > > > 1. Developer works on a new feature/bug fix and tests it
> locally(run
> > > > > run-tests.sh completely).
> > > > > 2. Developer sends the patch to gerrit using rfc.sh.
> > > > >
> > > > > +++Note that no regression runs have started automatically for this
> > > patch
> > > > > at this point.+++
> > > > >
> > > > > 3. Developer marks the patch as +1 verified on gerrit as a promise
> of
> > > > > having tested the patch completely. For cases where patches don't
> have
> > > a +1
> > > > > verified from the developer, maintainer has the following options
> > > > > a. just do the code-review and award a +2 code review.
> > > > > b. pull the patch locally and test completely and award a +1
> verified.
> > > > > Both the above actions would result in triggering of regression
> runs
> > > for
> > > > > the patch.
> > > >
> > > > Would it not help if anyone giving +1 code-review starts the
> regression
> > > > tests too? When developers ask me to review, I prefer to see reviews
> > > > done by others first, and any regression failures should have been
> fixed
> > > > by the time I look at the change.
> > > When this idea was originated (long back) I was in favour of having
> > > regression triggered on a +1, however verified flag set by the
> developer
> > > would still trigger the regression. Being a maintainer I would always
> > > prefer to look at a patch when its verified  flag is +1 which means the
> > > regression result would also be available.
> > >
> >
> >
> > Niels requested in IRC that it is good have a mechanism of getting all
> > patches that have already passed all regressions before starting review.
> > Here is what I found
> > a. You can use the search string
> > status:open label:Verified+1,user=build AND
> label:Verified+1,user=nb7build
> > b. You can bookmark this link and it will take you directly to the page
> > with list of such patches.
> >
> >
> http://review.gluster.org/#/q/status:open+label:Verified%252B1%252Cuser%253Dbuild+AND+label:Verified%252B1%252Cuser%253Dnb7build
>
> Hmm, copy/pasting this URL does not work for me, I get an error:
>
> Code Review - Error
> line 1:26 no viable alternative at character '%'
> [Continue]
>

Firefox bug, Works for me in chrome
Here is the proof
https://code.google.com/p/gerrit/issues/detail?id=3641
https://code.google.com/p/gerrit/issues/detail?id=3582



>
>
> Kaushal, could you add the following labels to gerrit, so that we can
> update the Jenkins jobs and they can start setting their own labels?
>
> http://review.gluster.org/Documentation/config-labels.html#label_custom
>
> - Smoke: misc smoke testing, compile, bug check, posix, ..
> - NetBSD: NetBSD-7 regression
> - Linux: Linux regression on CentOS-6
>
> Users/developers should not be able to set these labels, only the
> Jenkins accounts are allowed to.
>
> The standard Verified label can then be used for manual verification by
> developers, qa and reviewers.
>
> Thanks,
> Niels
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Jenkins slave32 seems broken?

2016-01-13 Thread Raghavendra Talur
On Wed, Jan 13, 2016 at 3:49 PM, Niels de Vos  wrote:

> On Wed, Jan 13, 2016 at 10:35:42AM +0100, Xavier Hernandez wrote:
> > The same has happened to slave34.cloud.gluster.org. I've disabled it to
> > allow regressions to be run on other slaves.
> >
> > There are two files owned by root inside
> > /home/jenkins/root/workspace/rackspace-regression-2GB-triggered:
> >
> > -rwxr-xr-x  1 rootroot 10124 Jan  7 17:54 file_lock
> > drwxr-xr-x  3 rootroot  4096 Jan  7 18:31
> slave34.cloud.gluster.org:
>
> Thanks!
>
> I've looked into this a little more now, and might have identified the
> problem.
>
> This one failed with an unrelated error:
>
>
> https://build.gluster.org/job/rackspace-regression-2GB-triggered/17413/console
>
>   ...
>   Building remotely on slave34.cloud.gluster.org
> (rackspace_regression_2gb) in workspace
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered
>> git rev-parse --is-inside-work-tree # timeout=10
>   Fetching changes from the remote Git repository
>> git config remote.origin.url git://review.gluster.org/glusterfs.git
> # timeout=10
>   Fetching upstream changes from git://review.gluster.org/glusterfs.git
>   ...
>
> The next run on slave34 failed because of the weird directory:
>
>
> https://build.gluster.org/job/rackspace-regression-2GB-triggered/17440/console
>
>   ...
>   Building remotely on slave34.cloud.gluster.org
> (rackspace_regression_2gb) in workspace
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered
>   Wiping out workspace first.
>   java.io.IOException: remote file operation failed:
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered at
> hudson.remoting.Channel@62ecdacb:slave34.cloud.gluster.org:
>   ...
>
> Note the "Wiping out workspace first." line. This comes from an option
> in the regression job. This seems to be a recently added "Additional
> Behaviour" in the Jenkins job. Did anyone add this on purpose, or was
> that automatically done with a Jenkins update or something?
>

The three additional behaviour added in regression configuration were added
by me and I simply copied
whatever was there in smoke configuration page. We can try removing this
configuration line but the tests weren't getting started without this(we
got it running after a restart so this might be false requirement).

Technically, it is not a harmful configuration and wiki pages recommend it.
It says such errors occur only if files were created and left open/locked
by some tests or were created with different permissions. We still need to
identify what tests are responsible for this.


>
> Niels
>
> >
> > Xavi
> >
> > On 12/01/16 12:06, Niels de Vos wrote:
> > >Hi,
> > >
> > >I've disabled slave32.cloud.gluster.org because it failed multiple
> > >regression tests with a weird error. After disabling slave32 and
> > >retriggering the failed run, the same job executed fine on a different
> > >slave.
> > >
> > >The affected directory is owned by root, so the jenkins user is not
> > >allowed to wipe it. Does anyone know how this could happen? The dirname
> > >is rather awkward too...
> > >
> > >
>  
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered/slave32.cloud.gluster.org:
> /d
> > >
> > >I think we can just remove that dir and the slave can be enabled again.
> > >Leaving the status as is for further investigation.
> > >
> > >Thanks,
> > >Niels
> > >
> > >
> > >Full error:
> > >
> > > Wiping out workspace first.
> > > java.io.IOException: remote file operation failed:
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered at
> hudson.remoting.Channel@7bc1e07d:slave32.cloud.gluster.org:
> java.nio.file.AccessDeniedException:
> /home/jenkins/root/workspace/rackspace-regression-2GB-triggered/slave32.cloud.gluster.org:
> /d
> > > at hudson.FilePath.act(FilePath.java:986)
> > > at hudson.FilePath.act(FilePath.java:968)
> > > at hudson.FilePath.deleteContents(FilePath.java:1183)
> > > at
> hudson.plugins.git.extensions.impl.WipeWorkspace.beforeCheckout(WipeWorkspace.java:28)
> > > at hudson.plugins.git.GitSCM.checkout(GitSCM.java:1040)
> > > at hudson.scm.SCM.checkout(SCM.java:485)
> > > at
> hudson.model.AbstractProject.checkout(AbstractProject.java:1276)
> > > at
> hudson.model.AbstractBuild$AbstractBuildExecution.defaultCheckout(AbstractBuild.java:607)
> > > at
> jenkins.scm.SCMCheckoutStrategy.checkout(SCMCheckoutStrategy.java:86)
> > > at
> hudson.model.AbstractBuild$AbstractBuildExecution.run(AbstractBuild.java:529)
> > > at hudson.model.Run.execute(Run.java:1738)
> > > at hudson.model.FreeStyleBuild.run(FreeStyleBuild.java:43)
> > > at
> hudson.model.ResourceController.execute(ResourceController.java:98)
> > > at hudson.model.Executor.run(Executor.java:410)
> > > Caused by: java.nio.file.AccessDeniedException:
> /home/jenki

[Gluster-infra] Need help with gerrit trigger plugin

2016-01-11 Thread Raghavendra Talur
I am reconfiguring the events to trigger jenkins build on. The events are
not getting triggered though.

Checking the config page for trigger plugin I see this
[image: Inline image 1]

May be this is the reason for it, could someone check the ssh file and
password?

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] Timeout for jenkins builds

2016-01-07 Thread Raghavendra Talur
On Thu, Jan 7, 2016 at 4:26 PM, Niels de Vos  wrote:

> On Thu, Jan 07, 2016 at 03:32:57PM +0530, Raghavendra Talur wrote:
> > Hi,
> >
> > I have enabled timeouts for jenkins builds.
> > Regression on CentOS and NetBSD now both have a 300 minutes(5 hour)
> timeout
> > and tests will be marked "FAILED" when if it does not complete by then.
>
> Do you know who is investigating these hangs?
>

This is already done. Thanks to Manikandan and Raghavendra Gowdappa for
getting debug info and providing fix respectively.
Here is the fix http://review.gluster.org/#/c/13177/ and it is already
merged.
Rebase your patches to get them pass on NetBSD.



>
> > This is keeping in view the recent hangs which ran tests for more than
> > 10-14 hour.
> >
> > Also, the script used in NetBSD does not do any cleanup between runs like
> > the CentOS script does.
> > I did not modify it for now but we should. Let me know If someone wants
> to
> > take it up. I will have a go at it next week if there are no replies.
>
> It would be nice to have an extremely simple Jenkins job, that only
> executes a script that is located on disk. Could we get such a script in
> https://github.com/gluster/glusterfs-patch-acceptance-tests so that we
> can easily start running regression tests in the CentOS CI as well?
>

+10
Some questions:
1. Currently our images have all the packages baked in, what would be the
implication if the slaves are fresh CentOS images and need to download the
packages
as part of a ansible playbook?
2. Instead of regression.sh(in glusterfs-patch-acceptance-tests),
run-tests.sh(in glusterfs) is the place where most of the intelligence
should reside.
This would make creating jenkins job much simpler. Thoughts?


>
> I would also like to have the Jenkins jobs exported as XML and saved in
> the same git repository. That should make it much easier to import the
> jobs in other Jenkins environments. Exporting and importing can be done
> with the Jenkins CLI, see http://build.gluster.org/cli for details.
>

Awesome idea!!!



>
> Thanks,
> Niels
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

[Gluster-infra] nbslave71 needs to be re-imaged

2016-01-04 Thread Raghavendra Talur
Hi,

I had disabled nbslave71 slave long back in Sep due to scripts being
overwritten by log issues and had not followed it up.

This needs to be re-imaged, I guess misc or manu can help me with that?

Thanks,
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] Lot of Netbsd regressions 'Waiting for the next available executor'

2015-12-31 Thread Raghavendra Talur
You can log in. I think the HUP signal did not cause any
change in process state. I still see it in I state.
pid is 10967.

On Thu, Dec 31, 2015 at 3:26 PM, Emmanuel Dreyfus  wrote:

> On Thu, Dec 31, 2015 at 03:22:54PM +0530, Raghavendra Talur wrote:
> > Manu, this seems to be a bug in libperfuse and not in Gluster.
> > The machine is nbslave75.cloud.gluster.org. You will have to rerun
> > quota.t couple of times to hit the bug. The test would hang in line
> 62(TEST
> > 24).
>
> There is a jenkins job running on that machine. May I proceed? Where
> is the relevant test suite?
>
> A nice way of handling over the bug to someone else could be to run in
> the screen utility.
>
> --
> Emmanuel Dreyfus
> m...@netbsd.org
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] [Gluster-devel] Lot of Netbsd regressions 'Waiting for the next available executor'

2015-12-31 Thread Raghavendra Talur
Hi,

I got info from one of the slave machines,

bt from perfused process

#0  0xbb679a77 in _sys___kevent50 () from /usr/lib/libc.so.12
(gdb) info threads
  Id   Target Id Frame
* 1process 26515 0xbb679a77 in _sys___kevent50 () from
/usr/lib/libc.so.12


bt from mount process
(gdb) bt
#0  0xbb3bb4b7 in ___lwp_park60 () from /usr/lib/libc.so.12


Manu, this seems to be a bug in libperfuse and not in Gluster.
The machine is nbslave75.cloud.gluster.org. You will have to rerun
quota.t couple of times to hit the bug. The test would hang in line 62(TEST
24).


Thanks,
Raghavendra Talur






On Thu, Dec 24, 2015 at 3:32 PM, Emmanuel Dreyfus  wrote:

> On Thu, Dec 24, 2015 at 01:44:21AM -0500, Raghavendra Gowdappa wrote:
> > > Seems to be hung. May be a hung syscall? I've tried to kill it, but
> seems
> > > like its not dead. May be patch #12594 is causing some issues on
> netbsd. It
> > > has passed gluster regression.
>
> ps -axl shows PID 1394 (umount) waiting on tstile, which is used for
> spinlocks. No process should sit there for long, unless there is a
> kernel locking problem (which may be a userland locking problem
> thanks to FUSE).
>
> Using crash(8) I can see umount is awaiting for a vnode lock. There
> is certainly something to investigate but I lack time for now. I
> issued a reboot. Please tell me if you can reproduce it.
>
> --
> Emmanuel Dreyfus
> m...@netbsd.org
> ___
> Gluster-devel mailing list
> gluster-de...@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-devel
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra

Re: [Gluster-infra] slave21 and 23 offline due to disk full

2015-09-09 Thread Raghavendra Talur
On Wed, Sep 9, 2015 at 8:21 PM, Michael Scherer  wrote:

> Hi,
>
> just found out that slave21 and 23 were  offline due to their disk being
> full. The issue is 14G of log, due to something creating tarball
> of /var/log/glusterfs, and place the tarball in /var/log/glusterfs/
>
> Ndevos say the bug is fixed, but I would rather investigate in more
> details, someone has some information, pointer ?
>
> (slave26 and slave46 are however just without ssh, so going to reboot
> them)
>

I am the culprit.
This patch http://review.gluster.org/#/c/12109/ is the reason and it has
been taken care of.

The test ran on sept 5th. If the slave had a little space left and executed
next build then
the logs must have got cleared and it should be fine. Sorry for the
trouble, will be more careful
next time I am messing with test infra scripts.

A better fix is posted at http://review.gluster.org/#/c/12110/ and waiting
for reviews!


--
> Michael Scherer
> Sysadmin, Community Infrastructure and Platform, OSAS
>
>
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra


Re: [Gluster-infra] No linux slaves up in jenkins

2015-09-07 Thread Raghavendra Talur
On Sat, Sep 5, 2015 at 1:40 PM, Niels de Vos  wrote:

> On Sat, Sep 05, 2015 at 01:11:49PM +0530, Raghavendra Talur wrote:
> > I started the 10 iteration test today by this patch set
> > http://review.gluster.org/#/c/12109/
> >
> > However, it seems no slaves are online for linux. If someone gets to know
> > why please update here.
>
> The Linux slaves always disconnect their Jenkins client and get marked
> offline. When tests get started, the Jenkins master connects (ssh) to
> the slaves and starts the Jenkins client.
>
> Is this not happening? You can manually force connecting by clicking on
> the hostname of the slave, and then the [Launch slave agent] button.
>


I did not know about this.
Yes, the slaves did come up after certain time.
Thanks!


> HTH,
> Niels
>
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra


[Gluster-infra] No linux slaves up in jenkins

2015-09-05 Thread Raghavendra Talur
I started the 10 iteration test today by this patch set
http://review.gluster.org/#/c/12109/

However, it seems no slaves are online for linux. If someone gets to know
why please update here.


Thanks
Raghavendra Talur
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra


Re: [Gluster-infra] bluild.gluster.org self signed cert

2015-07-18 Thread Raghavendra Talur
On Sat, Jul 18, 2015 at 9:14 PM, Emmanuel Dreyfus  wrote:

> Hi
>
> build.gluster.org presented me a self-signed certificate. I accepted it,
> but please someone confirm it is on purpose.
>
> While there, startssl offers free certs...
>

Yes, that is intentional.
Check thread with "jenkins over https" as subject.

Latest update from misc is
"So I did ask for a new cert, waiting on digicert."


>
> --
> Emmanuel Dreyfus
> http://hcpnet.free.fr/pubz
> m...@netbsd.org
> ___
> Gluster-infra mailing list
> Gluster-infra@gluster.org
> http://www.gluster.org/mailman/listinfo/gluster-infra
>



-- 
*Raghavendra Talur *
___
Gluster-infra mailing list
Gluster-infra@gluster.org
http://www.gluster.org/mailman/listinfo/gluster-infra