Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-31 Thread Alan Orth
Thanks, Raghavendra. I won't be able to test readdir-ahead again because it
caused a lot of issues for my users and I don't have the resources to set
up a test environment right now. I hope someone can help figure this bug
out eventually though.

Cheers,

-- 

Alan Orth
alan.o...@gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch
___
Gluster-users mailing list
Gluster-users@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-users

Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-30 Thread Raghavendra Gowdappa


- Original Message -
> From: "Alan Orth" <alan.o...@gmail.com>
> To: "Raghavendra Gowdappa" <rgowd...@redhat.com>
> Cc: "gluster-users" <gluster-users@gluster.org>
> Sent: Tuesday, January 30, 2018 1:37:40 PM
> Subject: Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS  
> 3.12.4
> 
> Thank you, Raghavendra. I guess this cosmetic fix will be in 3.12.6?

Yes.

> 
> I'm also looking forward to seeing stability fixes to parallel-readdir and or
> readdir-ahead in 3.12.x. :)

I am not sure what caused the disconnects. readdir-ahead shouldn't result in
purging of connections. We've not seen this behavior in our internal tests (we
have done a significant amount of testing on this to ascertain the
performance impact). We need to investigate this. Can you file a bug on this,
providing the following information:
* Log files of the client and brick processes
* A test case to reproduce the issue
* Test environment (OS, volume profile info, etc.)
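
The items requested above map roughly onto the commands below. This is only a sketch under stated assumptions: the volume name `homes` comes from this thread, `bug_report_commands` is a made-up helper name, /var/log/glusterfs is assumed to be the log directory, and `gluster volume profile ... info` presumably only returns data if profiling was started on the volume beforehand. The function prints the commands for review instead of running them, since the affected system is in production.

```shell
# Print (do not run) commands that gather the information requested for the bug.
bug_report_commands() {
    volume="$1"
    cat <<EOF
gluster --version
uname -r
cat /etc/os-release
gluster volume info $volume
gluster volume profile $volume info
tar czf glusterfs-logs.tar.gz -C /var/log glusterfs
EOF
}

bug_report_commands homes
```

The tar line would have to be run on a client and on each brick server to capture both sets of logs.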

regards,
Raghavendra


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-30 Thread Alan Orth
Thank you, Raghavendra. I guess this cosmetic fix will be in 3.12.6?

I'm also looking forward to seeing stability fixes to parallel-readdir
and/or readdir-ahead in 3.12.x. :)

Cheers,

-- 

Alan Orth
alan.o...@gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch

Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-28 Thread Raghavendra Gowdappa


- Original Message -
> From: "Pranith Kumar Karampuri" <pkara...@redhat.com>
> To: "Alan Orth" <alan.o...@gmail.com>
> Cc: "gluster-users" <gluster-users@gluster.org>
> Sent: Saturday, January 27, 2018 7:31:30 AM
> Subject: Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS  
> 3.12.4
> 
> Adding devs who work on it
> 
> On 23 Jan 2018 10:40 pm, "Alan Orth" < alan.o...@gmail.com > wrote:
> 
> 
> 
> Hello,
> 
> I saw that parallel-readdir was an experimental feature in GlusterFS version
> 3.10.0, became stable in version 3.11.0, and is now recommended for small
> file workloads in the Red Hat Gluster Storage Server documentation[2]. I've
> successfully enabled this on one of my volumes but I notice the following in
> the client mount log:
> 
> [2018-01-23 10:24:24.048055] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
> 'parallel-readdir' is not recognized
> [2018-01-23 10:24:24.048072] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
> 'parallel-readdir' is not recognized
> 

This is fixed by patch:
https://review.gluster.org/#/c/18572/

Please note that the log message is harmless and parallel-readdir is enabled
even with this log. You can verify that by looking at the gluster volume
configuration logged in the glusterfs logs. If parallel-readdir is enabled, you
should see a readdir-ahead translator loaded as a parent of each DHT subvolume.
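
A minimal sketch of that check, assuming the client log contains the volume graph dumped in volfile syntax (lines of the form `volume <name>`) and that the per-subvolume translators are named `<volume>-readdir-ahead-<N>`, as the warnings quoted in this thread suggest. The helper name `list_rda_translators` and the log path in the usage comment are assumptions, not part of GlusterFS.

```shell
# List the per-subvolume readdir-ahead translators defined in a client log.
# $1 = path to the client mount log, $2 = volume name
list_rda_translators() {
    log_file="$1"
    volume="$2"
    grep -Eo "volume ${volume}-readdir-ahead-[0-9]+" "$log_file" \
        | awk '{ print $2 }' \
        | sort -u
}

# Usage (log path is an assumption for a volume mounted on /home):
#   list_rda_translators /var/log/glusterfs/home.log homes
```

If this prints one name per DHT subvolume, parallel-readdir is presumably in effect despite the warning.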

> The GlusterFS version on the client and server is 3.12.4. What is going on?
> 
> [0]
> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
> [1]
> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
> [2]
> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements
> 
> Thank you,


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-27 Thread Alan Orth
@Vlad, yes I kept group metadata-cache and I also have
network.inode-lru-limit 9, but nothing else performance related.

@Pranith great, thank you.

Cheers,


Alan Orth
alan.o...@gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch

Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-26 Thread Pranith Kumar Karampuri
Adding devs who work on it


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-26 Thread Vlad Kopylov
Thanks for the info, Alan.

You've kept group metadata-cache, right?

I have them enabled as well:
features.cache-invalidation=on
features.cache-invalidation-timeout=600
performance.stat-prefetch=on
performance.cache-invalidation=on
performance.md-cache-timeout=600
network.inode-lru-limit=5


but on the brick (where I suspect issues) only partially:
performance.stat-prefetch=on
performance.md-cache-timeout=600
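
For reference, `key=value` lines like the ones above can be turned into the equivalent `gluster volume set` commands. The sketch below only prints the commands; `options_to_commands` is a hypothetical helper and `myvol` is a placeholder volume name, not one taken from this thread.

```shell
# Read key=value lines on stdin and print the matching "gluster volume set" commands.
options_to_commands() {
    volume="$1"
    while IFS='=' read -r key value; do
        [ -n "$key" ] || continue   # skip blank lines
        printf 'gluster volume set %s %s %s\n' "$volume" "$key" "$value"
    done
}

options_to_commands myvol <<'EOF'
performance.stat-prefetch=on
performance.md-cache-timeout=600
EOF
```

Printing rather than executing keeps the step reviewable before anything changes on a live volume.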


will try without parallel-readdir and readdir-ahead

v


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-26 Thread Alan Orth
Dear Vlad,

I'm sorry, I don't want to test this again on my system just yet! It caused
too much instability for my users and I don't have enough resources for a
development environment. The only other variable that changed before the
crashes was the group metadata-cache[0], which I enabled the same day as
the parallel-readdir and readdir-ahead options:

$ gluster volume set homes group metadata-cache
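
To see what such a group setting left on the volume, the output of `gluster volume get homes all` can be filtered for the metadata-cache related options. The option names below are the ones Vlad lists elsewhere in this thread; that they are exactly what the group sets is an assumption, and `metadata_cache_options` is a hypothetical helper name.

```shell
# Filter "gluster volume get <volume> all" output (option name, then value)
# down to the metadata-cache related options, printed as key=value.
metadata_cache_options() {
    awk '
        BEGIN {
            n = split("features.cache-invalidation features.cache-invalidation-timeout performance.stat-prefetch performance.cache-invalidation performance.md-cache-timeout network.inode-lru-limit", names, " ")
            for (i = 1; i <= n; i++) wanted[names[i]] = 1
        }
        ($1 in wanted) { print $1 "=" $2 }
    '
}

# Usage:
#   gluster volume get homes all | metadata_cache_options
```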

I'm hoping Atin or Poornima can shed some light and squash this bug.

[0]
https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md

Regards,


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-25 Thread Vlad Kopylov
Can you please test whether it is parallel-readdir or readdir-ahead that
gives the disconnects, so we know which one to disable?

parallel-readdir does magic, going by this PDF from last year:
https://events.static.linuxfound.org/sites/events/files/slides/Gluster_DirPerf_Vault2017_0.pdf
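
A rough sketch of how such an isolation test could be staged, one phase at a time. It assumes the option keys `performance.readdir-ahead` and `performance.parallel-readdir`, and that parallel-readdir needs readdir-ahead switched on first (hence the ordering); `isolation_commands` and the phase names are made up for illustration. The commands are printed, not executed.

```shell
# Print the volume-set commands for one phase of the isolation test.
#   none      both options off (the known-stable baseline)
#   rda-only  readdir-ahead on, parallel-readdir off
#   both      both options on
isolation_commands() {
    volume="$1"
    phase="$2"
    case "$phase" in
        none)     rda=off; prd=off ;;
        rda-only) rda=on;  prd=off ;;
        both)     rda=on;  prd=on  ;;
        *) echo "unknown phase: $phase" >&2; return 1 ;;
    esac
    if [ "$prd" = on ]; then
        # Enable the prerequisite first, then parallel-readdir.
        printf 'gluster volume set %s performance.readdir-ahead on\n' "$volume"
        printf 'gluster volume set %s performance.parallel-readdir on\n' "$volume"
    else
        # Disable parallel-readdir before touching readdir-ahead.
        printf 'gluster volume set %s performance.parallel-readdir off\n' "$volume"
        printf 'gluster volume set %s performance.readdir-ahead %s\n' "$volume" "$rda"
    fi
}

isolation_commands homes rda-only
```

If the disconnects return in the `rda-only` phase, readdir-ahead alone is enough to trigger them; if they only return in `both`, parallel-readdir is the more likely culprit.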

-v

On Thu, Jan 25, 2018 at 8:20 AM, Alan Orth  wrote:
> By the way, on a slightly related note, I'm pretty sure either
> parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. We
> are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_64.
>
> I updated my servers and clients to 3.12.4 and enabled these two options
> after reading about them in the 3.10.0 and 3.11.0 release notes. In the days
> after enabling these two options all of my clients kept getting disconnected
> from the volume. The error upon attempting to list a directory or read a
> file was "Transport endpoint is not connected", after which I would force
> unmount the volume with `umount -fl /home` and remount it, only to have it
> get disconnected again a few hours later.
>
> Every time the volume disconnected I looked in the client mount log and only
> found information such as:
>
> [2018-01-24 05:52:27.695225] I [MSGID: 108026]
> [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1:
> Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168.
> sources=[0]  sinks=1
> [2018-01-24 05:52:27.700611] I [MSGID: 108026]
> [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do]
> 2-homes-replicate-1: performing metadata selfheal on
> b6a53629-a831-4ee3-a35e-f47c04297aaa
> [2018-01-24 05:52:27.703021] I [MSGID: 108026]
> [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1:
> Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa.
> sources=[0]  sinks=1
>
> I enabled debug logging for that volume's client mount with `gluster volume
> set homes diagnostics.client-log-level DEBUG` and then I saw this in the
> client mount log the next time it disconnected:
>
> [2018-01-24 08:55:19.138810] D [MSGID: 0] [io-threads.c:358:iot_schedule]
> 0-homes-io-threads: LOOKUP scheduled as fast fop
> [2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup]
> 0-homes-dht: Calling fresh lookup for
> /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on
> homes-readdir-ahead-1
> [2018-01-24 08:55:19.138928] D [MSGID: 0] [io-threads.c:358:iot_schedule]
> 0-homes-io-threads: FSTAT scheduled as fast fop
> [2018-01-24 08:55:19.138958] D [MSGID: 0] [afr-read-txn.c:220:afr_read_txn]
> 0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation now vs
> cached: 2, 2
> [2018-01-24 08:55:19.139187] D [MSGID: 0] [dht-common.c:2294:dht_lookup_cbk]
> 0-homes-dht: fresh_lookup returned for
> /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0
> [2018-01-24 08:55:19.139200] D [MSGID: 0]
> [dht-layout.c:873:dht_layout_preset] 0-homes-dht: file =
> ----, subvol = homes-readdir-ahead-1
> [2018-01-24 08:55:19.139257] D [MSGID: 0] [io-threads.c:358:iot_schedule]
> 0-homes-io-threads: READDIRP scheduled as fast fop
>
> On a hunch I disabled both parallel-readdir and readdir-ahead, which I had
> only enabled a few days before, and now all of the clients are much more
> stable, with zero disconnections in the days since I disabled those two
> volume options.
>
> Please take a look! Thanks,
>
> On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjee  wrote:
>>
>> Adding Poornima to take a look at it and comment.
>>
>> On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth  wrote:
>>>
>>> Hello,
>>>
>>> I saw that parallel-readdir was an experimental feature in GlusterFS
>>> version 3.10.0, became stable in version 3.11.0, and is now recommended for
>>> small file workloads in the Red Hat Gluster Storage Server documentation[2].
>>> I've successfully enabled this on one of my volumes but I notice the
>>> following in the client mount log:
>>>
>>> [2018-01-23 10:24:24.048055] W [MSGID: 101174]
>>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
>>> 'parallel-readdir' is not recognized
>>> [2018-01-23 10:24:24.048072] W [MSGID: 101174]
>>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
>>> 'parallel-readdir' is not recognized
>>>
>>> The GlusterFS version on the client and server is 3.12.4. What is going
>>> on?
>>>
>>> [0]
>>> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
>>> [1]
>>> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
>>> [2]
>>> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements
>>>
>>> Thank you,
>>>
>>>
>>> --
>>>
>>> Alan Orth
>>> alan.o...@gmail.com
>>> https://picturingjordan.com
>>> https://englishbulgaria.net
>>> https://mjanja.ch
>>>
>>>

Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-25 Thread Vlad Kopylov
Same here, even after updating to 3.12.5-2:
[2018-01-26 02:48:58.113996] W [MSGID: 101174]
[graph.c:363:_log_if_unknown_option] 0-q-readdir-ahead-0: option
'parallel-readdir' is not recognized

-v

On Tue, Jan 23, 2018 at 12:09 PM, Alan Orth  wrote:
> Hello,
>
> I saw that parallel-readdir was an experimental feature in GlusterFS version
> 3.10.0, became stable in version 3.11.0, and is now recommended for small
> file workloads in the Red Hat Gluster Storage Server documentation[2]. I've
> successfully enabled this on one of my volumes but I notice the following in
> the client mount log:
>
> [2018-01-23 10:24:24.048055] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
> 'parallel-readdir' is not recognized
> [2018-01-23 10:24:24.048072] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
> 'parallel-readdir' is not recognized
>
> The GlusterFS version on the client and server is 3.12.4. What is going on?
>
> [0]
> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
> [1]
> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
> [2]
> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements
>
> Thank you,
>
>
> --
>
> Alan Orth
> alan.o...@gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
>
>
> ___
> Gluster-users mailing list
> Gluster-users@gluster.org
> http://lists.gluster.org/mailman/listinfo/gluster-users
___
Gluster-users mailing list
Gluster-users@gluster.org
http://lists.gluster.org/mailman/listinfo/gluster-users


Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-25 Thread Alan Orth
By the way, on a slightly related note, I'm pretty sure either
parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. We
are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_64.

I updated my servers and clients to 3.12.4 and enabled these two options
after reading about them in the 3.10.0 and 3.11.0 release notes. In the
days after enabling these two options all of my clients kept getting
disconnected from the volume. The error upon attempting to list a directory
or read a file was "Transport endpoint is not connected", after which I
would force unmount the volume with `umount -fl /home` and remount it, only
to have it get disconnected again a few hours later.
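
"Transport endpoint is not connected" is the message for the ENOTCONN errno, so a disconnected FUSE mount can be told apart from other failures programmatically. The following is only a minimal sketch of such a check, not something taken from this thread: the function name `is_disconnected` is hypothetical, and `/home` is simply the mount point mentioned above.

```python
import errno
import os


def is_disconnected(mount_point):
    """Return True only if stat() on the path fails with ENOTCONN.

    Any other outcome (success, or a different error such as ENOENT)
    returns False, so this detects just the "Transport endpoint is not
    connected" state described above.
    """
    try:
        os.stat(mount_point)
    except OSError as exc:
        return exc.errno == errno.ENOTCONN
    return False


if __name__ == "__main__":
    # /home is the mount point used in this thread; adjust as needed.
    print(is_disconnected("/home"))
```

Run from cron or a monitoring agent, a check like this could record when the mount drops, which may help line the disconnections up with entries in the client mount log.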

Every time the volume disconnected I looked in the client mount log and
only found information such as:

[2018-01-24 05:52:27.695225] I [MSGID: 108026]
[afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1:
Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168.
sources=[0]  sinks=1
[2018-01-24 05:52:27.700611] I [MSGID: 108026]
[afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do]
2-homes-replicate-1: performing metadata selfheal on
b6a53629-a831-4ee3-a35e-f47c04297aaa
[2018-01-24 05:52:27.703021] I [MSGID: 108026]
[afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1:
Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa.
sources=[0]  sinks=1

I enabled debug logging for that volume's client mount with `gluster volume
set homes diagnostics.client-log-level DEBUG` and then I saw this in the
client mount log the next time it disconnected:

[2018-01-24 08:55:19.138810] D [MSGID: 0] [io-threads.c:358:iot_schedule]
0-homes-io-threads: LOOKUP scheduled as fast fop
[2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup]
0-homes-dht: Calling fresh lookup for
/vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on
homes-readdir-ahead-1
[2018-01-24 08:55:19.138928] D [MSGID: 0] [io-threads.c:358:iot_schedule]
0-homes-io-threads: FSTAT scheduled as fast fop
[2018-01-24 08:55:19.138958] D [MSGID: 0] [afr-read-txn.c:220:afr_read_txn]
0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation now
vs cached: 2, 2
[2018-01-24 08:55:19.139187] D [MSGID: 0]
[dht-common.c:2294:dht_lookup_cbk] 0-homes-dht: fresh_lookup returned for
/vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0
[2018-01-24 08:55:19.139200] D [MSGID: 0]
[dht-layout.c:873:dht_layout_preset] 0-homes-dht: file =
----, subvol = homes-readdir-ahead-1
[2018-01-24 08:55:19.139257] D [MSGID: 0] [io-threads.c:358:iot_schedule]
0-homes-io-threads: READDIRP scheduled as fast fop

On a hunch I disabled both parallel-readdir and readdir-ahead, which I had
only enabled a few days before, and now all of the clients are much more
stable, with zero disconnections in the days since I disabled those two
volume options.
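
For anyone who wants to try the same workaround, here is a rough sketch of the commands involved, wrapped in Python so they can be reviewed before anything is executed. It assumes the full option keys `performance.parallel-readdir` and `performance.readdir-ahead` (the thread only uses the short names) and the volume name `homes`; the helper names are hypothetical. parallel-readdir is switched off first on the assumption that it builds on readdir-ahead.

```python
import subprocess


def disable_readdir_options(volume):
    """Build the gluster CLI invocations that turn both options off.

    Assumption: parallel-readdir is listed first because it is layered
    on top of readdir-ahead.
    """
    options = ["performance.parallel-readdir", "performance.readdir-ahead"]
    return [["gluster", "volume", "set", volume, opt, "off"] for opt in options]


def run_all(commands, dry_run=True):
    """Print each command, or execute it when dry_run is False."""
    for argv in commands:
        if dry_run:
            print(" ".join(argv))
        else:
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    # Dry run by default: only prints the commands for the "homes" volume.
    run_all(disable_readdir_options("homes"))
```

The dry-run default is deliberate, since changing volume options affects every client of the volume.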

Please take a look! Thanks,

On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjee  wrote:

> Adding Poornima to take a look at it and comment.
>
> On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth  wrote:
>
>> Hello,
>>
>> I saw that parallel-readdir was an experimental feature in GlusterFS
>> version 3.10.0, became stable in version 3.11.0, and is now recommended for
>> small file workloads in the Red Hat Gluster Storage Server
>> documentation[2]. I've successfully enabled this on one of my volumes but I
>> notice the following in the client mount log:
>>
>> [2018-01-23 10:24:24.048055] W [MSGID: 101174]
>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
>> 'parallel-readdir' is not recognized
>> [2018-01-23 10:24:24.048072] W [MSGID: 101174]
>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
>> 'parallel-readdir' is not recognized
>>
>> The GlusterFS version on the client and server is 3.12.4. What is going
>> on?
>>
>> [0]
>> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
>> [1]
>> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
>> [2]
>> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements
>>
>> Thank you,
>>
>>
>> --
>>
>> Alan Orth
>> alan.o...@gmail.com
>> https://picturingjordan.com
>> https://englishbulgaria.net
>> https://mjanja.ch
>>
>> ___
>> Gluster-users mailing list
>> Gluster-users@gluster.org
>> http://lists.gluster.org/mailman/listinfo/gluster-users
>>
>
> --

Alan Orth
alan.o...@gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch

Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-23 Thread Atin Mukherjee
Adding Poornima to take a look at it and comment.

On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth  wrote:

> Hello,
>
> I saw that parallel-readdir was an experimental feature in GlusterFS
> version 3.10.0, became stable in version 3.11.0, and is now recommended for
> small file workloads in the Red Hat Gluster Storage Server
> documentation[2]. I've successfully enabled this on one of my volumes but I
> notice the following in the client mount log:
>
> [2018-01-23 10:24:24.048055] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
> 'parallel-readdir' is not recognized
> [2018-01-23 10:24:24.048072] W [MSGID: 101174]
> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
> 'parallel-readdir' is not recognized
>
> The GlusterFS version on the client and server is 3.12.4. What is going on?
>
> [0] https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
> [1] https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
> [2] https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements
>
> Thank you,
>
>
> --
>
> Alan Orth
> alan.o...@gmail.com
> https://picturingjordan.com
> https://englishbulgaria.net
> https://mjanja.ch
>
> ___
> Gluster-users mailing list
> Gluster-users@gluster.org
> http://lists.gluster.org/mailman/listinfo/gluster-users
>

[Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4

2018-01-23 Thread Alan Orth
Hello,

I saw that parallel-readdir was an experimental feature in GlusterFS
version 3.10.0[0], became stable in version 3.11.0[1], and is now
recommended for small file workloads in the Red Hat Gluster Storage Server
documentation[2]. I've successfully enabled this on one of my volumes but I
notice the following in the client mount log:

[2018-01-23 10:24:24.048055] W [MSGID: 101174]
[graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
'parallel-readdir' is not recognized
[2018-01-23 10:24:24.048072] W [MSGID: 101174]
[graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
'parallel-readdir' is not recognized

The GlusterFS version on the client and server is 3.12.4. What is going on?
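
To see how widespread the warning is across a client log, something like the sketch below could count it per translator. This is only an illustration: the regular expression is inferred from the two log entries quoted above (MSGID 101174, `_log_if_unknown_option`), and the names `WARNING_RE`, `unknown_options` and `SAMPLE` are hypothetical. Whitespace is normalised first because the entries are wrapped across lines in this email.

```python
import re
from collections import Counter

# Pattern inferred from the log entries quoted in this message.
WARNING_RE = re.compile(
    r"\[(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+)\]\s+W\s+"
    r"\[MSGID: 101174\]\s+\[graph\.c:\d+:_log_if_unknown_option\]\s+"
    r"(?P<xlator>\S+):\s+option\s+'(?P<option>[^']+)'\s+is not recognized"
)

SAMPLE = """
[2018-01-23 10:24:24.048055] W [MSGID: 101174]
[graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option
'parallel-readdir' is not recognized
[2018-01-23 10:24:24.048072] W [MSGID: 101174]
[graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option
'parallel-readdir' is not recognized
"""


def unknown_options(log_text):
    """Count (translator, option) pairs reported as not recognized."""
    # Collapse all whitespace so wrapped entries match as a single line.
    flat = " ".join(log_text.split())
    return Counter((m["xlator"], m["option"]) for m in WARNING_RE.finditer(flat))


if __name__ == "__main__":
    for (xlator, option), count in unknown_options(SAMPLE).items():
        print(f"{xlator}: '{option}' not recognized ({count}x)")
```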

[0]
https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md
[1]
https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md
[2]
https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements

Thank you,


-- 

Alan Orth
alan.o...@gmail.com
https://picturingjordan.com
https://englishbulgaria.net
https://mjanja.ch