Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Thanks, Raghavendra. I won't be able to test readdir-ahead again because it caused a lot of issues for my users and I don't have the resources to set up a test environment right now. I hope someone can help figure this bug out eventually though. Cheers, On Tue, Jan 30, 2018 at 1:36 PM Raghavendra Gowdappa <rgowd...@redhat.com> wrote: > > > - Original Message - > > From: "Alan Orth" <alan.o...@gmail.com> > > To: "Raghavendra Gowdappa" <rgowd...@redhat.com> > > Cc: "gluster-users" <gluster-users@gluster.org> > > Sent: Tuesday, January 30, 2018 1:37:40 PM > > Subject: Re: [Gluster-users] parallel-readdir is not recognized in > GlusterFS 3.12.4 > > > > Thank you, Raghavendra. I guess this cosmetic fix will be in 3.12.6? > > Yes. > > > > > I'm also looking forward to seeing stability fixes to parallel-readdir > and or > > readdir-ahead in 3.12.x. :) > > I am not sure what caused disconnects. readdir-ahead shouldn't result in > purging of connections. We've not seen this behavior in our internal tests > (We have done quite a significant amount of testing on this to ascertain > performance impact). We need to investigate this. Can you file a bug on > this providing following information: > * log files of client and brick process > * test-case to reproduce the issue. > * Test environment (OS, volume profile info etc). > > regards, > Raghavendra > > > > Cheers, > > > > On Mon, Jan 29, 2018 at 9:26 AM Raghavendra Gowdappa < > rgowd...@redhat.com > > > wrote: > > > > > > > > > > - Original Message - > > > From: "Pranith Kumar Karampuri" < pkara...@redhat.com > > > > To: "Alan Orth" < alan.o...@gmail.com > > > > Cc: "gluster-users" < gluster-users@gluster.org > > > > Sent: Saturday, January 27, 2018 7:31:30 AM > > > Subject: Re: [Gluster-users] parallel-readdir is not recognized in > > > GlusterFS 3.12.4 > > > > > > Adding devs who work on it > > > > > > On 23 Jan 2018 10:40 pm, "Alan Orth" < alan.o...@gmail.com > wrote: > > > > > > > > > > > > Hello, > > > > > > I saw that parallel-readdir was an experimental feature in GlusterFS > > > version > > > 3.10.0, became stable in version 3.11.0, and is now recommended for > small > > > file workloads in the Red Hat Gluster Storage Server documentation[2]. > I've > > > successfully enabled this on one of my volumes but I notice the > following > > > in > > > the client mount log: > > > > > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > > > 'parallel-readdir' is not recognized > > > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > > > 'parallel-readdir' is not recognized > > > > > > > This is fixed by patch: > > https://review.gluster.org/#/c/18572/ > > > > Please note that the log message is harmless and parallel-readdir is > enabled > > even with this log. You can verify that by looking into gluster volume > > configuration logged in glusterfs logs. If parallel-readdir is enabled, > you > > should see a readdir-ahead translator loaded as a parent of each DHT > > subvolume. > > > > > The GlusterFS version on the client and server is 3.12.4. What is > going on? > > > > > > [0] > > > > https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md > > > [1] > > > > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > > > [2] > > > > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements > > > > > > Thank you, > > > > > > > > > -- > > > > > > > > > Alan Orth > > > alan.o...@gmail.com > > > https://picturingjordan.com > > > https://englishbulgaria.net > > > https://mjanja.ch > > > > > > ___ > > > Gluster-users mailing list > > > Gluster-users@gluster.org > > > http://lists.gluster.org/mailman/listinfo/gluster-users > > > > > > ___ > > > Gluster-users mailing list > > > Gluster-users@gluster.org > > > http://lists.gluster.org/mailman/listinfo/gluster-users > > -- > > > > > > Alan Orth > > alan.o...@gmail.com > > https://picturingjordan.com > > https://englishbulgaria.net > > https://mjanja.ch > > > > ___ > > Gluster-users mailing list > > Gluster-users@gluster.org > > http://lists.gluster.org/mailman/listinfo/gluster-users > -- Alan Orth alan.o...@gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
- Original Message - > From: "Alan Orth" <alan.o...@gmail.com> > To: "Raghavendra Gowdappa" <rgowd...@redhat.com> > Cc: "gluster-users" <gluster-users@gluster.org> > Sent: Tuesday, January 30, 2018 1:37:40 PM > Subject: Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS > 3.12.4 > > Thank you, Raghavendra. I guess this cosmetic fix will be in 3.12.6? Yes. > > I'm also looking forward to seeing stability fixes to parallel-readdir and or > readdir-ahead in 3.12.x. :) I am not sure what caused disconnects. readdir-ahead shouldn't result in purging of connections. We've not seen this behavior in our internal tests (We have done quite a significant amount of testing on this to ascertain performance impact). We need to investigate this. Can you file a bug on this providing following information: * log files of client and brick process * test-case to reproduce the issue. * Test environment (OS, volume profile info etc). regards, Raghavendra > > Cheers, > > On Mon, Jan 29, 2018 at 9:26 AM Raghavendra Gowdappa < rgowd...@redhat.com > > wrote: > > > > > - Original Message - > > From: "Pranith Kumar Karampuri" < pkara...@redhat.com > > > To: "Alan Orth" < alan.o...@gmail.com > > > Cc: "gluster-users" < gluster-users@gluster.org > > > Sent: Saturday, January 27, 2018 7:31:30 AM > > Subject: Re: [Gluster-users] parallel-readdir is not recognized in > > GlusterFS 3.12.4 > > > > Adding devs who work on it > > > > On 23 Jan 2018 10:40 pm, "Alan Orth" < alan.o...@gmail.com > wrote: > > > > > > > > Hello, > > > > I saw that parallel-readdir was an experimental feature in GlusterFS > > version > > 3.10.0, became stable in version 3.11.0, and is now recommended for small > > file workloads in the Red Hat Gluster Storage Server documentation[2]. I've > > successfully enabled this on one of my volumes but I notice the following > > in > > the client mount log: > > > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > > 'parallel-readdir' is not recognized > > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > > 'parallel-readdir' is not recognized > > > > This is fixed by patch: > https://review.gluster.org/#/c/18572/ > > Please note that the log message is harmless and parallel-readdir is enabled > even with this log. You can verify that by looking into gluster volume > configuration logged in glusterfs logs. If parallel-readdir is enabled, you > should see a readdir-ahead translator loaded as a parent of each DHT > subvolume. > > > The GlusterFS version on the client and server is 3.12.4. What is going on? > > > > [0] > > https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md > > [1] > > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > > [2] > > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements > > > > Thank you, > > > > > > -- > > > > > > Alan Orth > > alan.o...@gmail.com > > https://picturingjordan.com > > https://englishbulgaria.net > > https://mjanja.ch > > > > ___ > > Gluster-users mailing list > > Gluster-users@gluster.org > > http://lists.gluster.org/mailman/listinfo/gluster-users > > > > ___ > > Gluster-users mailing list > > Gluster-users@gluster.org > > http://lists.gluster.org/mailman/listinfo/gluster-users > -- > > > Alan Orth > alan.o...@gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Thank you, Raghavendra. I guess this cosmetic fix will be in 3.12.6? I'm also looking forward to seeing stability fixes to parallel-readdir and or readdir-ahead in 3.12.x. :) Cheers, On Mon, Jan 29, 2018 at 9:26 AM Raghavendra Gowdappa <rgowd...@redhat.com> wrote: > > > - Original Message - > > From: "Pranith Kumar Karampuri" <pkara...@redhat.com> > > To: "Alan Orth" <alan.o...@gmail.com> > > Cc: "gluster-users" <gluster-users@gluster.org> > > Sent: Saturday, January 27, 2018 7:31:30 AM > > Subject: Re: [Gluster-users] parallel-readdir is not recognized in > GlusterFS 3.12.4 > > > > Adding devs who work on it > > > > On 23 Jan 2018 10:40 pm, "Alan Orth" < alan.o...@gmail.com > wrote: > > > > > > > > Hello, > > > > I saw that parallel-readdir was an experimental feature in GlusterFS > version > > 3.10.0, became stable in version 3.11.0, and is now recommended for small > > file workloads in the Red Hat Gluster Storage Server documentation[2]. > I've > > successfully enabled this on one of my volumes but I notice the > following in > > the client mount log: > > > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > > 'parallel-readdir' is not recognized > > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > > 'parallel-readdir' is not recognized > > > > This is fixed by patch: > https://review.gluster.org/#/c/18572/ > > Please note that the log message is harmless and parallel-readdir is > enabled even with this log. You can verify that by looking into gluster > volume configuration logged in glusterfs logs. If parallel-readdir is > enabled, you should see a readdir-ahead translator loaded as a parent of > each DHT subvolume. > > > The GlusterFS version on the client and server is 3.12.4. What is going > on? > > > > [0] > > > https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md > > [1] > > > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > > [2] > > > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements > > > > Thank you, > > > > > > -- > > > > > > Alan Orth > > alan.o...@gmail.com > > https://picturingjordan.com > > https://englishbulgaria.net > > https://mjanja.ch > > > > ___ > > Gluster-users mailing list > > Gluster-users@gluster.org > > http://lists.gluster.org/mailman/listinfo/gluster-users > > > > ___ > > Gluster-users mailing list > > Gluster-users@gluster.org > > http://lists.gluster.org/mailman/listinfo/gluster-users > -- Alan Orth alan.o...@gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
- Original Message - > From: "Pranith Kumar Karampuri" <pkara...@redhat.com> > To: "Alan Orth" <alan.o...@gmail.com> > Cc: "gluster-users" <gluster-users@gluster.org> > Sent: Saturday, January 27, 2018 7:31:30 AM > Subject: Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS > 3.12.4 > > Adding devs who work on it > > On 23 Jan 2018 10:40 pm, "Alan Orth" < alan.o...@gmail.com > wrote: > > > > Hello, > > I saw that parallel-readdir was an experimental feature in GlusterFS version > 3.10.0, became stable in version 3.11.0, and is now recommended for small > file workloads in the Red Hat Gluster Storage Server documentation[2]. I've > successfully enabled this on one of my volumes but I notice the following in > the client mount log: > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > 'parallel-readdir' is not recognized > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > 'parallel-readdir' is not recognized > This is fixed by patch: https://review.gluster.org/#/c/18572/ Please note that the log message is harmless and parallel-readdir is enabled even with this log. You can verify that by looking into gluster volume configuration logged in glusterfs logs. If parallel-readdir is enabled, you should see a readdir-ahead translator loaded as a parent of each DHT subvolume. > The GlusterFS version on the client and server is 3.12.4. What is going on? > > [0] > https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md > [1] > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > [2] > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements > > Thank you, > > > -- > > > Alan Orth > alan.o...@gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
@Vlad, yes I kept group metadata-cache and I also have network.inode-lru-limit 9, but nothing else performance related. @Pranith great, thank you. Cheers, On Sat, Jan 27, 2018 at 4:01 AM Pranith Kumar Karampuriwrote: > Adding devs who work on it > > On 23 Jan 2018 10:40 pm, "Alan Orth" wrote: > >> Hello, >> >> I saw that parallel-readdir was an experimental feature in GlusterFS >> version 3.10.0, became stable in version 3.11.0, and is now recommended for >> small file workloads in the Red Hat Gluster Storage Server >> documentation[2]. I've successfully enabled this on one of my volumes but I >> notice the following in the client mount log: >> >> [2018-01-23 10:24:24.048055] W [MSGID: 101174] >> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option >> 'parallel-readdir' is not recognized >> [2018-01-23 10:24:24.048072] W [MSGID: 101174] >> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option >> 'parallel-readdir' is not recognized >> >> The GlusterFS version on the client and server is 3.12.4. What is going >> on? >> >> [0] >> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md >> [1] >> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md >> [2] >> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements >> >> Thank you, >> >> >> -- >> >> Alan Orth >> alan.o...@gmail.com >> https://picturingjordan.com >> https://englishbulgaria.net >> https://mjanja.ch >> >> ___ >> Gluster-users mailing list >> Gluster-users@gluster.org >> http://lists.gluster.org/mailman/listinfo/gluster-users >> > -- Alan Orth alan.o...@gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Adding devs who work on it On 23 Jan 2018 10:40 pm, "Alan Orth"wrote: > Hello, > > I saw that parallel-readdir was an experimental feature in GlusterFS > version 3.10.0, became stable in version 3.11.0, and is now recommended for > small file workloads in the Red Hat Gluster Storage Server > documentation[2]. I've successfully enabled this on one of my volumes but I > notice the following in the client mount log: > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > 'parallel-readdir' is not recognized > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > 'parallel-readdir' is not recognized > > The GlusterFS version on the client and server is 3.12.4. What is going on? > > [0] https://github.com/gluster/glusterfs/blob/release-3.10/ > doc/release-notes/3.10.0.md > [1] https://github.com/gluster/glusterfs/blob/release-3.11/ > doc/release-notes/3.11.0.md > [2] https://access.redhat.com/documentation/en-us/red_hat_ > gluster_storage/3.3/html/administration_guide/small_ > file_performance_enhancements > > Thank you, > > > -- > > Alan Orth > alan.o...@gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users > ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Thanks for info Alan, you've kept group metadata-cache, right? I have them enabled as well features.cache-invalidation=on features.cache-invalidation-timeout=600 performance.stat-prefetch=on performance.cache-invalidation=on performance.md-cache-timeout=600 network.inode-lru-limit=5 but on the brick I suspect issues only partialy performance.stat-prefetch=on performance.md-cache-timeout=600 will try without parallel-readdir and readdir-ahead v On Fri, Jan 26, 2018 at 6:59 AM, Alan Orthwrote: > Dear Vlad, > > I'm sorry, I don't want to test this again on my system just yet! It caused > too much instability for my users and I don't have enough resources for a > development environment. The only other variables that changed before the > crashes was the group metadata-cache[0], which I enabled the same day as the > parallel-readdir and readdir-ahead options: > > $ gluster volume set homes group metadata-cache > > I'm hoping Atin or Poornima can shed some light and squash this bug. > > [0] > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > > Regards, > > On Fri, Jan 26, 2018 at 6:10 AM Vlad Kopylov wrote: >> >> can you please test parallel-readdir or readdir-ahead gives >> disconnects? so we know which to disable >> >> parallel-readdir doing magic ran on pdf from last year >> >> https://events.static.linuxfound.org/sites/events/files/slides/Gluster_DirPerf_Vault2017_0.pdf >> >> -v >> >> On Thu, Jan 25, 2018 at 8:20 AM, Alan Orth wrote: >> > By the way, on a slightly related note, I'm pretty sure either >> > parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. >> > We >> > are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_6. >> > >> > I updated my servers and clients to 3.12.4 and enabled these two options >> > after reading about them in the 3.10.0 and 3.11.0 release notes. In the >> > days >> > after enabling these two options all of my clients kept getting >> > disconnected >> > from the volume. The error upon attempting to list a directory or read a >> > file was "Transport endpoint is not connected", after which I would >> > force >> > unmount the volume with `umount -fl /home` and remount it, only to have >> > it >> > get disconnected again a few hours later. >> > >> > Every time the volume disconnected I looked in the client mount log and >> > only >> > found information such as: >> > >> > [2018-01-24 05:52:27.695225] I [MSGID: 108026] >> > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: >> > Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168. >> > sources=[0] sinks=1 >> > [2018-01-24 05:52:27.700611] I [MSGID: 108026] >> > [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] >> > 2-homes-replicate-1: performing metadata selfheal on >> > b6a53629-a831-4ee3-a35e-f47c04297aaa >> > [2018-01-24 05:52:27.703021] I [MSGID: 108026] >> > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: >> > Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa. >> > sources=[0] sinks=1 >> > >> > I enabled debug logging for that volume's client mount with `gluster >> > volume >> > set homes diagnostics.client-log-level DEBUG` and then I saw this in the >> > client mount log the next time it disconnected: >> > >> > [2018-01-24 08:55:19.138810] D [MSGID: 0] >> > [io-threads.c:358:iot_schedule] >> > 0-homes-io-threads: LOOKUP scheduled as fast fop >> > [2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup] >> > 0-homes-dht: Calling fresh lookup for >> > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on >> > homes-readdir-ahead-1 >> > [2018-01-24 08:55:19.138928] D [MSGID: 0] >> > [io-threads.c:358:iot_schedule] >> > 0-homes-io-threads: FSTAT scheduled as fast fop >> > [2018-01-24 08:55:19.138958] D [MSGID: 0] >> > [afr-read-txn.c:220:afr_read_txn] >> > 0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation >> > now vs >> > cached: 2, 2 >> > [2018-01-24 08:55:19.139187] D [MSGID: 0] >> > [dht-common.c:2294:dht_lookup_cbk] >> > 0-homes-dht: fresh_lookup returned for >> > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0 >> > [2018-01-24 08:55:19.139200] D [MSGID: 0] >> > [dht-layout.c:873:dht_layout_preset] 0-homes-dht: file = >> > ----, subvol = homes-readdir-ahead-1 >> > [2018-01-24 08:55:19.139257] D [MSGID: 0] >> > [io-threads.c:358:iot_schedule] >> > 0-homes-io-threads: READDIRP scheduled as fast fop >> > >> > On a hunch I disabled both parallel-readdir and readdir-ahead, which I >> > had >> > only enabled a few days before, and now all of the clients are much more >> > stable, with zero disconnections in the days since I disabled those two >> > volume options. >> > >> > Please take a look! Thanks, >> > >> > On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjee >> > wrote: >> >> >> >> Adding Poornima to take a look
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Dear Vlad, I'm sorry, I don't want to test this again on my system just yet! It caused too much instability for my users and I don't have enough resources for a development environment. The only other variables that changed before the crashes was the group metadata-cache[0], which I enabled the same day as the parallel-readdir and readdir-ahead options: $ gluster volume set homes group metadata-cache I'm hoping Atin or Poornima can shed some light and squash this bug. [0] https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md Regards, On Fri, Jan 26, 2018 at 6:10 AM Vlad Kopylovwrote: > can you please test parallel-readdir or readdir-ahead gives > disconnects? so we know which to disable > > parallel-readdir doing magic ran on pdf from last year > > https://events.static.linuxfound.org/sites/events/files/slides/Gluster_DirPerf_Vault2017_0.pdf > > -v > > On Thu, Jan 25, 2018 at 8:20 AM, Alan Orth wrote: > > By the way, on a slightly related note, I'm pretty sure either > > parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. > We > > are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_6. > > > > I updated my servers and clients to 3.12.4 and enabled these two options > > after reading about them in the 3.10.0 and 3.11.0 release notes. In the > days > > after enabling these two options all of my clients kept getting > disconnected > > from the volume. The error upon attempting to list a directory or read a > > file was "Transport endpoint is not connected", after which I would force > > unmount the volume with `umount -fl /home` and remount it, only to have > it > > get disconnected again a few hours later. > > > > Every time the volume disconnected I looked in the client mount log and > only > > found information such as: > > > > [2018-01-24 05:52:27.695225] I [MSGID: 108026] > > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: > > Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168. > > sources=[0] sinks=1 > > [2018-01-24 05:52:27.700611] I [MSGID: 108026] > > [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] > > 2-homes-replicate-1: performing metadata selfheal on > > b6a53629-a831-4ee3-a35e-f47c04297aaa > > [2018-01-24 05:52:27.703021] I [MSGID: 108026] > > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: > > Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa. > > sources=[0] sinks=1 > > > > I enabled debug logging for that volume's client mount with `gluster > volume > > set homes diagnostics.client-log-level DEBUG` and then I saw this in the > > client mount log the next time it disconnected: > > > > [2018-01-24 08:55:19.138810] D [MSGID: 0] [io-threads.c:358:iot_schedule] > > 0-homes-io-threads: LOOKUP scheduled as fast fop > > [2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup] > > 0-homes-dht: Calling fresh lookup for > > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on > > homes-readdir-ahead-1 > > [2018-01-24 08:55:19.138928] D [MSGID: 0] [io-threads.c:358:iot_schedule] > > 0-homes-io-threads: FSTAT scheduled as fast fop > > [2018-01-24 08:55:19.138958] D [MSGID: 0] > [afr-read-txn.c:220:afr_read_txn] > > 0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation > now vs > > cached: 2, 2 > > [2018-01-24 08:55:19.139187] D [MSGID: 0] > [dht-common.c:2294:dht_lookup_cbk] > > 0-homes-dht: fresh_lookup returned for > > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0 > > [2018-01-24 08:55:19.139200] D [MSGID: 0] > > [dht-layout.c:873:dht_layout_preset] 0-homes-dht: file = > > ----, subvol = homes-readdir-ahead-1 > > [2018-01-24 08:55:19.139257] D [MSGID: 0] [io-threads.c:358:iot_schedule] > > 0-homes-io-threads: READDIRP scheduled as fast fop > > > > On a hunch I disabled both parallel-readdir and readdir-ahead, which I > had > > only enabled a few days before, and now all of the clients are much more > > stable, with zero disconnections in the days since I disabled those two > > volume options. > > > > Please take a look! Thanks, > > > > On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjee > wrote: > >> > >> Adding Poornima to take a look at it and comment. > >> > >> On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth > wrote: > >>> > >>> Hello, > >>> > >>> I saw that parallel-readdir was an experimental feature in GlusterFS > >>> version 3.10.0, became stable in version 3.11.0, and is now > recommended for > >>> small file workloads in the Red Hat Gluster Storage Server > documentation[2]. > >>> I've successfully enabled this on one of my volumes but I notice the > >>> following in the client mount log: > >>> > >>> [2018-01-23 10:24:24.048055] W [MSGID: 101174] > >>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > >>> 'parallel-readdir' is not recognized > >>> [2018-01-23
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
can you please test parallel-readdir or readdir-ahead gives disconnects? so we know which to disable parallel-readdir doing magic ran on pdf from last year https://events.static.linuxfound.org/sites/events/files/slides/Gluster_DirPerf_Vault2017_0.pdf -v On Thu, Jan 25, 2018 at 8:20 AM, Alan Orthwrote: > By the way, on a slightly related note, I'm pretty sure either > parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. We > are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_6. > > I updated my servers and clients to 3.12.4 and enabled these two options > after reading about them in the 3.10.0 and 3.11.0 release notes. In the days > after enabling these two options all of my clients kept getting disconnected > from the volume. The error upon attempting to list a directory or read a > file was "Transport endpoint is not connected", after which I would force > unmount the volume with `umount -fl /home` and remount it, only to have it > get disconnected again a few hours later. > > Every time the volume disconnected I looked in the client mount log and only > found information such as: > > [2018-01-24 05:52:27.695225] I [MSGID: 108026] > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: > Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168. > sources=[0] sinks=1 > [2018-01-24 05:52:27.700611] I [MSGID: 108026] > [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] > 2-homes-replicate-1: performing metadata selfheal on > b6a53629-a831-4ee3-a35e-f47c04297aaa > [2018-01-24 05:52:27.703021] I [MSGID: 108026] > [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: > Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa. > sources=[0] sinks=1 > > I enabled debug logging for that volume's client mount with `gluster volume > set homes diagnostics.client-log-level DEBUG` and then I saw this in the > client mount log the next time it disconnected: > > [2018-01-24 08:55:19.138810] D [MSGID: 0] [io-threads.c:358:iot_schedule] > 0-homes-io-threads: LOOKUP scheduled as fast fop > [2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup] > 0-homes-dht: Calling fresh lookup for > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on > homes-readdir-ahead-1 > [2018-01-24 08:55:19.138928] D [MSGID: 0] [io-threads.c:358:iot_schedule] > 0-homes-io-threads: FSTAT scheduled as fast fop > [2018-01-24 08:55:19.138958] D [MSGID: 0] [afr-read-txn.c:220:afr_read_txn] > 0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation now vs > cached: 2, 2 > [2018-01-24 08:55:19.139187] D [MSGID: 0] [dht-common.c:2294:dht_lookup_cbk] > 0-homes-dht: fresh_lookup returned for > /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0 > [2018-01-24 08:55:19.139200] D [MSGID: 0] > [dht-layout.c:873:dht_layout_preset] 0-homes-dht: file = > ----, subvol = homes-readdir-ahead-1 > [2018-01-24 08:55:19.139257] D [MSGID: 0] [io-threads.c:358:iot_schedule] > 0-homes-io-threads: READDIRP scheduled as fast fop > > On a hunch I disabled both parallel-readdir and readdir-ahead, which I had > only enabled a few days before, and now all of the clients are much more > stable, with zero disconnections in the days since I disabled those two > volume options. > > Please take a look! Thanks, > > On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjee wrote: >> >> Adding Poornima to take a look at it and comment. >> >> On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth wrote: >>> >>> Hello, >>> >>> I saw that parallel-readdir was an experimental feature in GlusterFS >>> version 3.10.0, became stable in version 3.11.0, and is now recommended for >>> small file workloads in the Red Hat Gluster Storage Server documentation[2]. >>> I've successfully enabled this on one of my volumes but I notice the >>> following in the client mount log: >>> >>> [2018-01-23 10:24:24.048055] W [MSGID: 101174] >>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option >>> 'parallel-readdir' is not recognized >>> [2018-01-23 10:24:24.048072] W [MSGID: 101174] >>> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option >>> 'parallel-readdir' is not recognized >>> >>> The GlusterFS version on the client and server is 3.12.4. What is going >>> on? >>> >>> [0] >>> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md >>> [1] >>> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md >>> [2] >>> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements >>> >>> Thank you, >>> >>> >>> -- >>> >>> Alan Orth >>> alan.o...@gmail.com >>> https://picturingjordan.com >>> https://englishbulgaria.net >>> https://mjanja.ch >>> >>> >>> ___ >>> Gluster-users mailing list >>>
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Same here, even after update to 3.12.5-2 [2018-01-26 02:48:58.113996] W [MSGID: 101174] [graph.c:363:_log_if_unknown_option] 0-q-readdir-ahead-0: option 'parallel-readdir' is not recognized -v On Tue, Jan 23, 2018 at 12:09 PM, Alan Orthwrote: > Hello, > > I saw that parallel-readdir was an experimental feature in GlusterFS version > 3.10.0, became stable in version 3.11.0, and is now recommended for small > file workloads in the Red Hat Gluster Storage Server documentation[2]. I've > successfully enabled this on one of my volumes but I notice the following in > the client mount log: > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > 'parallel-readdir' is not recognized > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > 'parallel-readdir' is not recognized > > The GlusterFS version on the client and server is 3.12.4. What is going on? > > [0] > https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md > [1] > https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md > [2] > https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements > > Thank you, > > > -- > > Alan Orth > alan.o...@gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
By the way, on a slightly related note, I'm pretty sure either parallel-readdir or readdir-ahead has a regression in GlusterFS 3.12.x. We are running CentOS 7 with kernel-3.10.0-693.11.6.el7.x86_6. I updated my servers and clients to 3.12.4 and enabled these two options after reading about them in the 3.10.0 and 3.11.0 release notes. In the days after enabling these two options all of my clients kept getting disconnected from the volume. The error upon attempting to list a directory or read a file was "Transport endpoint is not connected", after which I would force unmount the volume with `umount -fl /home` and remount it, only to have it get disconnected again a few hours later. Every time the volume disconnected I looked in the client mount log and only found information such as: [2018-01-24 05:52:27.695225] I [MSGID: 108026] [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: Completed metadata selfheal on ed3fbafc-734b-41ca-ab30-216399fb9168. sources=[0] sinks=1 [2018-01-24 05:52:27.700611] I [MSGID: 108026] [afr-self-heal-metadata.c:52:__afr_selfheal_metadata_do] 2-homes-replicate-1: performing metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa [2018-01-24 05:52:27.703021] I [MSGID: 108026] [afr-self-heal-common.c:1656:afr_log_selfheal] 2-homes-replicate-1: Completed metadata selfheal on b6a53629-a831-4ee3-a35e-f47c04297aaa. sources=[0] sinks=1 I enabled debug logging for that volume's client mount with `gluster volume set homes diagnostics.client-log-level DEBUG` and then I saw this in the client mount log the next time it disconnected: [2018-01-24 08:55:19.138810] D [MSGID: 0] [io-threads.c:358:iot_schedule] 0-homes-io-threads: LOOKUP scheduled as fast fop [2018-01-24 08:55:19.138849] D [MSGID: 0] [dht-common.c:2711:dht_lookup] 0-homes-dht: Calling fresh lookup for /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas on homes-readdir-ahead-1 [2018-01-24 08:55:19.138928] D [MSGID: 0] [io-threads.c:358:iot_schedule] 0-homes-io-threads: FSTAT scheduled as fast fop [2018-01-24 08:55:19.138958] D [MSGID: 0] [afr-read-txn.c:220:afr_read_txn] 0-homes-replicate-1: e6ee0427-b17d-4464-a738-e8ea70d77d95: generation now vs cached: 2, 2 [2018-01-24 08:55:19.139187] D [MSGID: 0] [dht-common.c:2294:dht_lookup_cbk] 0-homes-dht: fresh_lookup returned for /vchebii/revtrans/Hircus-XM_018067032.1.pep.align.fas with op_ret 0 [2018-01-24 08:55:19.139200] D [MSGID: 0] [dht-layout.c:873:dht_layout_preset] 0-homes-dht: file = ----, subvol = homes-readdir-ahead-1 [2018-01-24 08:55:19.139257] D [MSGID: 0] [io-threads.c:358:iot_schedule] 0-homes-io-threads: READDIRP scheduled as fast fop On a hunch I disabled both parallel-readdir and readdir-ahead, which I had only enabled a few days before, and now all of the clients are much more stable, with zero disconnections in the days since I disabled those two volume options. Please take a look! Thanks, On Wed, Jan 24, 2018 at 5:59 AM Atin Mukherjeewrote: > Adding Poornima to take a look at it and comment. > > On Tue, Jan 23, 2018 at 10:39 PM, Alan Orth wrote: > >> Hello, >> >> I saw that parallel-readdir was an experimental feature in GlusterFS >> version 3.10.0, became stable in version 3.11.0, and is now recommended for >> small file workloads in the Red Hat Gluster Storage Server >> documentation[2]. I've successfully enabled this on one of my volumes but I >> notice the following in the client mount log: >> >> [2018-01-23 10:24:24.048055] W [MSGID: 101174] >> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option >> 'parallel-readdir' is not recognized >> [2018-01-23 10:24:24.048072] W [MSGID: 101174] >> [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option >> 'parallel-readdir' is not recognized >> >> The GlusterFS version on the client and server is 3.12.4. What is going >> on? >> >> [0] >> https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md >> [1] >> https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md >> [2] >> https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements >> >> Thank you, >> >> >> -- >> >> Alan Orth >> alan.o...@gmail.com >> https://picturingjordan.com >> https://englishbulgaria.net >> https://mjanja.ch >> >> ___ >> Gluster-users mailing list >> Gluster-users@gluster.org >> http://lists.gluster.org/mailman/listinfo/gluster-users >> > > -- Alan Orth alan.o...@gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
Re: [Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Adding Poornima to take a look at it and comment. On Tue, Jan 23, 2018 at 10:39 PM, Alan Orthwrote: > Hello, > > I saw that parallel-readdir was an experimental feature in GlusterFS > version 3.10.0, became stable in version 3.11.0, and is now recommended for > small file workloads in the Red Hat Gluster Storage Server > documentation[2]. I've successfully enabled this on one of my volumes but I > notice the following in the client mount log: > > [2018-01-23 10:24:24.048055] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option > 'parallel-readdir' is not recognized > [2018-01-23 10:24:24.048072] W [MSGID: 101174] > [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option > 'parallel-readdir' is not recognized > > The GlusterFS version on the client and server is 3.12.4. What is going on? > > [0] https://github.com/gluster/glusterfs/blob/release-3.10/ > doc/release-notes/3.10.0.md > [1] https://github.com/gluster/glusterfs/blob/release-3.11/ > doc/release-notes/3.11.0.md > [2] https://access.redhat.com/documentation/en-us/red_hat_ > gluster_storage/3.3/html/administration_guide/small_ > file_performance_enhancements > > Thank you, > > > -- > > Alan Orth > alan.o...@gmail.com > https://picturingjordan.com > https://englishbulgaria.net > https://mjanja.ch > > ___ > Gluster-users mailing list > Gluster-users@gluster.org > http://lists.gluster.org/mailman/listinfo/gluster-users > ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users
[Gluster-users] parallel-readdir is not recognized in GlusterFS 3.12.4
Hello, I saw that parallel-readdir was an experimental feature in GlusterFS version 3.10.0, became stable in version 3.11.0, and is now recommended for small file workloads in the Red Hat Gluster Storage Server documentation[2]. I've successfully enabled this on one of my volumes but I notice the following in the client mount log: [2018-01-23 10:24:24.048055] W [MSGID: 101174] [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-1: option 'parallel-readdir' is not recognized [2018-01-23 10:24:24.048072] W [MSGID: 101174] [graph.c:363:_log_if_unknown_option] 0-homes-readdir-ahead-0: option 'parallel-readdir' is not recognized The GlusterFS version on the client and server is 3.12.4. What is going on? [0] https://github.com/gluster/glusterfs/blob/release-3.10/doc/release-notes/3.10.0.md [1] https://github.com/gluster/glusterfs/blob/release-3.11/doc/release-notes/3.11.0.md [2] https://access.redhat.com/documentation/en-us/red_hat_gluster_storage/3.3/html/administration_guide/small_file_performance_enhancements Thank you, -- Alan Orth alan.o...@gmail.com https://picturingjordan.com https://englishbulgaria.net https://mjanja.ch ___ Gluster-users mailing list Gluster-users@gluster.org http://lists.gluster.org/mailman/listinfo/gluster-users