Re: [Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-03-28 Thread Soumya Koduri



On 2/8/19 11:53 AM, Soumya Koduri wrote:



On 2/8/19 3:20 AM, Maurits Lamers wrote:

Hi,



[2019-02-07 10:11:24.812606] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(yøêÙ

 Mz„–îSL4_@) failed
[2019-02-07 10:11:24.819376] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(eTnôEU«H.[2019-02-07 10:11:24.833299] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(gÇLÁèFà»0bЯk) failed
[2019-02-07 10:25:01.642509] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-2: 
server [node1]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:01.642805] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-1: 
server [node2]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:01.642946] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-4: 
server [node3]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:02.643120] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-3: 
server 127.0.1.1:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:02.643314] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-0: 
server [node4]:49152 has not responded in the last 42 seconds, 
disconnecting.


Strange that synctask failed. Could you please turn off 
features.cache-invalidation volume option and check if the issue 
still persists.




Turning the cache invalidation option off seems to have solved the 
freeze. Still testing, but it looks promising.




If thats the case, please turn on cache invalidation option back and 
collect couple of stack traces (using gstack) when the system freezes 
again.


FYI - Have got a chance to reproduce and RCA the issue [1]. Posted fix 
for review in the upstream [2]


Thanks,
Soumya

[1] https://bugzilla.redhat.com/show_bug.cgi?id=1693575
[2] https://review.gluster.org/22436



Thanks,
Soumya

cheers

Maurits


___
Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

___
Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

Re: [Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Soumya Koduri



On 2/8/19 3:20 AM, Maurits Lamers wrote:

Hi,



[2019-02-07 10:11:24.812606] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(yøêÙ

 Mz„–îSL4_@) failed
[2019-02-07 10:11:24.819376] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(eTnôEU«H.[2019-02-07 10:11:24.833299] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(gÇLÁèFà»0bЯk) failed
[2019-02-07 10:25:01.642509] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-2: 
server [node1]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:01.642805] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-1: 
server [node2]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:01.642946] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-4: 
server [node3]:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:02.643120] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-3: 
server 127.0.1.1:49152 has not responded in the last 42 seconds, 
disconnecting.
[2019-02-07 10:25:02.643314] C 
[rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-0: 
server [node4]:49152 has not responded in the last 42 seconds, 
disconnecting.


Strange that synctask failed. Could you please turn off 
features.cache-invalidation volume option and check if the issue still 
persists.




Turning the cache invalidation option off seems to have solved the 
freeze. Still testing, but it looks promising.




If thats the case, please turn on cache invalidation option back and 
collect couple of stack traces (using gstack) when the system freezes again.


Thanks,
Soumya

cheers

Maurits


___
Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

[Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Maurits Lamers
Hi all,

I am trying to find out more about why a nfs mount through nfs-ganesha of a 
glusterfs volume freezes.

Little bit of a background: 
The system consists of one glusterfs volume across 5 nodes. Every node runs 
Ubuntu 16.04, gluster 4.1.7 and nfs-ganesha 2.7.1
The gluster volume is exported using the setup described on the first half of 
https://docs.gluster.org/en/latest/Administrator%20Guide/NFS-Ganesha%20GlusterFS%20Integration/

The node which freezes is running Nextcloud in a docker setup, where the entire 
application is stored on a path, which is a nfs-ganesha mount of the glusterfs 
volume.
When I am running a synchronisation operation with this nextcloud instance, at 
some point the entire system freezes. The only solution is to completely 
restart the node, 
Just before this freeze the /var/log/ganesha/ganesha-gfapi.log file contains an 
error, which seems to result to timeouts after a short while.

The node running the nextcloud instance is the only one freezing, the rest of 
the cluster seems to not be affected.

2019-02-07 10:11:17.342132] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.345776] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.346079] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.396853] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.397650] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.398036] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.407839] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:24.812606] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(yøêÙ
  Mz„–îSL4_@) failed
[2019-02-07 10:11:24.819376] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(eTnôEU«H.https://lists.gluster.org/mailman/listinfo/gluster-users

Re: [Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Maurits Lamers
Hi,

> 
>> [2019-02-07 10:11:24.812606] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(yøêÙ
>>  Mz„–îSL4_@) failed
>> [2019-02-07 10:11:24.819376] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(eTnôEU«H.> [2019-02-07 10:11:24.833299] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(gÇLÁèFà»0bЯk) failed
>> [2019-02-07 10:25:01.642509] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-2: server 
>> [node1]:49152 has not responded in the last 42 seconds, disconnecting.
>> [2019-02-07 10:25:01.642805] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-1: server 
>> [node2]:49152 has not responded in the last 42 seconds, disconnecting.
>> [2019-02-07 10:25:01.642946] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-4: server 
>> [node3]:49152 has not responded in the last 42 seconds, disconnecting.
>> [2019-02-07 10:25:02.643120] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-3: server 
>> 127.0.1.1:49152 has not responded in the last 42 seconds, disconnecting.
>> [2019-02-07 10:25:02.643314] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-0: server 
>> [node4]:49152 has not responded in the last 42 seconds, disconnecting.
> 
> Strange that synctask failed. Could you please turn off 
> features.cache-invalidation volume option and check if the issue still 
> persists.
>> 

Turning the cache invalidation option off seems to have solved the freeze. 
Still testing, but it looks promising.

cheers

Maurits

___
Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users

Re: [Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Maurits Lamers
Hi,


> Op 7 feb. 2019, om 18:51 heeft Soumya Koduri  het 
> volgende geschreven:
> 
> On 2/7/19 6:01 PM, Maurits Lamers wrote:
>> Hi all,
>> I am trying to find out more about why a nfs mount through nfs-ganesha of a 
>> glusterfs volume freezes.
>> Little bit of a background:
>> The system consists of one glusterfs volume across 5 nodes. Every node runs 
>> Ubuntu 16.04, gluster 4.1.7 and nfs-ganesha 2.7.1
>> The gluster volume is exported using the setup described on the first half 
>> of 
>> https://docs.gluster.org/en/latest/Administrator%20Guide/NFS-Ganesha%20GlusterFS%20Integration/
>> The node which freezes is running Nextcloud in a docker setup, where the 
>> entire application is stored on a path, which is a nfs-ganesha mount of the 
>> glusterfs volume.
>> When I am running a synchronisation operation with this nextcloud instance, 
>> at some point the entire system freezes. The only solution is to completely 
>> restart the node,
>> Just before this freeze the /var/log/ganesha/ganesha-gfapi.log file contains 
>> an error, which seems to result to timeouts after a short while.
>> The node running the nextcloud instance is the only one freezing, the rest 
>> of the cluster seems to not be affected.
>> 2019-02-07 10:11:17.342132] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.345776] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.346079] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.396853] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.397650] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.398036] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
>> [2019-02-07 10:11:17.407839] W [dict.c:671:dict_ref] 
>> (-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
>>  [0x7f2f035139b4] 
>> -->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
>>  [0x7f2f037242cd] 
>> -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
>> [0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
> 
> There is a patch [1] submitted and under review which fixes above error 
> messages.
> 
>> [2019-02-07 10:11:24.812606] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(yøêÙ
>>  Mz„–îSL4_@) failed
>> [2019-02-07 10:11:24.819376] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(eTnôEU«H.> [2019-02-07 10:11:24.833299] E [MSGID: 104055] 
>> [glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
>> event_type(1) and gfid(gÇLÁèFà»0bЯk) failed
>> [2019-02-07 10:25:01.642509] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-2: server 
>> [node1]:49152 has not responded in the last 42 seconds, disconnecting.
>> [2019-02-07 10:25:01.642805] C 
>> [rpc-clnt-ping.c:166:rpc_clnt_ping_timer_expired] 0-gv0-client-1: server 
>> 

Re: [Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Soumya Koduri



On 2/7/19 6:01 PM, Maurits Lamers wrote:

Hi all,

I am trying to find out more about why a nfs mount through nfs-ganesha of a 
glusterfs volume freezes.

Little bit of a background:
The system consists of one glusterfs volume across 5 nodes. Every node runs 
Ubuntu 16.04, gluster 4.1.7 and nfs-ganesha 2.7.1
The gluster volume is exported using the setup described on the first half of 
https://docs.gluster.org/en/latest/Administrator%20Guide/NFS-Ganesha%20GlusterFS%20Integration/

The node which freezes is running Nextcloud in a docker setup, where the entire 
application is stored on a path, which is a nfs-ganesha mount of the glusterfs 
volume.
When I am running a synchronisation operation with this nextcloud instance, at 
some point the entire system freezes. The only solution is to completely 
restart the node,
Just before this freeze the /var/log/ganesha/ganesha-gfapi.log file contains an 
error, which seems to result to timeouts after a short while.

The node running the nextcloud instance is the only one freezing, the rest of 
the cluster seems to not be affected.

2019-02-07 10:11:17.342132] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.345776] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.346079] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.396853] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.397650] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.398036] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.407839] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd) 
[0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]


There is a patch [1] submitted and under review which fixes above error 
messages.



[2019-02-07 10:11:24.812606] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(yøêÙ
  Mz„–îSL4_@) failed
[2019-02-07 10:11:24.819376] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall event_type(1) 
and gfid(eTnôEU«H.

Strange that synctask failed. Could you please turn off 
features.cache-invalidation volume option and check if the issue still 
persists.


Also next time the system freezes, please check following:

1) if all the brick servers of that volume are up and running. Sometimes 
if the brick servers are not reachable, it takes a while for outstanding 
requests to timeout and get application (nfs-ganesha) back to normal 
state. Please wait for a while and check if the mount becomes accessible.


2) check if nfs-ganesha server is responding to other requests -
#showmount -e localhost (on the node server is running)
# try mount and I/Os from any other client.

3) and if the ganesha server isnt responding to any client, please try 
collecting 2-3 cores/stack traces of nfs-ganesha server 

[Gluster-users] glusterfs 4.1.7 + nfs-ganesha 2.7.1 freeze during write

2019-02-07 Thread Maurits Lamers
Hi all,

I am trying to find out more about why a nfs mount through nfs-ganesha of a 
glusterfs volume freezes.

Little bit of a background: 
The system consists of one glusterfs volume across 5 nodes. Every node runs 
Ubuntu 16.04, gluster 4.1.7 and nfs-ganesha 2.7.1
The gluster volume is exported using the setup described on the first half of 
https://docs.gluster.org/en/latest/Administrator%20Guide/NFS-Ganesha%20GlusterFS%20Integration/

The node which freezes is running Nextcloud in a docker setup, where the entire 
application is stored on a path, which is a nfs-ganesha mount of the glusterfs 
volume.
When I am running a synchronisation operation with this nextcloud instance, at 
some point the entire system freezes. The only solution is to completely 
restart the node, 
Just before this freeze the /var/log/ganesha/ganesha-gfapi.log file contains an 
error, which seems to result to timeouts after a short while.

The node running the nextcloud instance is the only one freezing, the rest of 
the cluster seems to not be affected.

2019-02-07 10:11:17.342132] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.345776] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.346079] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.396853] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.397650] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.398036] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:17.407839] W [dict.c:671:dict_ref] 
(-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/quick-read.so(+0x59b4)
 [0x7f2f035139b4] 
-->/usr/lib/x86_64-linux-gnu/glusterfs/4.1.7/xlator/performance/io-cache.so(+0xa2cd)
 [0x7f2f037242cd] -->/usr/lib/x86_64-linux-gnu/libglusterfs.so.0(dict_ref+0x50) 
[0x7f2f0f312370] ) 0-dict: dict is NULL [Invalid argument]
[2019-02-07 10:11:24.812606] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(yøêÙ
 Mz„–îSL4_@) failed
[2019-02-07 10:11:24.819376] E [MSGID: 104055] 
[glfs-fops.c:4955:glfs_cbk_upcall_data] 0-gfapi: Synctak for Upcall 
event_type(1) and gfid(eTnôEU«H.https://lists.gluster.org/mailman/listinfo/gluster-users