[Gluster-users] Thousands of EPOLLERR - disconnecting now

2018-02-08 Thread Gino Lisignoli
Hello, I have a large cluster in which every node is logging: I [socket.c:2474:socket_event_handler] 0-transport: EPOLLERR - disconnecting now at a rate of around 4 or 5 per second per node, which adds up to a lot of messages. This seems to happen while my cluster is idle.
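To put a number on the "4 or 5 per second" estimate, the messages can be counted per timestamp. This is a minimal sketch, not a Gluster tool: it assumes each log line begins with a bracketed timestamp of the form [YYYY-MM-DD HH:MM:SS.ffffff], and the function names are hypothetical.

```python
import re
from collections import Counter

# Assumption: every log line starts with "[YYYY-MM-DD HH:MM:SS.ffffff]".
# Only the part up to whole seconds is captured, so lines are bucketed per second.
TIMESTAMP = re.compile(r"^\[(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})")

# The message text quoted in the post above.
MESSAGE = "EPOLLERR - disconnecting now"


def epollerr_per_second(lines):
    """Return a Counter mapping each second to the number of EPOLLERR lines in it."""
    counts = Counter()
    for line in lines:
        if MESSAGE not in line:
            continue
        match = TIMESTAMP.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def average_rate(counts):
    """Average messages per second, over the seconds that logged at least one."""
    return sum(counts.values()) / len(counts) if counts else 0.0
```

Feeding it the lines of one node's log (for example, an open file object) gives a per-second breakdown that can be compared between idle and busy periods.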

[Gluster-users] trusted.ec.dirty attribute

2018-02-08 Thread Dmitri Chebotarov
Hi, I've got a problem on an EC volume where heal doesn't seem to work for just a few files (heal info shows no progress for a few days, warning on the client mount). I ran getfattr -m . -d -e hex / across all servers in the cluster and the 'trusted.ec.dirty' attr is non-zero on all files which don't
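The check described above (looking for a non-zero 'trusted.ec.dirty' value in getfattr output) could be scripted roughly as follows. This is only a sketch: it assumes the usual getfattr dump layout of a "# file: ..." header followed by name=0x... lines, and the function names are hypothetical.

```python
def parse_getfattr(text):
    """Parse 'getfattr -d -e hex' style output into {path: {attr_name: hex_value}}."""
    result = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if line.startswith("# file: "):
            # Start of a new file record.
            current = line[len("# file: "):]
            result[current] = {}
        elif "=" in line and current is not None:
            name, value = line.split("=", 1)
            result[current][name] = value
    return result


def dirty_files(text, attr="trusted.ec.dirty"):
    """Return the paths whose attr is present and not all zeros."""
    dirty = []
    for path, attrs in parse_getfattr(text).items():
        value = attrs.get(attr)
        # int(..., 16) accepts the leading "0x" produced by '-e hex'.
        if value is not None and int(value, 16) != 0:
            dirty.append(path)
    return dirty
```

Running it on the saved output from each server would list the files that still carry a non-zero dirty value on that server.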

Re: [Gluster-users] Thousands of EPOLLERR - disconnecting now

2018-02-08 Thread Vijay Bellur
On Thu, Feb 8, 2018 at 2:04 PM, Gino Lisignoli wrote: > Hello > > I have a large cluster in which every node is logging: > > I [socket.c:2474:socket_event_handler] 0-transport: EPOLLERR - > disconnecting now > > At a rate of around 4 or 5 per second per node, which is

Re: [Gluster-users] trusted.ec.dirty attribute

2018-02-08 Thread Ashish Pandey
It is not at all good. It should be healed and the dirty xattr should be set to all zeros. Please provide all the xattrs of all the fragments of this file. Provide gluster v heal info. Run gluster v heal and provide glustershd.logs. Provide gluster v status. --- Ashish - Original

[Gluster-users] How to fix an out-of-sync node?

2018-02-08 Thread Frizz
I have a setup with 3 nodes running GlusterFS. gluster volume create myBrick replica 3 node01:/mnt/data/myBrick node02:/mnt/data/myBrick node03:/mnt/data/myBrick Unfortunately node1 seemed to stop syncing with the other nodes, but this was undetected for weeks! When I noticed it, I did a

Re: [Gluster-users] Thousands of EPOLLERR - disconnecting now

2018-02-08 Thread Atin Mukherjee
Are you running gluster version <= 3.12? Did you happen to start seeing this flood after a rebalance? I'm just trying to rule out that you're hitting https://bugzilla.redhat.com/show_bug.cgi?id=1484885 . On Fri, Feb 9, 2018 at 4:45 AM, Vijay Bellur wrote: > > On Thu, Feb 8,

Re: [Gluster-users] self-heal trouble after changing arbiter brick

2018-02-08 Thread Karthik Subrahmanya
Hey, Did the heal complete, and do you still have some entries pending heal? If yes, then can you provide the following information to debug the issue: 1. Which version of gluster you are running 2. gluster volume heal info summary or gluster volume heal info 3. getfattr -d -e hex -m . output of

Re: [Gluster-users] self-heal trouble after changing arbiter brick

2018-02-08 Thread Karthik Subrahmanya
On Fri, Feb 9, 2018 at 11:46 AM, Karthik Subrahmanya wrote: > Hey, > > Did the heal complete, and do you still have some entries pending heal? > If yes, then can you provide the following information to debug the issue: > 1. Which version of gluster you are running > 2. Output

Re: [Gluster-users] How to fix an out-of-sync node?

2018-02-08 Thread Karthik Subrahmanya
Hi, From the information you provided, I am guessing that you have a replica 3 volume configured. In that case you can run "gluster volume heal " which should do the trick for you. Regards, Karthik On Thu, Feb 8, 2018 at 6:16 AM, Frizz wrote: > I have a setup with