There is not easy.

# lsof | grep drbd
drbd1_wor 3414 root cwd DIR 253,0 4096 2 / drbd1_wor 3414 root rtd DIR 253,0 4096 2 / drbd1_wor 3414 root txt unknown /proc/3414/exe

# ps aux  |grep 3414
root 3414 0.0 0.0 0 0 ? S Dec02 0:00 [drbd1_worker]
root      4690  0.0  0.0  61232   744 pts/1    R+   14:00   0:00 grep 3414
# lsof -p 3414
COMMAND    PID USER   FD      TYPE DEVICE SIZE NODE NAME
drbd1_wor 3414 root  cwd       DIR  253,0 4096    2 /
drbd1_wor 3414 root  rtd       DIR  253,0 4096    2 /
drbd1_wor 3414 root  txt   unknown                  /proc/3414/exe

kill -9 3414 doesn't do anything. I even tried to restart two nodes simultaneously - no luck.

Ivan.

On 12/05/2011 01:50 PM, Digimer wrote:
On 12/04/2011 09:25 PM, Ivan Pavlenko wrote:
Hi ALL,

Digimer, thank you again for your answer I'm really appreciate it!
Unfortunately, I've tried to fixes split brain manually several times.
It doesn't work.

# drbdadm disconnect r0
[root@infplsm017 ~]# drbdadm secondary r0
1: State change failed: (-12) Device is held open by someone
Command 'drbdsetup 1 secondary' terminated with exit code 11
# drbdadm -- --discard-my-data connect r0
1: Failure: (123) --discard-my-data not allowed when primary.
Command 'drbdsetup 1 net 10.10.24.10:7789 10.10.24.11:7789 C
--set-defaults --create-device --ping-timeout=20
--after-sb-2pri=disconnect --after-sb-1pri=discard-secondary
--after-sb-0pri=discard-zero-changes --allow-two-primaries
--discard-my-data' terminated with exit code 10
#

I guess I need to stop cluster daemons, don't I?

Thank you again,
Ivan
Something is, as the error indicates, still trying to use the DRBD
resource. Find it, stop it, and then you can demote the resource. Look
at the 'lsof' command, that will probably help you find the program
still using it.

_______________________________________________
drbd-user mailing list
drbd-user@lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user

Reply via email to