Hello fellow drbd users
First post on the mailing list - thanks to the drbd devs and
contributors for this great piece of software. I've been using it for
years, it's been working flawlessly since then.
I have a production two-node pacemaker cluster (centos6) hosting a bunch
of virtual machines and I'm doing some tests for a planned upgrade to
centos7. With the elrepo rpm as well as the latest drbd-utils git, I get
the following syslog error message after "drbdadm up someresourcename":
NAME="$env{DEVICE}" ignored, kernel device nodes can not be renamed;
please fix it in /usr/lib/udev/rules.d/65-drbd.rules:8
Which effectively prevents /dev/drbd_blah from being created (eg,
"/dev/drbd_vm-mail-data"). This is easily fixed by symlinking:
-NAME=="", ENV{DEVICE}!="", NAME="$env{DEVICE}"
+ENV{DEVICE}!="", SYMLINK+="$env{DEVICE}"
(side question: I read that the by-res/ naming is 8.x legacy. Will the
9x series drop that and use exclusively /dev/drbd[0-9]+ and
/dev/drbd_{resourcename} devices ?)
Which leads me to a related issue with pacemaker: if I use
/dev/drbd_blah when defining resources without fixing the udv rules, the
drbd ocf resource file loops forever in my_udevsettle(). That's normal
since /dev/drbd_blah is not created, but there's no error message in
pacemaker log files so it took me a while to figure what the problem was
until I saw that the drbd ocf was stuck with sleep 1 processes (note to
others: look at the output of ps axf and any processes stuck under lrmd).
Isn't it possible to have more meaningful error reporting (or a
documented error code, but I don't see any in the OCF_ ones) ? Is it OK
to set a timeout in the ocf filer rather than relying on pacemaker's
defined timeout ? It would save some a lot of debugging time for the
8x->9x transition.
cheers
ivan
_______________________________________________
drbd-user mailing list
drbd-user@lists.linbit.com
http://lists.linbit.com/mailman/listinfo/drbd-user