Re: Weird issues with UNIX-Sockets on 2.1.x

Christian Ruppert Mon, 30 Mar 2020 05:38:09 -0700

On 2020-03-27 16:58, Christian Ruppert wrote:

On 2020-03-27 16:49, Olivier Houchard wrote:

On Fri, Mar 27, 2020 at 04:32:21PM +0100, Christian Ruppert wrote:

On 2020-03-27 16:27, Olivier Houchard wrote:
> On Fri, Mar 27, 2020 at 04:21:20PM +0100, Christian Ruppert wrote:
>> During the reload I just found something in the daemon log:
>> Mar 27 13:37:54 somelb haproxy[20799]: [ALERT] 086/133748 (20799) :
>> Starting proxy someotherlistener: cannot bind socket [0.0.0.0:18540]
>> Mar 27 13:37:54 somelb haproxy[20799]: [ALERT] 086/133748 (20799) :
>> Starting proxy someotherlistener: cannot bind socket [:::18540]
>>
>> So during the reload, this happened and seems to have caused any
>> further
>> issues/trouble.
>>
>
> That would make sense. Does that mean you have old processes hanging
> around ? Do you use seemless reload ? If so, it shouldn't attempt to
> bind the socket, but get them from the old process.

I remember that it was necessary to have a systemd wrapper around, asit

caused trouble otherwise, due to PID being changed etc.
Not sure if that wrapper is still in use. In this case it's systemd
though.
[Unit]
Description=HAProxy Load Balancer
After=network.target

[Service]

Environment="CONFIG=/etc/haproxy/haproxy.cfg""PIDFILE=/run/haproxy.pid"

ExecStartPre=/usr/sbin/haproxy -f $CONFIG -c -q
ExecStart=/usr/sbin/haproxy -Ws -f $CONFIG -p $PIDFILE
ExecReload=/usr/sbin/haproxy -f $CONFIG -c -q
ExecReload=/bin/kill -USR2 $MAINPID
KillMode=mixed
Restart=always
SuccessExitStatus=143
TimeoutStopSec=30
Type=notify


[...]

We've added the TimeoutStopSec=30 for some reason (I'd have to ask my
college, something took longer or something like that, since we have
quite a lot of frontends/listener/backend)

Only the two processes I mentioned before are / were running. Seemslike

the fallback didn't work properly?


The wrapper is no longer needed, it has been superceeded by the
master-worker (which you seem to be using, given you're using -Ws).

It is possible the old process refuse to die, and you end up hittingthe

timeout and it gets killed eventually, but it's too late.
Do you have a expose-fd listeners on the unix stats socket ? Using it

will allow the new process to connect to the old process' statssocket,

and get all the listening sockets, so that it won't have to bind them.


Oh, that sounds quite handy. I wasn't aware of it. I'll add it
soonish. Thanks for the hint!


https://www.haproxy.com/de/blog/hitless-reloads-with-haproxy-howto/

"Please note that this step does not need to be performed if yourHAProxy configuration already contains the directive “master-worker”, orif it is started with the option -W."


I have steps to reproduce it:
A C sample to bind the socket (nc doesn't work for some reason):
#include <stdio.h>
#include <sys/socket.h>
#include <arpa/inet.h>
#include <unistd.h>

int main() {
        int sock;
        struct sockaddr_in server;

        sock = socket(AF_INET , SOCK_STREAM , 0);
        if (sock == -1) {
                printf("Failed to create socket!\n");
        }

        server.sin_family = AF_INET;
        server.sin_addr.s_addr = INADDR_ANY;
        server.sin_port = htons(1338);

        if( bind(sock,(struct sockaddr *) &server , sizeof(server)) == -1) {
                printf("Failed to bind socket!\n");
        }

        while(1) {
                sleep(1);
        }

        return 0;
}

gcc socket.c -o socket
./socket

Having a initial HAProxy config:
global
        user haproxy
        group haproxy

        log-send-hostname

        log 127.0.0.1 len 65535 local0

stats socket unix@/run/haproxy.stat user haproxy gid haproxy mode600 level admin



frontend unixsocket_reload
        bind 127.0.0.1:1337
        bind unix@/run/haproxy-sockettest.sock user haproxy group root mode 600
        mode http
        log global


And starting it, with sytemd, ending up in:
/usr/sbin/haproxy -Ws -f /etc/haproxy/haproxy.cfg -p /run/haproxy.pid

Testing:
curl --unix-socket /run/haproxy-sockettest.sock http://127.0.0.1 -vs
echo help | socat unix-connect:/run/haproxy.stat stdio

Adding a second frontend to the haproxy.cfg:
frontend unixsocket_reload2
        bind 127.0.0.1:1338

bind unix@/run/haproxy-sockettest-2.sock user haproxy group root mode600

        mode http
        log global

systemctl reload haproxy

curl and socat doesn't work anymore while the TCP socket still works.

Now restarting HAProxy with the initial config but with the adjustedstats socket:stats socket unix@/run/haproxy.stat user haproxy gid haproxy mode 600level admin expose-fd listeners

Note that the -x will be appended automatically (at least for systemd-Ws)

And doing the same again. curl and socat still works. The new frontenddoes not even though the UNIX socket it created.I think the way that works is ok for me then. Thanks for pointing outthe expose-fd listeners!

Regards,

Olivier


--
Regards,
Christian Ruppert

Re: Weird issues with UNIX-Sockets on 2.1.x

Reply via email to