Hi Yury,

For the crash -> is there any core file to check ?

For mem usage -> you should try to get a memory dump for further investigation [1].

[1] https://opensips.org/Documentation/TroubleShooting-OutOfMem

Best regards,

Bogdan-Andrei Iancu

OpenSIPS Founder and Developer
  https://www.opensips-solutions.com
OpenSIPS Summit 27-30 Sept 2022, Athens
  https://www.opensips.org/events/Summit-2022Athens/

On 9/14/22 10:13 PM, Yury Kirsanov wrote:
Hi Bogdan,
Thanks a lot for your help and support! The only question I know have is why OpenSIPS was going into a crash if all TCP processes were blocked waiting for connection? It was starting to consume more and more memory and then it was crashing with a segfault upon reaching then -m memory parameter. I do understand that TCP listeners were in a blocking mode and were not able to do any work until the session could be fully established, not being able to forward any SIP packets, but isn't that a bug that OpenSIPS was starting to eat memory and then crash? Do I need to open a bug report on this? Thanks!

Best regards,
Yury.

On Wed, Sep 14, 2022 at 10:58 PM Bogdan-Andrei Iancu <bog...@opensips.org <mailto:bog...@opensips.org>> wrote:

    Hi Yury,

    You need to check the TCP setting and to be sure your OpenSIPS
    will (1) not try to perform TCP connect against destination known
    not to be able to accept (like TCP/WS end points behind NAT) - see
    the tcp_no_new_conn_bflag [1] - or (2) not block for long time
    while attempting a connect - see the tcp_connect_timeout [2] or
    consider enabling async [3].

    [1]
    
https://www.opensips.org/Documentation/Script-CoreParameters-3-2#tcp_no_new_conn_bflag
    
<https://www.opensips.org/Documentation/Script-CoreParameters-3-2#tcp_no_new_conn_bflag>
    [2]
    
https://www.opensips.org/Documentation/Script-CoreParameters-3-2#tcp_connect_timeout
    
<https://www.opensips.org/Documentation/Script-CoreParameters-3-2#tcp_connect_timeout>
    [3]
    https://opensips.org/html/docs/modules/3.2.x/proto_tcp.html#idp168992
    <https://opensips.org/html/docs/modules/3.2.x/proto_tcp.html#idp168992>

    Regards,

    Bogdan-Andrei Iancu

    OpenSIPS Founder and Developer
       https://www.opensips-solutions.com  <https://www.opensips-solutions.com>
    OpenSIPS Summit 27-30 Sept 2022, Athens
       https://www.opensips.org/events/Summit-2022Athens/  
<https://www.opensips.org/events/Summit-2022Athens/>

    On 9/13/22 12:01 PM, Yury Kirsanov wrote:
    Hi Bogdan,
    Thanks for this update, but it looks like I can't check
    autoscaler because of this first issue with blocking TCP connect.
    Is there a way to resolve it? Am I doing something wrong? Or is
    that something to do with OpenSIPS code? As yes, you're right, as
    soon as I restart OpenSIPS having a lot of SIP devices trying to
    connect to it - it goes crazy, starts to consume memory and stops
    to forward packets sitting there at 100% load until it runs out
    of memory and segfaults. Sometimes I can't even restart it to
    come to normal state to make it work, it just loops into same
    crash whatever I try to do.

    I've compiled OpenSIPS 3.3.1 with your patch and was able to
    start it but not sure, maybe I was just lucky this time.

    What should I do? Thanks!

    Best regards,
    Yury.

    On Tue, 13 Sept 2022, 18:56 Bogdan-Andrei Iancu,
    <bog...@opensips.org <mailto:bog...@opensips.org>> wrote:

        Hi Yury,

        it looks like you some multiple issues, overlapping here. The
        traps you sent here have nothing to do with the auto-scaling,
        but with a blocking TCP connect for SIP - most of the procs
        get blocked into a sync TCP connect.

        Regards,

        Bogdan-Andrei Iancu

        OpenSIPS Founder and Developer
           https://www.opensips-solutions.com  
<https://www.opensips-solutions.com>
        OpenSIPS Summit 27-30 Sept 2022, Athens
           https://www.opensips.org/events/Summit-2022Athens/  
<https://www.opensips.org/events/Summit-2022Athens/>

        On 9/12/22 4:39 PM, Yury Kirsanov wrote:
        Hi Bogdan,
        I've applied the patch (had to find where to apply it
        manually for 3.2.8 downloaded from Web page, line 1568
        instead of 1652) and restarted the server with only about
        300-350 SIP devices and immediately got into same issue. I'm
        attaching two GDB dumps made within several minutes from
        each other. Autoscale was now OFF, please see my previous
        message as currently for some reason I'm experiencing
        lockups even when it's off :(

        Best regards,
        Yury.

        On Mon, Sep 12, 2022 at 7:48 PM Bogdan-Andrei Iancu
        <bog...@opensips.org <mailto:bog...@opensips.org>> wrote:

            Hi Yuri,

            Could you give this patch a try? it should fix the
            blocking you experience (it should apply on 3.2 too).

            Best regards,

            Bogdan-Andrei Iancu

            OpenSIPS Founder and Developer
               https://www.opensips-solutions.com  
<https://www.opensips-solutions.com>
            OpenSIPS Summit 27-30 Sept 2022, Athens
               https://www.opensips.org/events/Summit-2022Athens/  
<https://www.opensips.org/events/Summit-2022Athens/>

            On 9/7/22 2:54 PM, Bogdan-Andrei Iancu wrote:
            Hi Yury,

            Thanks for the details info here - let me do a review
            of some code and run some tests, as at this point I
            have a good idea on the direction to dig into.

            I will update here.

            Best regards,
            Bogdan-Andrei Iancu

            OpenSIPS Founder and Developer
               https://www.opensips-solutions.com  
<https://www.opensips-solutions.com>
            OpenSIPS Summit 27-30 Sept 2022, Athens
               https://www.opensips.org/events/Summit-2022Athens/  
<https://www.opensips.org/events/Summit-2022Athens/>
            On 9/6/22 11:24 AM, Yury Kirsanov wrote:
            Hi Bogdan,
            Yes, I'm listening on all types of sockets including
            UDP, TCP and TLS on the outside public interface and
            then forward traffic into internal LAN via UDP only.

            Previously it was getting stuck quite easily, now I
            had to wait for a while before this actually happened.
            I've routed part of my customers to this server to
            obtain this result so I will have to do that again.

            As soon as I see one of the processes stuck I'll dot
            the trap command and send you all the details
            including processes load, ps output and so on.

            For now I had to switch autoscaling off and just
            create many listeners. Do I understand correctly that
            I need to restart OpenSIPS in order to apply
            autoscaling profiles and reload-routes is not sufficient?

            Also, do I need separate UDP profiles for public and
            private interfaces? And do I need to apply autoscaling
            profile just to a socket or I need to specify udp or
            tcp_workers with autoscaler too?

            Thanks and best regards,
            Yury.

            On Tue, 6 Sept 2022, 18:18 Bogdan-Andrei Iancu,
            <bog...@opensips.org <mailto:bog...@opensips.org>> wrote:

                Hi Yury,

                Thanks for the info. I see that the stuck process
                (24) is an auto-scalled one (based on its id). Do
                you have SIP traffic from UDP to TCP or doing some
                HEP capturing for SIP ? I saw a recent similar
                report where a UDP auto-scalled worked got stuck
                when trying to do some communication with the TCP
                main/manager process (in order to handle a TCP
                operation).

                BTW, any chance to do a "opensips-cli -x trap"
                when you have that stuck process, just to see
                where is it stuck? and is it hard to reproduce? as
                I may ask you to extract some information from the
                running process....

                Regards,

                Bogdan-Andrei Iancu

                OpenSIPS Founder and Developer
                   https://www.opensips-solutions.com  
<https://www.opensips-solutions.com>
                OpenSIPS Summit 27-30 Sept 2022, Athens
                   https://www.opensips.org/events/Summit-2022Athens/  
<https://www.opensips.org/events/Summit-2022Athens/>

                On 9/3/22 6:54 PM, Yury Kirsanov wrote:



            _______________________________________________
            Users mailing list
            Users@lists.opensips.org  <mailto:Users@lists.opensips.org>
            http://lists.opensips.org/cgi-bin/mailman/listinfo/users  
<http://lists.opensips.org/cgi-bin/mailman/listinfo/users>




_______________________________________________
Users mailing list
Users@lists.opensips.org
http://lists.opensips.org/cgi-bin/mailman/listinfo/users

Reply via email to