Bugs item #3552688, was opened at 2012-07-31 09:28
Message generated for change (Comment added) made by dragosoancea
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=1086410&aid=3552688&group_id=232389

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: core
Group: 1.7.x
Status: Open
Resolution: Accepted
Priority: 5
Private: No
Submitted By: Dragos Oancea (dragosoancea)
Assigned to: Bogdan-Andrei Iancu (bogdan_iancu)
Summary: opensips crash with TLS 

Initial Comment:

Hi all,

opensips 1.7.2 crashes when using  TLS with create_dialog("Pp")  in the routing 
script -  that would send OPTIONS (nat ping) both to caller and callee during a 
dialog . 


The TLS-related relevant lines in the routing script are: 

tls_verify_server = 1
tls_verify_client = 0
tls_require_client_certificate = 0
tls_method = TLSv1
tls_certificate = "/etc/pki/CA/certs/x.crt"
tls_private_key = "/etc/pki/CA/private/x.key"
tls_ca_list = "/etc/pki/CA/certs/ca.crt"


listen = tls:X.X.X.X:5061
listen = tcp:X.X.X.X:5060



syslog:
http://pastebin.com/Kkdns7Cr

backtrace:

http://pastebin.com/7P4ADL9y


Apparently it crashes just after trying to send an OPTIONS or BYE to a device 
that is not there anymore (it's not on the socket opensips expects it to be - 
opensips
usually generates a "477 SendFailed" reply in situations like this) .


and interesting enough, if I add an udp port to listen to with 
"listen=udp:X.X.X.X:5060" does not crash anymore.


Regards,
Dragos


----------------------------------------------------------------------

>Comment By: Dragos Oancea (dragosoancea)
Date: 2012-10-30 04:24

Message:
Hi Bogdan,

I am running 1.8.1 with the tls_init.c patch  and the dlg_unref.patch (
bug-id 3570495 ) .
I had no crash anymore.
However, I cannot be 100% sure that the bug is not  there anymore because I
had to make some changes in the routing script.
When it was crashing (and not patched) I had :
modparam("dialog", "ping_interval", 20)
modparam("tm", "fr_timer", 15) 
Few days after I patched it had to change these values to (because opensips
was dropping calls - the reply to OPTIONS from the clients was not coming
due to network issues ): 
modparam("dialog", "ping_interval", 40)
modparam("tm", "fr_timer", 30)
So I ran about  3 days with the old config and the patched version and then
I changed these params , and it is running like this since then (about a
month I think) . 

#opensips -V
version: opensips 1.8.1-tls (x86_64/linux)
flags: STATS: Off, USE_IPV6, USE_TCP, USE_TLS, DISABLE_NAGLE, USE_MCAST,
SHM_MEM, SHM_MMAP, PKG_MALLOC, F_MALLOC, FAST_LOCK-ADAPTIVE_WAIT
ADAPTIVE_WAIT_LOOPS=1024, MAX_RECV_BUFFER_SIZE 262144, MAX_LISTEN 16,
MAX_URI_SIZE 1024, BUF_SIZE 65535
poll method support: poll, epoll_lt, epoll_et, sigio_rt, select.
svnrevision: unknown
@(#) $Id: main.c 8772 2012-03-08 11:16:13Z bogdan_iancu $
main.c compiled on 11:49:51 Oct 15 2012 with gcc 4.4.6


Regards,
Dragos

----------------------------------------------------------------------

Comment By: Bogdan-Andrei Iancu (bogdan_iancu)
Date: 2012-10-30 01:54

Message:
Hi Dragos,

Is this still happening with the latest code from 1.8 SVN branch ?

Regards,
Bogdan

----------------------------------------------------------------------

Comment By: Dragos Oancea (dragosoancea)
Date: 2012-08-29 06:25

Message:
Hi

It happened again. This time I had only two phones registered via TLS and I
was just making a call between them.

gdb:
http://pastebin.com/1miGb7ct

log (stderr):
http://pastebin.com/bvDjLzPh

can someone confirm it is related to 3522861 ? 

Cheers,
Dragos 

----------------------------------------------------------------------

Comment By: Dragos Oancea (dragosoancea)
Date: 2012-08-16 04:00

Message:
Hi

I think there is only one problem.
so because the OPTIONS is sent to the wrong interface , no reply will come
back , and opensips will generate and send a BYE bothways. but in my case
everything is running fine..with OPTIONS being sent to the right places at
first , then something happends (memory corruption) , and maybe the
function that does the pinging is first to access some unallocated memory.
The crash could also happen when the callee or the caller sends  BYE.

Some extra informations: 

There are mobile devices under NAT running on TCP or  TLS involved in this
whole scenario. So when the mobile device is not there anymore (it ran out
of battery durring a call for example) , the crash is most likely to
happen.
Also , I noticed that there is no problem if I only listen to tls (not
listening on tcp, not listening on udp).
But I need tcp , so I cannot disable it.

another gdb backtrace :

http://pastebin.com/aXgABJtE

I hope to replicate this in a controlled environment with memlog/memdump
soon and let you.


----------------------------------------------------------------------

Comment By: Bogdan-Andrei Iancu (bogdan_iancu)
Date: 2012-08-15 06:16

Message:
Probably related to 3522861

----------------------------------------------------------------------

Comment By: Bogdan-Andrei Iancu (bogdan_iancu)
Date: 2012-08-08 08:55

Message:
Hi,

I see here 2 issues - one is the crash itself (which seems to be a memory
corruption) ; second one is related to pinging, which seems not to choose
the right interface (selects a UDP one instead TLS).

I suggest first trying to identify the mem issue, and for this you need to
recompile with memory debugging support
(http://www.opensips.org/Resources/DocsTsMem , set memlog=6, memdump=1) .
most probably the interface issues  triggers some bogus mem ops..

Regards,
Bogdan

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=1086410&aid=3552688&group_id=232389

_______________________________________________
Devel mailing list
Devel@lists.opensips.org
http://lists.opensips.org/cgi-bin/mailman/listinfo/devel

Reply via email to