[SR-Users] Solutions to missing BYEs, accounting for them

Alex Balashov Wed, 21 Apr 2010 02:21:22 -0700

Hi all,

Please forgive the slightly long post, but if you have anything to contribute on this topic, please consider giving it a read as I could really use your input. :-)

Like many of you who run proxy-based service delivery platforms of some description, I am faced with the problem of trying to account for calls with missing BYEs in a realistic way. There is no shortage of mailing list posts over the years on this topic. Inevitably, in a platform with sufficient call volume, some NAT'd endpoints, endpoint diversity, and other technical causes, there will be some calls that are never officially terminated from the point of view of a proxy.

The ability of the 'dialog' module to spoof bidirectional BYEs on timeout[1] goes a long way toward addressing this problem theoretically. However, there are practical obstacles to relying on it solely as a solution, mainly because there is not an acceptable timeout value to use as a trade-off. If the timeout period is set to a very low value, users will obviously complain, and in any case, depending on the destination, the worst-case scenario for maximum call billing may still be far too high. If the timeout period is set high--perhaps something like 5-8 hours--then all calls that fail to end in the normal way will be billed some excessively large amount that certainly will not sit well with users either.
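To put rough numbers on that trade-off, here is a minimal Python sketch. The function name, the per-minute rate, and the round-up-to-whole-minutes billing increment are all assumptions made for illustration, not figures from any real tariff.

```python
def worst_case_charge(timeout_seconds: int, rate_per_minute: float) -> float:
    """Charge for a call whose BYE never arrives, billed until the dialog timeout fires.

    Assumes (hypothetically) that billing rounds up to whole minutes.
    """
    billed_minutes = -(-timeout_seconds // 60)  # ceiling division
    return billed_minutes * rate_per_minute


# Hypothetical rate of 0.02 currency units per minute.
for hours in (1, 5, 8):
    print(hours, "h timeout ->", worst_case_charge(hours * 3600, 0.02))
```

Whatever the real rate, the worst-case charge grows linearly with the timeout, so a timeout long enough not to cut off legitimate calls is also long enough to produce a very large bill for a stuck one.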

If either the core delivery element of the platform or the user agent is tightly controlled by the operator of the proxy from an administrative point of view, it is indeed probably possible to rely on RTP timeouts or SIP Session Timers (SSTs) on one of the endpoints.

That doesn't create a satisfying resolution for those of us dealing with indeterminate call completion scenarios with a great deal of user and vendor diversity, though. For instance, I route to about 15 ITSPs and carriers; I think maybe one of them does 15-minute SSTs, and the rest are certainly not going to turn them on just for me, even if their SBCs/switches/things have the capability. The user endpoints are mostly Asterisk and do RTP timeout, of course, and in most cases I do get the resulting BYE. However, this discussion is about the minute but nontrivial percentage of cases in which I do not get the BYE, whether because of NAT statekeeping problems or network reachability or whatever underlying causes--in truth, I cannot accurately characterise these.

So, it seems to me that from a theoretical point of view, there are basically two directions someone in this position can go from here:


1) Inline B2BUA in the signaling path of all calls;

1a) Make it do SSTs; or

1b) Make it relay media, too, and hang up the call (bidirectional BYE) on RTP receive timeout;

2) Couple the proxy to an RTP relay and provide some mechanism by which the proxy can be made aware, in an asynchronous fashion, that an RTP timeout was detected by the relay.

It seems to me from a brief and informal survey of prior mailing list literature that #1 is the usually recommended option here.

If #1 is pursued, what is the best tool to use in the Kamailio/SIP-Router-oriented ecosystem? My default instinct would say SEMS; I really like SEMS, and use it a lot for various related chores.

The problem is that the pre-built modules and examples for SEMS mostly center on application-level functionality, while low-level documentation of its powerful C++ API is a bit impoverished, so this would take a lot of work.

Needless to say, I am interested in the option that requires the least work but still solves the problem in an elegant way from a technical and--dare I say--aesthetic perspective.

For instance, it seems clear from looking at the SEMS-1.1.1 sources that SSTs are supported in principle in core/plug-in/session_timer. But unless I am missing something, I cannot find anywhere in the sources or examples where it is actually used.

So, I suppose one option is to figure out how to make this stuff work in SEMS, and make it work. But for someone who is not attuned to the universe of its C++ API, it is a rather formidable chore. I think the same would hold true of making it observe bidirectional RTP timeout.

Turning attention to option #2, I have looked at rtpproxy (my preferred default), iptrtpproxy, and mediaproxy modules but have not found any evidence that the control protocols Kamailio/SR uses to engage them support any notion of backward asynchronous feedback in case of RTP timeout.

It would be really nice if one of these stream control protocols were augmented to kick back a packet to Kamailio that can be caught in a special event_route, like event_route[nathelper:rtp-stream-timeout], but that is clearly not the case today.

To be honest, I would not use MediaProxy even if it had this feature, because, well, let's be bluntly honest and acknowledge what the more politically aware presumably already conjecture: in light of AG Projects' zealous OpenSIPS partnership, it's difficult to muster confidence in future compatibility of MediaProxy with Kamailio. The module is there, it works, and I'm sure its maintainers are dedicated to doing whatever it takes to reverse engineer and keep it working, lift patches from OpenSIPS as necessary, etc., but who wants to be on the wrong side of the project ecosystem fence? Not I.

That leaves iptrtpproxy, whose 'switchboard' concept I do not fully comprehend due to lack of experience with it, but which holds a potentially viable, if slightly kludgy/Rube Goldbergian answer. Of the three RTP proxies, it is the only one that provides a ready means of exporting a list of media streams it is currently tracking, together with statistics on how many packets have been received, etc. It is not inconceivable to cook up an external process that will frequently check this 'switchboard', as it were, and incite Kamailio/SR to do dlg_bye() via MI if it appears that the media stream has disappeared from either side; the dialog module helpfully exports the MI command dlg_end_dlg.
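The decision logic of such an external watchdog could look roughly like the Python sketch below. Everything in it is hypothetical: the StallTracker class, the shape of the snapshot (a call identifier mapped to a pair of per-leg received-packet counters), and the three-poll threshold are assumptions for illustration. How the counters are actually read from iptrtpproxy, and how a stalled stream is mapped to the arguments dlg_end_dlg expects, are deliberately left out.

```python
from typing import Dict, List, Tuple

# (packets received from caller leg, packets received from callee leg)
Counters = Tuple[int, int]


class StallTracker:
    """Flags streams whose packet counter on either leg has stopped advancing."""

    def __init__(self, max_stalled_polls: int = 3) -> None:
        self.max_stalled_polls = max_stalled_polls
        self._last: Dict[str, Counters] = {}
        self._stalled: Dict[str, int] = {}

    def update(self, snapshot: Dict[str, Counters]) -> List[str]:
        """Feed one poll result; return call IDs whose media looks dead."""
        dead = []
        for call_id, counters in snapshot.items():
            prev = self._last.get(call_id)
            # A poll counts as stalled if either leg received nothing new.
            if prev is not None and (counters[0] == prev[0] or counters[1] == prev[1]):
                self._stalled[call_id] = self._stalled.get(call_id, 0) + 1
            else:
                self._stalled[call_id] = 0
            self._last[call_id] = counters
            if self._stalled[call_id] >= self.max_stalled_polls:
                dead.append(call_id)
        # Forget streams the relay no longer reports.
        for call_id in list(self._last):
            if call_id not in snapshot:
                del self._last[call_id]
                self._stalled.pop(call_id, None)
        return dead
```

A real watchdog would call update() once per polling interval and issue dlg_end_dlg for each returned call. Note that a call on hold or with silence suppression can legitimately send no RTP in one direction, so the threshold would need to be chosen with that in mind.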

Still, this does not seem nearly as parsimonious and reliable a solution as simply building some kind of RTP stream leg timeout notification into the control socket. After all, the control socket is open persistently, right, not on-demand? The various RTP proxies all seem to have some kind of dead peer detection internally in order to have some means of gracefully expiring resources allocated to media streams that have gone away, so it would just be a matter of passing a control frame up the socket to Kamailio/SR and wiring that to a custom event_route or a more static callback in the code.

By the way, I should mention that I am aware of and historically very sympathetic to the perspective that this kind of call control is alien to the nature of a proxy, and an appropriate job for UAs and not proxies at all. However, we all have to make pragmatic concessions to the realities of real-world operation, which I assume is the motivation for dialog timeouts, dlg_bye(), and other perversions from the point of view of a purist. :-)

I welcome your thoughts and suggestions about the easiest and most technically meritorious approach.


Thanks,

-- Alex

[1] Enabled via $dlg_ctx(timeout_bye) = 1

--
Alex Balashov - Principal
Evariste Systems LLC
1170 Peachtree Street
12th Floor, Suite 1200
Atlanta, GA 30309
Tel: +1-678-954-0670
Fax: +1-404-961-1892
Web: http://www.evaristesys.com/

_______________________________________________
SIP Express Router (SER) and Kamailio (OpenSER) - sr-users mailing list
sr-users@lists.sip-router.org
http://lists.sip-router.org/cgi-bin/mailman/listinfo/sr-users