[squid-users] High memory usage associated with ssl_bump and broken clients

Steve Hill Fri, 08 Sep 2017 09:38:37 -0700

I've identified a problem with Squid 3.5.26 using a lot of memory whensome broken clients are on the network. Strictly speaking this isn'treally Squid's fault, but it is a denial of service mechanism so Iwonder if Squid can help mitigate it.


The situation is this:

Squid is set up as a transparent proxy performing SSL bumping.

A client makes an HTTPS connection, which Squid intercepts. The clientsends a TLS client handshake and squid responds with a handshake and thebumped certificate. The client doesn't like the bumped certificate, butrather than cleanly aborting the TLS session and then sending a TCP FIN,it just tears down the connection with a TCP RST packet.

Ordinarily, Squid's side of the connection would be torn down inresponse to the RST, so there would be no problem. But unfortunately,under high network loads the RST packet sometimes gets dropped and asfar as Squid is concerned the connection never gets closed.

The busted clients I'm seeing the most problems with retry theconnection immediately rather than waiting for a retry timer.



Problems:

1. A connection that hasn't completed the TLS handshake doesn't appearto ever time out (in this case, the server handshake and certificateexchange has been completed, but the key exchange never starts).

2. If the client sends an RST and the RST is lost, the client won't sendanother RST until Squid sends some data to it on the aborted connection.In this case, Squid is waiting for data from the client, which willnever come, and will not send any new data to the client. Squid willnever know that the client aborted the connection.

3. There is a lot of memory associated with each connection - my testssuggest around 1MB. In normal operation these kinds of dead connectionscan gradually stack up, leading to a slow but significant memory "leak";when a really badly behaved client is on the network it can open tens ofthousands of connections per minute and the memory consumption bringsdown the server.

4. We can expect similar problems with devices on flakey networkconnections, even when the clients are well behaved.



My thoughts:

Connections should have a reasonably short timeout during the TLShandshake - if a client hasn't completed the handshake and made an HTTPrequest over the encrypted connection within a few seconds, something isbroken and Squid should tear down the connection. These connectionscertainly shouldn't be able to persist forever with neither side sendingany data.



Testing:

I wrote a Python script that makes 1000 concurrent connections asquickly as it can and send a TLS client handshake over them. Once allof the connections are open, it then waits for responses from Squid(which would contain the server handshake and certificate) and quits,tearing down all of the the connections with an RST.

It seems that the RST packets for around 300 of those connections weredropped - this sounds surprising, but since all 1000 connections wereaborted simultaneously, there would be a flood of RST packets and itsprobably reasonable to expect a significant number to be dropped. Theend result was that netstat showed Squid still had about 300 establishedconnections, which would never go away.


--
 - Steve Hill
   Technical Director
   Opendium    Online Safety / Web Filtering    http://www.opendium.com

   Enquiries                 Support
   ---------                 -------
   sa...@opendium.com        supp...@opendium.com
   +44-1792-824568           +44-1792-825748
_______________________________________________
squid-users mailing list
squid-users@lists.squid-cache.org
http://lists.squid-cache.org/listinfo/squid-users

[squid-users] High memory usage associated with ssl_bump and broken clients

Reply via email to