Re: Linux TCP in the presence of delays or drops...

Oumer Teyeb Mon, 31 Jul 2006 10:52:06 -0700

Hi,

it would be so great if some of you could spare a few minutes and take alook at the traces I provided.....see below for the original postng...Ijust had a couple of things to add which I noticed in linux TCPbehaviour which I have not seen documented anywhere else (or which Imight have misread..:-)...and below I have given yet another trace thatillustrates one of the TCP linux behaviour which I am having troubleunderstanding....

-If multiple timeouts occur for one packet then even if we are using thetimestamp option or FRTO TCP linux is not able to detect spuriousretransmissions... and TCP linux is able to detect spuriousretransmissions only for a single timeout for one packet or fastretransmissions that are caused by duplicate ACK reception.....I havesome traces that show this behaviour, let me know if you are interested.

-In the cases where TCP timestamp or FRTO is not able to detect spuriousretransmissions, the performance degrades even more than when TCPtimestamp or FRTO option are not used....

I also have one additional trace that shows the problem with the case ofan explained pause in the tcp sender during retransmission which I foundreally hard to explain.... it is similar to the case 1) but this time Iam doing an upgrade instead from a 384kbps connection to 1Mbpsconnection.... the traces and tcptrace time sequence curve can be foundat...

http://kom.aau.dk/~oumer/drop_0_delay_UPGRADE_SERVER.dat
http://kom.aau.dk/~oumer/drop_0_delay_UPGRADE_CLIENT.dat
and the tcptrace time sequence curve can be found in
http://kom.aau.dk/~oumer/drop_0_delay_UPGRADE.ps

as you can see from the server side trace... (all the packets shown hereare retransmissions because I flushed the sender's buffer at timeinstant 17:26:24.657)

17:26:26.261972  2267693336:2267694796(1460) ack 3498775069 win 5840 (DF)
17:26:26.319180  . ack 2267694796 win 61320 (DF) [tos 0x8]
17:26:26.321961  2267694796:2267696256(1460) ack 3498775069 win 5840 (DF)
17:26:26.379160  . ack 2267696256 win 61320 (DF) [tos 0x8]
17:26:26.381940 . 2267696256:2267697716(1460) ack 3498775069 win 5840 (DF)
17:26:26.439138  . ack 2267697716 win 61320 (DF) [tos 0x8]
17:26:26.441925  2267697716:2267699176(1460) ack 3498775069 win 5840 (DF)
17:26:26.499144   ack 2267699176 win 61320 (DF) [tos 0x8]
17:26:28.234327  2267699176:2267700636(1460) ack 3498775069 win 5840 (DF)

eventhough the server got an ACK with # ack 2267699176 at timeinstant17:26:26.49...it waited till 17:26:28.234 to resend the packet... whichis around1.73 seconds... I have checked with other traces where I introduceddelay and for the link the first timeout occurs after 1.73 second, whichseems to be the RTO at that time, and for no apparent reasonTCP is wating for a timeout... case 1 is quite similar but there theretransmissions were triggered by timeout to begin with, here theretransmissions are triggered by duplicate ACKs...in the case1 describedbelow this abnormal behaviour occured after only a couple of packetswere retransmitted...here it took quite some retransmissions before thesame problem happend... any insight into this is greatly appreciated!!


Thanks in advance,
Oumer

Oumer Teyeb wrote:

Hi all,
I have some questions regarding Linux TCP in the presence of delays orpacket drops. It is somehow long mail, but the questions are two orthree, just wanted to provide a detailed information so that theproblem is clear. thanx for the patience!!
Best regards,
Oumer
Note that for the traces referred here, SACK,timestamps, and FRTO areall disabled...
1) packet drops
================
I have a trace where the tcp sender window is flushed and then theconnection speed is changed from 1Mbps to 384kbps...
The trace files from both the client and the server side can be found at
http://kom.aau.dk/~oumer/drop_0_delay_SERVER.dat
http://kom.aau.dk/~oumer/drop_0_delay_CLIENT.dat
and the tcptrace time sequence curve can be found in
http://kom.aau.dk/~oumer/drop_0_delay.ps
as can be seen from the plot and the trace files at around17:19:35.705733, the window was flushed (both the sender's andreceivers), and hence packets with seq numbers from1840001135 upto 1840058075 were dropped (39 packets)...and also theACK for 1840001135 was also dropped (from the traces this can be seenas it appears
in the client trace but not on the server trace)...
and since there were still packets to be sent the sender keeps sendinga few more packets
and when  few of them are received (from the client side trace..)
17:19:35.938017 1840059535:1840060995(1460) ack 3059152863 win 5840(DF)...17:19:35.938028 ack 1840001135 win 62780 (DF) [tos 0x8]...first ACKthat is going to be received by the sender
17:19:35.969316  1840060995:1840062455(1460) ack 3059152863 win 5840 (DF)
17:19:35.969325 1840001135 win 62780 (DF) [tos 0x8]....firstduplicate ACK
17:19:36.000519  1840062455:1840063915(1460) ack 3059152863 win 5840 (DF)
17:19:36.000528 ack 1840001135 win 62780 (DF) [tos 0x8]... secondduplicate ACK
when the server gets this 2nd duplicate ACK, it retransmits thepackets (this is clearly visible from the tcptrace curve.)..eventhougha 3rd duplicate ACK soon follows.so my first question "why is the second duplicate ACK triggering aretransmission?"...
also after that, there are a couple of retransmissions triggerd by thereception of the ACK for the new ACKs and at time instant (serverside trace)17:19:36.057149 . 1840001135:1840002595(1460) ack 3059152863 win 5840(DF)..first packet retransmitted17:19:36.085569 ack 1840001135 win 62780 (DF) [tos 0x8] ...this isthe third duplicate ACK which should have caused the retrans, but letsignore it for now
17:19:36.248599  ack 1840002595 ...retransmitted packet acked
17:19:36.251382 1840002595:1840004055(1460) ack 3059152863 win 5840(DF) ... next packet retransmitted17:19:36.442831 ack 1840004055 win 61320 (DF) [tos 0x8]...2nd packetacked also17:19:36.445625 1840004055:1840005515(1460) ack 3059152863 win 5840(DF) .. third packet retransmitted17:19:36.637224 ack 1840005515 win 61320 (DF) [tos 0x8] ... thirdpacket acked17:19:37.417022 1840005515:1840006975(1460) ack 3059152863 win 5840(DF) ... fourth packet retransmitted
As you can see there is 0.8 second gap between the ack for thereception of the ACK for the third packet and the sending of thefourth packet...so my second question "why didnt the sender immediatly
send the fourth packet after the reception of the ack for the third?"
I generated the same scenario 20 times, and the same thing happens inall of them...
2)packet delays
===============
in the second scenario, I have a 2 second delay, but no packetdrops...the downgrade in bandwidth also happens, but the packets inthe window are buffered for 2 seconds and released...
The trace files from both the client and the server side can be foundat....
http://kom.aau.dk/~oumer/delay_0_drop_SERVER.dat
http://kom.aau.dk/~oumer/delay_0_drop_CLIENT.dat
and the tcptrace time sequence curve can be found in
http://kom.aau.dk/~oumer/delay_0_drop.ps

The delay is applied from 17:20:01.066725 to 17:20:03.067022
as can be seen from the traces and plot packets with seq number1858561966 to 1858618906 ( a total of 40 packets) were queued at theserver and one packet from the receiver, which is an ACK for
pkt # 1858560506  ....
at around 17:20:03.15 this ack is received and sender thinks this isthe result of its retransmission (which actually was dropped, so atthis point the receiver hasnot got any retransmissions).. and thenormal retransmission is resumed (as well as sending of some new data,as the window allows it) as can be seen from the server side traceupto time instant 17:20:04.539682...at which point we can see that on the client trace theretransmissions actually start arriving at the receiver (so far theACKs that were triggering the retransmissions were acks to thereception of the originalbut delayed packets)...and this duplicate arrivals lead to multipleduplicate ACKs... what I dont understand is why this duplicate ACKs(there are 40 duplicate ACKs.), no fast retransmission was triggered..so my third question "Why is it that the duplicate ACKs are nottiggerring fast retransmissions?" this creates a 1.3 second gaptransmission gap...actually this is better than fast retransmissionbecause it is not leading to further retransmissions...so is the linuxTCP so clever that it can figure out the problem without using SACK,timestamps or FRTO ? ...or is this a special "feature" :-)....
I have repeated this also twenty times and the traces are similar...








-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html



-
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: Linux TCP in the presence of delays or drops...

Reply via email to