Yes, it's an application that we made to sync up data between to application. I do use the heartbeat API to sync up data. I have approximately 1 MB of data to sync up and I split the data into +- 30 packets of 32K.
See below the statistics, I don't see any errors or packet drops. Here is one of your statements: So, it may be that your application is pushing the communication layer's limits. Or the network limits. Hard to say without looking at the network. What do you mean about "communication layer's limits"? Network limits? Is sending 1Mbytes of data is big? This would be very surprising. Architecture is very simple. 2 nodes connected to a switch (ZNYX ZX5000). eth3 Link encap:Ethernet HWaddr 00:A0:A5:5E:08:F8 inet addr:192.168.0.149 Bcast:192.168.0.255 Mask:255.255.255.0 inet6 addr: fe80::2a0:a5ff:fe5e:8f8/64 Scope:Link UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 RX packets:184419 errors:0 dropped:0 overruns:0 frame:0 TX packets:614979 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:43421083 (41.4 MiB) TX bytes:767939664 (732.3 MiB) Base address:0x5c00 Memory:f61e0000-f6200000 eth3:0 Link encap:Ethernet HWaddr 00:A0:A5:5E:08:F8 inet addr:192.168.0.100 Bcast:192.168.0.255 Mask:255.255.255.0 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1 Base address:0x5c00 Memory:f61e0000-f6200000 lo Link encap:Local Loopback inet addr:127.0.0.1 Mask:255.0.0.0 inet6 addr: ::1/128 Scope:Host UP LOOPBACK RUNNING MTU:16436 Metric:1 RX packets:26 errors:0 dropped:0 overruns:0 frame:0 TX packets:26 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:0 RX bytes:1820 (1.7 KiB) TX bytes:1820 (1.7 KiB) Anymore details will be greatly appreciated! Jean-Michel Audet Concepteur Logiciel Software Designer Kontron Canada Tel: (450) 979-5320 1 800 354-4223 Fax: (450) 437-8053 <mailto:[EMAIL PROTECTED]> Sales:<http://www.kontron.com/> Support:<ftp://ftp.kontron.ca/Support/> L'information contenue dans le présent document est la propriété de Kontron Canada Inc. et est divulguée en toute confidentialité. Cette information ne doit pas être révisée, utilisée, divulguée ou distribuée sans le consentement écrit explicite de Kontron Canada Inc. Si vous n'êtes pas le destinataire prévu et avez reçu cette communication par erreur, veuillez contacter l'originateur et supprimer toute copie. The information contained in this document is confidential and property of Kontron Canada Inc. Any unauthorized review, use, disclosure or distribution is prohibited without express written consent of Kontron Canada Inc. If you are not the intended recipient, please contact the sender and destroy all copies of the original message and enclosed attachments. -----Message d'origine----- De : [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] De la part de Dejan Muhamedagic Envoyé : Wednesday, May 14, 2008 11:16 AM À : General Linux-HA mailing list Objet : Re: [Linux-HA] Lots of Retransmitting pkt Hi, On Wed, May 14, 2008 at 08:58:13AM -0400, Audet, Jean-Michel wrote: > Hi all, > > > > Hi, I have an implementation using heartbeat 2.1.2. It is based on > Master/Slave architecture. I have only two nodes on my implementation. > > > > > I have a lot of "Retransmitting pkt". At the end, all the packets are > received, but I think it is very slow. My packets are all around > 32Kbytes in size but on a 100MBits network not a lot of traffic, it > should not be any problems. I am using the function sendnodemsg to > directly send message to my slave node. You have a home-made application using the heartbeat API? > Here is a trace of one of the message. When sending for example 40 > packets, 10 to 20 will be a least retransmit once. > > May 13 14:49:43 node1 heartbeat: [29379]: info: Retransmitting pkt 92 > > May 13 14:49:43 node1 heartbeat: [29379]: info: msg size =32581, > type=msg_type_xyz > > > > Any idea on what can be the problem? Need more informations? Heartbeat uses UDP (unicast, broadcast, or multicast). Is you network healthy? Did you check the interfaces statistics? Did you try watching the traffic using the tcpdump (or similar) tool? Did you try unicast if you're using broadcast? I believe that often Heartbeat packets are close to or exceeding 32k with more complex configurations or more than 3-4 nodes, but very probably not very often, i.e. packets are big only when full CIB is being sent. I don't think that there were many complaints of similar nature. So, it may be that your application is pushing the communication layer's limits. Or the network limits. Hard to say without looking at the network. Thanks, Dejan > Jean-Michel Audet > Software, Kontron Canada > > > > _______________________________________________ > Linux-HA mailing list > Linux-HA@lists.linux-ha.org > http://lists.linux-ha.org/mailman/listinfo/linux-ha > See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems _______________________________________________ Linux-HA mailing list Linux-HA@lists.linux-ha.org http://lists.linux-ha.org/mailman/listinfo/linux-ha See also: http://linux-ha.org/ReportingProblems