hi Richard,

Did few more iterations of testing (ptp4l in BC mode) by resetting TestUnit. 
Still observing "send sync error" with txtimeout of 100ms.

Question:
1) ptp4l is running in BC mode providing clock to other connected TestUnits and 
syncing clock from another BC/GM.
    But in the below case, it would be blocked for almost 100ms (1/10 sec) 
Wouldn't this impact ptp packets to testunit or handling ptp packets from 
BC/GM? 
                res = poll(&pfd, 1, sk_tx_timeout);
                if (res < 1) {
Sync and Announcement packets values as below 
logAnnounceInterval     -3 
logSyncInterval         -4


Feb  7 09:35:18 phc2sys: [610989.748] CLOCK_REALTIME phc offset       -16 s2 
freq  -81596 delay    566
Feb  7 09:35:18 phc2sys: [610989.748] enp95s0f1 phc offset        -1 s2 freq   
-2464 delay   3725
Feb  7 09:35:18 ptp4l: [610989.818] rms    5 max   12 freq   +649 +/-   8 delay 
  370 +/-   1
Feb  7 09:35:19 phc2sys: [610990.748] CLOCK_REALTIME phc offset         0 s2 
freq  -81585 delay    576
Feb  7 09:35:19 phc2sys: [610990.748] enp95s0f1 phc offset        -1 s2 freq   
-2465 delay   3740
Feb  7 09:35:19 ptp4l: [610990.818] rms    5 max   10 freq   +648 +/-   9 delay 
  369 +/-   0
Feb  7 09:35:20 ptp4l: [610991.536] timed out while polling for tx timestamp 
revent 0 event 2
Feb  7 09:35:20 ptp4l: [610991.536] increasing tx_timestamp_timeout may correct 
this issue, but it is likely caused by a driver bug
Feb  7 09:35:20 ptp4l: [610991.536] port 2: send sync failed
Feb  7 09:35:20 ptp4l: [610991.536] port 2: MASTER to FAULTY on FAULT_DETECTED 
(FT_UNSPECIFIED)
Feb  7 09:35:20 ptp4l: [610991.557] port 2: link down
Feb  7 09:35:20 ptp4l: [610991.557] port 2: received link status notification 
DOWN
Feb  7 09:35:20 ptp4l: [610991.557] selected best master clock 
000580.fffe.07cc8a
Feb  7 09:35:20 ptp4l: [610991.557] updating UTC offset to 37
Feb  7 09:35:20 ptp4l: [610991.557] Not client interface for BC mode enp95s0f1
Feb  7 09:35:20 ptp4l: [610991.557] port 2: received link status notification 
DOWN
Feb  7 09:35:20 ptp4l: [610991.557] port 2: received link status notification 
DOWN
Feb  7 09:35:20 ptp4l: [610991.557] port 2: received link status notification 
DOWN
Feb  7 09:35:20 phc2sys: [610991.748] port 40a6b7.fffe.0da261-2 changed state
Feb  7 09:35:20 phc2sys: [610991.748] reconfiguring after port state change
Feb  7 09:35:20 phc2sys: [610991.748] selecting enp95s0f1 for synchronization
Feb  7 09:35:20 phc2sys: [610991.748] selecting CLOCK_REALTIME for 
synchronization
Feb  7 09:35:20 phc2sys: [610991.748] selecting enp175s0f1 as the master clock
Feb  7 09:35:20 phc2sys: [610991.748] CLOCK_REALTIME phc offset        22 s2 
freq  -81563 delay    574
Feb  7 09:35:20 phc2sys: [610991.748] clock jumped backward or running slower 
than expected!
Feb  7 09:35:20 phc2sys: [610991.748] enp95s0f1 phc offset -270490592 s0 freq   
-2465 delay   1856
Feb  7 09:35:20 ptp4l: [610991.818] rms    4 max    8 freq   +651 +/-   7 delay 
  370 +/-   0
Feb  7 09:35:21 ptp4l: [610992.492] port 2: received link status notification 
DOWN
Feb  7 09:35:21 phc2sys: [610992.748] CLOCK_REALTIME phc offset        -5 s2 
freq  -81583 delay    574
Feb  7 09:35:21 phc2sys: [610992.748] clock jumped backward or running slower 
than expected!
Feb  7 09:35:21 phc2sys: [610992.748] enp95s0f1 phc offset -1040559513 s0 freq  
 -2465 delay   1878
Feb  7 09:35:21 ptp4l: [610992.818] rms    5 max    9 freq   +649 +/-   8 delay 
  369 +/-   1
Feb  7 09:35:22 phc2sys: [610993.748] CLOCK_REALTIME phc offset        -8 s2 
freq  -81587 delay    573
Feb  7 09:35:22 phc2sys: [610993.748] clock jumped backward or running slower 
than expected!

Please suggest.

regards,
Ramesh

On Tuesday, January 18, 2022, 11:08:18 PM GMT+5:30, ramesh t 
<ramesh...@yahoo.com> wrote: 





hi Richard,

With Step 1, it seems to be working fine, will try few more variations and 
check.
Thank you for your help.

But have few questions:
Patches provided doesn't seems to have any relation with error "timed out while 
polling for tx timestamp" which occurred while transmitting Sync packets on 
master side. I'm not observing this error, any reason why?

After recover/reboot of remote system, phc offset of the interface connected to 
TestUnit/remote system seems to be struck at improper value.
[root@satdd-nec-worker0 ~]# /usr/sbin/phc_ctl enp95s0f1 cmp
phc_ctl[645780.131]: offset from CLOCK_REALTIME is 31335266008ns

phc2sys: [645792.375] clock jumped backward or running slower than expected!
phc2sys: [645792.375] enp95s0f1 phc offset -68335289644 s0 freq -900000000 
delay   3779
phc2sys: [645793.375] CLOCK_REALTIME phc offset       -11 s2 freq  -79884 delay 
  1143
phc2sys: [645793.375] clock jumped backward or running slower than expected!
phc2sys: [645793.375] enp95s0f1 phc offset -68335291579 s0 freq -900000000 
delay   3767
phc2sys: [645794.375] CLOCK_REALTIME phc offset         8 s2 freq  -79868 delay 
  1156
phc2sys: [645794.376] clock jumped backward or running slower than expected!
phc2sys: [645794.376] enp95s0f1 phc offset -68335293498 s0 freq -900000000 
delay   3798
phc2sys: [645795.376] CLOCK_REALTIME phc offset         4 s2 freq  -79870 delay 
  1158

Please suggest.

regards,
Ramesh

On Monday, January 17, 2022, 09:35:54 PM GMT+5:30, Richard Cochran 
<richardcoch...@gmail.com> wrote: 





On Mon, Jan 17, 2022 at 08:25:06AM +0000, ramesh t wrote:

> Please suggest.


1. Make sure your ptp4l has these five patches:

  a082bcd clockcheck: Increase minimum interval.
  6df8425 port: Don't renew raw transport.
  e117e37 port: Don't check timestamps from non-slave ports.
  262a49b clock: Reset clock check on best clock/port change.
  7e8eba5 clock: Reset state when switching port with same best clock.

  If that doesn't fix the problem, then:

2. Disable clock sanity check.

      sanity_freq_limit
              The  maximum  allowed frequency offset between uncorrected clock
              and the system monotonic clock in parts per billion (ppb).  This
              is  used  as  a  sanity  check of the synchronized clock. When a
              larger offset is measured, a warning message will be printed and
              the servo will be reset. When set to 0, the sanity check is dis‐
              abled. The default is 200000000 (20%).


_______________________________________________
Linuxptp-users mailing list
Linuxptp-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/linuxptp-users

Reply via email to