Re: [FFmpeg-user] PTS resolution[s]

Jim DeLaHunt Tue, 23 Feb 2021 00:58:36 -0800

On 2021-02-22 21:35, Mark Filipak (ffmpeg) wrote:

On 2021-02-23 00:01, Jim DeLaHunt wrote:
The Presentation Time Stamp (PTS) value which FFmpeg associates withvideo frames and audio data is a 64-bit integer. There is anassociated time base attribute for each video or audio stream, whichgives the number of seconds between successive values of PTS. Thistime base might be thought of as the resolution of PTS. Thus if youhave two PTS values pts1 and pts2, then the difference in secondsbetween them is (pts2-pts1)*time_base.
MPEG PES (Presentation Elemental Stream) uses a 27MHz (exact) clockdivided by 300 (exact), so that timebase is 1/(90000Hz)…

I've read something similar. My understanding is that MPEG PES encodesPresentation Time Stamp values as integer tick counts in the datastream. Is the timebase of 1/(90,000Hz) encoded in the data stream, orit is only defined in the spec?

…(which is 0.01[1..]ms between ticks, exactly).

Actually, for this discussion I think it's fair to say that 0.01[1..]msis not exactly 1/90 ms, it is just an approximation. Finite decimalnumbers will never get you the exact value. The rational number isexact. For this discussion, it will be clearer to use exact rationalnumbers.

…my best information so far is that, at least out of the encoder,ffmpeg encodes frames with PTS resolution = 1ms.

My impression from reading the FPS filter source code is that it isincomplete to talk about ffmpeg PTS values without also giving thecorresponding timebase value. It looks to me like the FPS filter doesnot attempt to preserve the incoming PTS values or timebase. It sets anew time base of 1/frame_rate, and generates successive integer valuesfor PTS. However, and this is crucial, it does seem to value being exactabout the value of PTS*time_base.

So, that seems to say that your statement "at least out of the encoder,ffmpeg encodes frames with PTS resolution = 1ms" is not complete withoutstating the time base value ffmpeg sets out of the encoder.

To put this into perspective, a 24fps video has delta-PTS = 41.[6..]mswhereas a 24/1.001fps video has delta-PTS = 41.708[3..]milliseconds.That means that the difference between the two is less than theresolution of the ffmpeg timebase (at least, for the encoder -- Idon't know about the decoder and the pipeline). That essentially meansthat ffmpeg can't differentiate between them based on the working PTSsthat it keeps.

But what are the time base values which ffmpeg uses for these twocases? If the time base is 1/24 in the first case, and 1,001/24,000 inthe second case, then the same integer PTS values result inPTS*time_base products being exactly the correct time offsets from thefirst frame of the video in each of the two cases.

I seek someone who can either, 1, confirm what I think, or 2, tell mewhat the resolution of the decoder and pipeline actually is.

Implicit in your use of the definite article "the" is an apparentassumption that FFmpeg has only one resolution for the decoder and thepipeline. It looks to me like FFmpeg could well take the liberty ofchanging resolution at each stage of decoder and pipeline, as long as itpreserves the values for PTS*time_base at each frame (or modifies themintentionally, as the FPS filter does).


Best regards,
     —Jim DeLaHunt

_______________________________________________
ffmpeg-user mailing list
ffmpeg-user@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-user

To unsubscribe, visit link above, or email
ffmpeg-user-requ...@ffmpeg.org with subject "unsubscribe".

Re: [FFmpeg-user] PTS resolution[s]

Reply via email to