[FFmpeg-user] 5.1 downmix to 2.0 (again) and buried dialogs

pehache Fri, 26 Aug 2022 10:33:05 -0700

Hi,

Not strictly speaking a ffmpeg (recurring) question, but ffmpeg is oftenused for that...

Since I a have only a stereo setup (albeit a decent one) attached to myTV, I started a while ago to generate downmixed 2.0 tracks with ffmpegon my video files with 5.1 (or 7.1) tracks.

My original motivation was a too low perceived loudness of the dialogscompared to the music/ambiant sound in *some* movies (not all of them!).My hypothesis at that time was that the built-in downmixing of myequiment was overweighting the left and right channels (both front andside) compared to the central channel where most dialogs are supposed tobe placed.

So I started with the "-ac 2" option in ffmpeg... Which basicallychanged nothing (as far as I could say, at least). Investigating more Ithen found the -af "pan=stereo| FL< ... | FR< ..." syntax to chose theweighting coefficient of each 5.1 channel to buiild the stereo channels.


There were recommended coefficients:
FL < 1.0*FL + 0.707*FC + 0.707*SL (and similarly from FR)
These ones were ginving the same result than -ac 2 to my ears.

There were also tons of alternate formula described on various websites... I ended up with

FL < 0.707*FL + 1.0*FC + 0.707*SL

It was doing what it was supposed to do: louder dialogs compared tomusic and ambient sounds.

However I finally observed that it was also narrowing the stereo image.Indeed, FC does not contain only voices but also a large part of themusic and ambient sounds. Overweighting FC would not narrow the stereoimage it was containing only the voices, but this is not the case.

I kept wondering why the dialog loudness is sometimes perceived too lowafter downmixing, and I have a possible explanation: the brain is verygood at isolating a voice buried in the ambient noise because it canlocated where it comes from. That's why people with hearing aids stillhave difficulties to follow a conversation when multiple people speak atthe same time: the earings aids can restore the volume, but thedirectivity is (mostly) lost... So, with a real 5.1 or 7.1 setup thebrain is not bothered by the side/rear channels when it comes to focuson the central dialogs, because they come from fully differentdirections. But after downmix, what was coming from the side/rearchannels is now coming from the front channels, making the separationtask more difficult for the brain. The solution is hence to downweightthe side/rear channels... Therefore I am now using:


FL < 1.0*FL + 0.707*FC + 0.4*SL

And it seems better to me: the dialogs are clearer, without narrowingthe stereo image. But maybe this is just what I desperately want to hear...


Any thought on all of this ?

_______________________________________________
ffmpeg-user mailing list
ffmpeg-user@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-user

To unsubscribe, visit link above, or email
ffmpeg-user-requ...@ffmpeg.org with subject "unsubscribe".

[FFmpeg-user] 5.1 downmix to 2.0 (again) and buried dialogs

Reply via email to