On 2022-02-26, Eero Aro wrote:

The way I understood Mark Anderson's question is that he is looking for a _practical_ way to decode his existing UHJ recordings for surround loudspeaker playback with a software solution, i.e. a software replacement for the UHJ decoding in a tuner amplifier such as the Onkyo SV909.

Check. Once more I should have been more constructive. Angelo's solution is quite workable to this end, but somehow I feel it's not quite the final answer here. Especially since conversions of this kind tend to leave the original material deleted, so that one ought to get it "right" or "bestest" the first time around.

...unless Gerzon, Craven and Stuart's MLP work saves us here. In it, they developed a rather general framework of exactly invertible transforms in the digital domain. Using that, it's possible to approximate, to a high degree, almost any short-term linear and shift-invariant MIMO system in a form which gives you back the precise bits you threw at it.

So I'd say: do use Angelo's formulae, but discretize them, so that they can be inverted back into the original UHJ at a later time, to perchance be reprocessed.
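For concreteness, here's what one such exactly invertible step looks like: a single plane rotation of the coefficient matrix, factored into three integer lifting ("ladder") steps. A sketch only; it ignores the headroom and overflow care a real implementation needs, and a full UHJ transform would chain several of these plus reversible allpass sections for the 90-degree phase shifts.

  #include <math.h>
  #include <stdint.h>

  /* Rotation by angle a via three lifting steps with rounding, after
   * the MLP lossless-transform idea: R(a) factors into the shears
   * [1 t; 0 1][1 0; -s 1][1 t; 0 1] with t = tan(a/2), s = sin(a).
   * Each step adds a rounded function of the *other* channel, so the
   * inverse subtracts the very same rounded values: bit-exact. */
  static void lift_rot_fwd(int32_t *x, int32_t *y, double a)
  {
      double t = tan(a / 2.0), s = sin(a);
      *x += (int32_t)lround(t * *y);
      *y -= (int32_t)lround(s * *x);
      *x += (int32_t)lround(t * *y);
  }

  static void lift_rot_inv(int32_t *x, int32_t *y, double a)
  {
      double t = tan(a / 2.0), s = sin(a);
      *x -= (int32_t)lround(t * *y);
      *y += (int32_t)lround(s * *x);
      *x -= (int32_t)lround(t * *y);
  }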

This is easy to understand, because the demand for such software would be very small. (Both the Onkyo 909 and the Meridian 565 use digital processing and do decode UHJ, but they are not what we are talking about.)

BTW, I'm not too certain about such software being in short demand. I think the trouble is that there is no easy plug'n'play library out there to do the job. If there were, it could make handling UHJ/BHJ easy enough for it to become a much more widely used stereo encoding.
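For scale, the per-bin core of such a library is tiny. A sketch in C99, working on STFT bins, using the commonly quoted two-channel decode coefficients; I'm quoting them from memory, so check against your preferred reference before trusting them. The j operator is a wideband 90-degree phase shift, which per bin is just multiplication by I:

  #include <complex.h>

  /* Two-channel UHJ -> planar B-format (W, X, Y), one STFT bin at a
   * time.  Coefficients are the usual published 2-channel decode
   * set; verify before shipping. */
  static void uhj_decode_bin(double complex L, double complex R,
                             double complex *W, double complex *X,
                             double complex *Y)
  {
      double complex S = L + R;                /* sum        */
      double complex D = L - R;                /* difference */
      *W = 0.982 * S + 0.164 * I * D;
      *X = 0.419 * S - 0.828 * I * D;
      *Y = 0.763 * D + 0.385 * I * S;
  }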

The nasty thing, though, is that BHJ isn't isotropic, while isotropy is largely required for gaming work; that is, the thing driving *all* of VR and high-end consumer sound spatialisation right now. BHJ *can* be reverted to a form where phasiness hasn't been redistributed to the back, and so it *can* be resynthesized to any rotational frame, but again, plug'n'play libraries to do that aren't yet in place.
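The rotation part, at least, is trivial once you're back in B-format; the missing piece really is the packaging. A sketch:

  #include <math.h>

  /* Rotate a planar first-order B-format frame by theta radians
   * about the vertical axis.  W is rotation-invariant; X and Y
   * transform as a plain 2-D rotation.  For BHJ you'd decode to
   * B-format first (undoing the phasiness redistribution), rotate,
   * then re-encode. */
  static void bformat_rotate_z(float *x, float *y, double theta)
  {
      double c = cos(theta), s = sin(theta);
      double xr = c * *x - s * *y;
      double yr = s * *x + c * *y;
      *x = (float)xr;
      *y = (float)yr;
  }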

In particular, nobody's been bold enough yet to implement zero-delay, provably constant-effort convolution of the Gardner/Lake DSP kind, so that the library wouldn't have to be thought about as a component at all.
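For the record, Gardner's idea is simple enough to state: convolve the head of the impulse response directly, with no lookahead, and cover the tail with FFT partitions whose sizes double, so each partition's result is ready exactly when the output needs it. Here's a sketch of the partition plan alone, not the convolution; the two-of-each-size rule is one common way to meet the readiness deadlines, and the size cap keeps the per-block effort bounded:

  #include <stdio.h>

  int main(void)
  {
      int ir_len = 48000;   /* impulse response length, samples  */
      int head   = 64;      /* direct-form head: zero-delay part */
      int offset = head, size = head;

      printf("direct head: samples 0..%d\n", head - 1);
      while (offset < ir_len) {
          for (int k = 0; k < 2 && offset < ir_len; k++) {
              printf("FFT partition: %5d samples at offset %6d\n",
                     size, offset);
              offset += size;
          }
          if (size < 8192)
              size *= 2;
      }
      return 0;
  }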

As Sampo says, when the signal has been UHJ encoded, there is no way to retrieve the original B-Format.

Though active decoding of UHJ, mindful of its doings, could come pretty close to the best active decode of full B-format. If done right.

One of the things I've been wondering about for the longest time then is how to optimally, actively decode a B-format signal. How to do what I think Angelo Farina at one time called "an infinite order decode".

Nobody's done the exercise for a general soundfield yet. We do have the result that passive LTI shelves help, via Makita theory. We do have DirAC as an active decoder, and we do have Harpex. But none of those solutions goes to arbitrary order, and all of them are in a sense unprincipled: the classical shelving solution does an isotropic optimization over Makita criteria, assuming distant point sources; the Harpex one tries to reconstruct exactly two point sources; and DirAC, as well as it does work for point reconstruction and ambience separately, is frankly speaking a theoretical mess. E.g. whoever told us cardioid responses have anything at all to do with how the sound field really behaves?

So what would be the systematic way of dealing with active decoding? Well, I think the first two things to look at would be directional power over time, expressed as a higher order spherical harmonic decomposition, and in some dual sense how singular a signal seems wrt the decomposition we use. That is to say, since we usually detect a point source for active matrixing by it being "strong in a certain direction", that usually translates into a directional power operator of some sort, followed by sharpening along the detected direction.

In DirAC, we also do a sort of division between focused power and ambience, and further spread the ambient field across the speakers -- a powerful idea, and an eminently workable one, as I've heard in practice at Aalto. But it's still unprincipled, because it cannot differentiate between zero, one, two, however many focused sources in pantophony, or zero, one, two, three, however many in periphony, nor partition the ambience quite right if some of the sources are somewhat coherent with each other. (Think lead violin and cello in a frontal orchestra.)
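To make that "directional power operator" concrete: per STFT bin, the DirAC front end boils down to an active intensity vector and an energy density. One common form of the estimator, assuming FuMa-normalised W; in practice you'd time-average the intensity and the energy before taking their ratio:

  #include <complex.h>
  #include <math.h>

  typedef struct { double azi, ele, psi; } dirac_bin;

  /* Active intensity ~ Re{ conj(pressure) * velocity }; the sqrt(2)
   * undoes the FuMa -3 dB on W so a lone plane wave gives psi = 0. */
  static dirac_bin dirac_estimate(double complex W, double complex X,
                                  double complex Y, double complex Z)
  {
      double Ix = sqrt(2.0) * creal(conj(W) * X);
      double Iy = sqrt(2.0) * creal(conj(W) * Y);
      double Iz = sqrt(2.0) * creal(conj(W) * Z);
      double In = sqrt(Ix*Ix + Iy*Iy + Iz*Iz);

      /* energy density, up to convention-dependent constants */
      double E = cabs(W)*cabs(W)
               + 0.5 * (cabs(X)*cabs(X) + cabs(Y)*cabs(Y)
                        + cabs(Z)*cabs(Z));

      dirac_bin out;
      out.azi = atan2(Iy, Ix);
      out.ele = atan2(Iz, sqrt(Ix*Ix + Iy*Iy));
      out.psi = 1.0 - In / (E + 1e-30); /* 0 = plane wave, 1 = diffuse */
      return out;
  }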

I believe the principled way to go about this would be to treat the field as complex and harmonic, then to square it in order to find point sources, express that instantaneous solution as a higher order complex spherical harmonic expansion, extract the out-of-phase component for DirAC-like processing, and apply some time-running polynomial of the adjugate of the system function to set a variable time-frequency tradeoff. Wiener filtering theory might come in handy when dealing with the noise-signal tradeoff.

If you set it out this way, an arbitrary field with arbitrarily many, perhaps not even resolved, sources can be handled just as well as a plane wave from a single source. The amount of sharpening for a source would depend continuously on its coherence properties, and if more than one source were present, their possible mutual coherence would naturally be taken into account. DirAC processing would also take heed of anisotropic reverberation, such as when recording close to a wall or close to an orifice into a wider space. And the funkiest thing would be that the math stays of finite order: every operation would necessarily be of at most the square of the order of the original spherical harmonic decomposition, while still describing any and all distributions of point sources and ambience over the unit circle/sphere (pantophony/periphony).

I have quite a lot of Ambisonic UHJ CDs and I'd rather listen to them decoded into a surround setup than listen to them in stereo with two speakers.

Actually, even though you can't really invert UHJ to B-format, it's even more difficult to go from B-format to something like 5.1. The optimum decoding equations are a full nonlinear mess, even in the basic Makita sense, and suffer from multiple local optima. (I believe Bruce Wiggins's thesis, which used tabu search to find viable non-symmetric decoders, was an attempt to deal with the problem.)
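To see where the nonlinearity comes from, here's the heart of the Makita/Gerzon criteria for one test direction. The design problem is to pick a decoder gain matrix that makes *both* vectors point the right way with length near one, across all directions and both frequency bands at once; the gains enter linearly in one and squared in the other, whence the mess of local optima:

  #include <math.h>

  /* Gerzon velocity (rV) and energy (rE) vector lengths for a 2-D
   * rig, given the real speaker gains g[i] a candidate decoder
   * produces for one test direction, speakers at azimuths az[i] in
   * radians.  Sketch: no guard against P or E being zero. */
  static void gerzon_vectors(const double *g, const double *az, int n,
                             double *rV, double *rE)
  {
      double P = 0, E = 0, vx = 0, vy = 0, ex = 0, ey = 0;
      for (int i = 0; i < n; i++) {
          P  += g[i];
          E  += g[i] * g[i];
          vx += g[i] * cos(az[i]);        vy += g[i] * sin(az[i]);
          ex += g[i] * g[i] * cos(az[i]); ey += g[i] * g[i] * sin(az[i]);
      }
      *rV = sqrt(vx*vx + vy*vy) / P;   /* low-frequency localisation  */
      *rE = sqrt(ex*ex + ey*ey) / E;   /* high-frequency localisation */
  }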

DirAC is stupendously good at this, at least perceptually speaking. It adapts to pretty much any speaker array, and from five unevenly spaced speakers onwards sounds like there is no rig at all.

Yet I'm perfectly certain, based on the above, I could derive a physical signal which the system would decode badly. Pretty much every current system would. Say, a narrow band signal coming from around a corner, so that it has a large out-of-phase component, spreading sharply in space.

And by the way, the majority of UHJ encoded music releases _were_ recorded with a Soundfield type microphone, because the largest number of them were made by Nimbus Records.

Also thankfully so: the SoundField series is an unusually robust piece of work. Solid theory, fine engineering, unbelievably close adherence to acoustical theory which wasn't really even understood at the time the mics were designed.

The Mk4 and Mk5 have been used as *measurement* mics in acoustical research. I don't think any other mic, in any other audio discipline, really has.

Nimbus didn't use the SoundField-made microphone; they used their own setup made of two figure-of-eights and an omni.

Yes. Them idjits. 'Cause there is going to be some high frequency phasing there. It might sound good, pace the ORTF crowd, but it isn't *real* or *accurate*.

They did that mainly because the Soundfield was too noisy and they didn't need the Z signal, as it couldn't be encoded into UHJ and carved onto vinyl anyway.

Actually you *do* need Z. That's the point where I alluded to Christoph Faller above: if you cut out the third dimension, your reconstructed field will show an extra 1/r attenuation term from the rig inwards, because you're bleeding off energy into the third dimension. This is not much of a problem when you have a fully propagating field, but when you're attempting to reproduce standing waves, the problem grows much wilder. Then you really, *really* need at least some modicum of periphonic control, in order to keep the central pressure field at what it was supposed to be.

So, all Reaper users out there, please tell Mark how to do the routing in Reaper. David was already on the case.

Does anybody want to sketch out that library I talked about? I'm a theoretician, so not much of a coder. Yet I could guide a seven-year-old through the process of writing such a thing, in plain C.
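To get the ball rolling, here's one possible API surface. Every name in it is mine, a strawman to argue against, nothing that exists:

  /* uhjlib.h -- strawman API for the plug'n'play UHJ/BHJ library. */
  #ifndef UHJLIB_H
  #define UHJLIB_H

  #include <stddef.h>
  #include <stdint.h>

  typedef struct uhj_ctx uhj_ctx;  /* opaque: FFT plans, filter state */

  uhj_ctx *uhj_open(int sample_rate, size_t block_len);
  void     uhj_close(uhj_ctx *c);

  /* Two-channel UHJ -> planar first-order B-format (W, X, Y). */
  int uhj_decode(uhj_ctx *c, const float *left, const float *right,
                 float *w, float *x, float *y, size_t frames);

  /* Bit-exact reversible pair, lifting-based as sketched above:
   * uhj_encode_lossless(uhj_decode_lossless(x)) == x, to the bit. */
  int uhj_decode_lossless(uhj_ctx *c, const int32_t *lr, int32_t *wxy,
                          size_t frames);
  int uhj_encode_lossless(uhj_ctx *c, const int32_t *wxy, int32_t *lr,
                          size_t frames);

  #endif /* UHJLIB_H */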
--
Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
+358-40-3751464, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2