Re: [Sursound] Is it possible to mix ambisonic encoded information of different order?

Aaron Heller Mon, 12 Jun 2023 12:30:04 -0700

What I've always done is to keep the channel sets of different ambisonic
order separate, then design a decoder for each channel set, and mix the
speaker feeds.   Perhaps that's overkill.


On Sun, Jun 11, 2023 at 4:25 PM Sampo Syreeni <de...@iki.fi> wrote:

> On 2023-06-01, Jan Jacob Hofmann wrote:
>
> > is it possible/ reasonable to mix ambisonic encoded information of
> > different order?
>
> It's possible and it's reasonable, and as Fons Adriansen said above, at
> the rather high orders you're talking about, it's not much below
> optimality either. This has also been talked about in the past, with the
> — granted, a bit of a shocking — revelation to me and some others, that
> actually orders mixed this way do *not* automatically decode optimally
> in either decoder.
>
> But theoretically, this ought to be purely a decoding side issue. When
> you're mixing into or in B-format, you're essentially dealing with an
> isotropic approximation of a soundfield, around a central point. That
> approximation is always a physical one, and in ambisonic work, it's
> going to be orthogonal by the basic math. If you want to add extra
> directional accuracy, you'll add orders to your directional
> decomposition. If you can't or won't, then you don't. But in the end,
> the fact that the (3D) Fourier-Bessel series rightly normalized (too)
> preserves the power of point sources, and is an isotropic decomposition
> of an inbound far field, guarantees that the *only* thing you lose in
> lower order is directional accuracy. When going to B-format, the one
> meant to capture the physics, mixing two orders cannot lose anything.
>
> So the real trouble comes when decoding B-format into D-format. If you
> have a set of first order, POA signals, you have one particular, optimal
> equation set for how you'd lay the sound out over your speakers. If you
> had a second order HOA signal, running into something like 5.1, the
> optimal set differs quite a lot, especially in the higher frequencies,
> since the theory doesn't work by easy interference principles there, but
> by second order psychoacoustical ones, coming from the stereo work of
> Makita. Solving the problem optimally becomes rather finicky.
>
> Then, solving it for mixed orders (not usually a term used for this
> situation, but for leaving out certain spherical harmonics, e.g. for
> horizontal, pantophonic work), is even messier. How could we know in
> decoding only, blindly, that we have a superimposition of say first and
> second (arbitrary?) order signals, so that we could apply the optimum
> decoding rule to them all, at the same time?
>
> I've been toying around with this problem for a decade or so, and
> haven't found a satisfactory solution to it all. My intuition says
> this has something to do with non-negative matrix factorization and
> convex optimization, but even if that's it, I'm not quite there yet.
>
> From Dolby Surround and HARPEX -like things I've been toying around with
> doing them in the pure spherical harmonical domain to arbitrary order; a
> generizable infinite order decoder; in DirAC kind of stuff I've been
> toying around with just tensoring the STFT/MDCT-domain with the
> directional Fourier domain, complexly; and then some classical LTI DSP
> statistical learning and information/compression/rate-distortion theory
> on top. In an effort to solve the problem of how to make full spatial
> audio pack well.
>
> And then there was NFC-HOA. I was already making some progress, but that
> totally stopped me. In that one, you an mix several orders of signals,
> but suddenly you can't mix ones of separate radii. Fuck, back to the
> drawing board for me as well. :/
>
> > The sound-information (synthesized) is encoded in Ambisonic 7th order
> > while the spatial reverberation of that very sound is encoded „only“
> > to third order.
>
> In fact Fons asked you already: why go to such a high order? You'd need
> an extraordinary number of speakers to utilize such a signal. Also, an
> extraordinary computing power and a lot of real life meaasurement of
> your speaaker rig to even align your decoding solution optimally.
> Whereas in low, matched order, you can do it right with a day's
> computation time.
>
> > Reason for doing so: My reverberant information comes from several
> > directions in space. If these would not have to be encoded all up to
> > 7th order, it would save some calculation time and computation effort.
>
> They really don't have to. Take a look at Ville Pulkki's DirAC work,
> here in Finland. The gist of it is that it reconstructs both specular
> sources and reverberation, separately. The first part is identified via
> time coherence, averaging, much like Dolby Surround does it in its four
> constrainted channels, and like HARPEX does it better in the ambisonic
> work.
>
> Ville's work however is fully general and frequency dependent in its
> source recognition. And it goes beyond: it actually tries to identify
> reverberant modes from a SoundField, by using the imaginary axis of the
> Fourier transformation in time to recognize reverberant modes. Which has
> also been discussed years before on-list, when Angelo (I think) talked
> about his car interiors.
>
> > Also the reverberant information may well be more „blurry“ in respect
> > to the actual sound, as it may stay in the background of perception
> > anyway.
>
> So in reverberation, why not try out a SoundField, for a measurement?
> The original Ambisonic mic? Because it's actually calibrated to measure
> not only the pointwise pressure, as its W, but also velocity in XYZ.
> The latter are where you get the reverberant, echoing, reactive field
> measurements from.
>
> > But my emphasis is on the question, if a decode of 3rd *and* 7th order
> > information - yielding in one encoded file - would be mathematically
> > correct if it comes to the decoding of the higher order content. Would
> > there be missing something (maybe an overall lower amplitude of the
> > third order content)?
>
> As said, it will not be. As your order goes higher, in the higher order
> decoder, you'll get better and better decodings at the higher
> frequencies. Just as Fons Adriansen said, above. Doing it right, you
> will necessarily start to approach the far field diffraction limit of
> your array, both low and high.
>
> However, at the same time, your decode for the lower order will not be
> psychoacoustically optimal, and won't approach it by these principles.
> If you mix in lower order content, it won't decode optimally without
> severe extra work. At something like 3-7 differentiation, you probably
> won't hear the difference, but if you mix together even first and third
> order, you definitely will; an optimal third order decoder does not work
> well with a first order superimposed signal to the degree a specialised
> first order (esp. four (panto) or six (peri)) rig would do.
>
> The higher order stuff will though mix in when done right. It will
> spread ought to a lower order rig, even if the solution is rather
> difficult to find. E.g. on-list we've talked about many numerical
> solutions to the problems, like Wiggins's Tabu search. But if you try to
> apply the higher order optimization problem to the lower orders, it
> doesn't pan out.
>
> My long term problem is how to at least partially, blindly, tell
> arbitrary order decompositions/additions apart from each other, at
> least in part. I'm not there, even yet. :/
> --
> Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
> +358-40-3751464 <http://decoy.iki.fi/front+358-40-3751464>, 025E D175
> ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
> _______________________________________________
> Sursound mailing list
> Sursound@music.vt.edu
> https://mail.music.vt.edu/mailman/listinfo/sursound - unsubscribe here,
> edit account or options, view archives and so on.
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: 
<https://mail.music.vt.edu/mailman/private/sursound/attachments/20230612/8e3591a7/attachment.htm>
_______________________________________________
Sursound mailing list
Sursound@music.vt.edu
https://mail.music.vt.edu/mailman/listinfo/sursound - unsubscribe here, edit 
account or options, view archives and so on.

Re: [Sursound] Is it possible to mix ambisonic encoded information of different order?

Reply via email to