Hi y'all. There's been a lot of discussion over the past couple of years about rendering ambisonic soundfields down to binaural. I don't think this problem has been solved in any principled fashion yet, so I'd like to invite some discussion. Especially since I once tried my hand at the problem, and found my skills woefully lacking.

AFAICS, the task is to take a set of HRTF measurements -- the well-known and open KEMAR set, but also any other -- and to derive from it an LTI mapping from a representation of the soundfield to the two ears perceiving the field. The representation ought to be isotropic, as per basic ambisonic principles, and it ought to be matched to the order of the ambisonic field. If you had a neat set of measurements, over the whole sphere of directions, designed to lie in perfect quadrature, this would be easy as cheese.
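
To make the easy case concrete, here's a minimal sketch in Python/NumPy of what that discrete projection would look like, assuming (hypothetically) HRIRs measured at directions that form an exact spherical quadrature with known weights. The names hrirs, az, zen and weights are stand-ins, not any real dataset:

import numpy as np
from scipy.special import sph_harm

def real_sh(l, m, az, zen):
    # Real, orthonormal (N3D) spherical harmonic, built from scipy's
    # complex sph_harm(m, l, azimuth, zenith) convention.
    if m > 0:
        return np.sqrt(2) * (-1) ** m * sph_harm(m, l, az, zen).real
    if m < 0:
        return np.sqrt(2) * (-1) ** m * sph_harm(-m, l, az, zen).imag
    return sph_harm(0, l, az, zen).real

def sh_to_binaural(hrirs, az, zen, weights, order):
    # hrirs: (ndirs, 2, taps) measured left/right pairs; az, zen,
    # weights: (ndirs,). Returns (n_sh, 2, taps): one FIR filter per
    # ambisonic channel per ear.
    Y = np.stack([real_sh(l, m, az, zen)
                  for l in range(order + 1)
                  for m in range(-l, l + 1)])          # (n_sh, ndirs)
    # With perfect quadrature, this weighted sum *is* the projection
    # integral of the HRIR field onto each spherical harmonic.
    return np.einsum('kd,d,det->ket', Y, weights, hrirs)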

The trouble is that no set of measurements really behaves this way. They're not in quadrature at all, and almost *always* you'll have sparse coverage, or even a full gap, towards the direction straight down. If the directional sampling were statistically uniform over the whole sphere of directions, and in addition the sampled directions were in quadrature, it would be an easy exercise in discrete summation to gain the transform matrix we need. But in practice it very much isn't.
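
When the points aren't in quadrature, the usual workaround -- and, to the point of this post, not an obviously principled one -- is to replace the sum with a regularized least-squares fit. A sketch over the same hypothetical inputs, with Y the matrix of real spherical harmonics sampled at the measured directions, as in the previous sketch:

import numpy as np

def fit_sh_ls(hrirs, Y, lam=1e-3):
    # Y: (n_sh, ndirs); hrirs: (ndirs, 2, taps). Solves, per ear and
    # per tap, min_c ||Y.T c - h||^2 + lam ||c||^2. The ridge term lam
    # trades aliasing from the sparsely covered regions against
    # smoothing of the fit; its value here is arbitrary.
    A = Y.T                                        # (ndirs, n_sh)
    n_sh = A.shape[1]
    lhs = A.T @ A + lam * np.eye(n_sh)
    rhs = A.T @ hrirs.reshape(hrirs.shape[0], -1)  # (n_sh, 2*taps)
    c = np.linalg.solve(lhs, rhs)
    return c.reshape(n_sh, *hrirs.shape[1:])       # (n_sh, 2, taps)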

It truly isn't when you have those gaps of coverage in the HRTF data above, and especially below, the head. They lead to divergent, numerically touchy problems in *very* high dimension: if even one of the points in the KEMAR set happens to lie out of perfect quadrature, that single data point contributes error at every spherical harmonic order, to infinite order.
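
That touchiness is easy to demonstrate numerically. This toy computes the condition number of the spherical-harmonic sampling matrix for a synthetic direction set with a gap straight down, mimicking the missing measurements; it grows rapidly with order (the exact figures depend on the gap and the random draw):

import numpy as np
from scipy.special import sph_harm

rng = np.random.default_rng(0)
ndirs = 500
az = rng.uniform(0, 2 * np.pi, ndirs)
# Uniform over the spherical cap above 135 degrees zenith, i.e. no
# samples at all in the cone pointing straight down.
zen = np.arccos(rng.uniform(np.cos(3 * np.pi / 4), 1.0, ndirs))

for order in range(1, 8):
    Y = np.column_stack([sph_harm(m, l, az, zen)
                         for l in range(order + 1)
                         for m in range(-l, l + 1)])  # (ndirs, n_sh)
    print(order, np.linalg.cond(Y))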

It also doesn't help that, directionally speaking, our known HRTF/HRIR sets don't come in quadrature, so that each measurement contributes to directional aliasing. *Statistically*, the individual error contributions might be made to cancel each other out, to a degree. But then again, I know of *no* global, stochastic error metric out there, nor any optimization strategy, proven to be optimal for this sort of task.
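
For lack of a proven metric, one candidate stand-in -- my assumption, nothing established -- is plain k-fold cross-validation over the measured directions: fit the expansion on part of the point cloud, score the squared reconstruction error on the held-out rest. Roughly:

import numpy as np

def cv_error(hrirs, Y, fit, k=10, seed=0):
    # hrirs: (ndirs, 2, taps); Y: (n_sh, ndirs) real SH sampled at the
    # measured directions; fit: e.g. fit_sh_ls from the earlier sketch.
    rng = np.random.default_rng(seed)
    ndirs = hrirs.shape[0]
    folds = np.array_split(rng.permutation(ndirs), k)
    err = 0.0
    for held in folds:
        train = np.setdiff1d(np.arange(ndirs), held)
        c = fit(hrirs[train], Y[:, train])            # (n_sh, 2, taps)
        # Reconstruct the held-out HRIRs from the fitted coefficients
        # and accumulate the mean squared error.
        recon = np.einsum('sd,set->det', Y[:, held], c)
        err += np.mean((recon - hrirs[held]) ** 2)
    return err / k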

So the best framework I could think of, years past, was to interpolate the incoming directional point cloud from the KEMAR and other sets out to the whole sphere, and then to integrate; using a priori knowledge for the edge, singular cases, where a number of the empirical observations prove to be co-planar, and as such singular under inversion. I tried things such as the information-theoretic Kullback-Leibler divergence and the Vapnik-Chervonenkis dimension in order to pare the problem down. What I settled on was a kind of mutual recursion between the directional mutual information of each empirical point gained/removed and the Mahalanobis distance of each spherical harmonic added/removed. It ought to have worked.
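
For reference, a bare-bones version of just the interpolate-then-integrate step -- none of the selection machinery above, and not my old heuristic -- could look like this: smooth-spline one scalar slice of the data over the sphere with scipy, resample on a Gauss-Legendre-by-uniform-azimuth grid (an exact quadrature for band-limited integrands), and project. The smoothing factor s is a knob, not a principled choice:

import numpy as np
from scipy.interpolate import SmoothSphereBivariateSpline
from scipy.special import sph_harm

def project_via_interpolation(values, az, zen, order, s=1e-2):
    # values: (ndirs,) one real scalar per direction, e.g. one frequency
    # bin of one ear's HRTF magnitude; az, zen in radians.
    spline = SmoothSphereBivariateSpline(zen, az, values, s=s)
    # Quadrature grid: Gauss-Legendre nodes in cos(zenith), uniform
    # azimuth spacing.
    nq = 2 * (order + 1)
    x, w = np.polynomial.legendre.leggauss(nq)
    qzen, w = np.arccos(x)[::-1], w[::-1]    # sort zenith ascending
    qaz = np.linspace(0, 2 * np.pi, 2 * nq, endpoint=False)
    g = spline(qzen, qaz)                    # (nq, 2*nq) resampled grid
    daz = 2 * np.pi / (2 * nq)
    coeffs = []
    for l in range(order + 1):
        for m in range(-l, l + 1):
            Y = sph_harm(m, l, qaz[None, :], qzen[:, None])
            coeffs.append(np.sum(w[:, None] * np.conj(Y) * g) * daz)
    return np.array(coeffs)                  # complex SH coefficients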

But it didn't. My heuristic, even utilizing exhaustive search at points, didn't come even close. It didn't even approach what Gerzon did analytically for 4.0 or 5.1.

So, any better ideas on how to interpolate and integrate, using ex ante knowledge, in order to go from arbitrary point clouds to regularized, isotropic, optimized ambisonic -> binaural mappings?
--
Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
+358-40-3751464, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
