On Sunday, 29 November 2015 at 16:15:32 UTC, Guillaume Piolat wrote:
There is also a sample-wise FFT I've come across, which is expensive but avoids chunking.

Hm, I don't know what that is :).

Looking for similar grains is the idea behind the popular autocorrelation pitch detection methods. They require at least two periods, though, else there is no autocorrelation peak. Rumor has it that the non-realtime Autotune works that way, along with many modern pitch detection methods.
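Something like this minimal sketch, in numpy for illustration (the frame length, frequency range, and everything else are arbitrary choices of mine):

import numpy as np

def detect_pitch(frame, fs, fmin=60.0, fmax=500.0):
    # The frame must span at least two periods of the lowest frequency
    # of interest, otherwise no autocorrelation peak can appear.
    frame = frame - np.mean(frame)
    ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
    ac = ac / (ac[0] + 1e-12)            # normalize so lag 0 == 1
    lo, hi = int(fs / fmax), int(fs / fmin)
    lag = lo + np.argmax(ac[lo:hi])
    return fs / lag, ac[lag]             # pitch in Hz, peak height as confidence

fs = 44100
t = np.arange(int(0.04 * fs)) / fs       # 40 ms frame: fits two periods of 60 Hz
print(detect_pitch(np.sin(2 * np.pi * 220.0 * t), fs))   # ~220 Hz, high confidence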

I thought they used Laroche and Dolson's FFT-based one combined with a peak detector, but maybe that was the real-time version.

There are other full spectral resynthesis methods that throw away phase information and represent each spectral component as bandpass-filtered noise. That is rather expressive, since you can do morphing with it (like you can with images). But since you throw away phase information, I guess some attacks suffer, so you have to special-case the attacks as "residue" samples that are left in the time domain (the difference between the original signal and what the spectral components can represent).
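To make the phase-discarding part concrete, a toy version of the idea as I imagine it (frame size and overlap are my guesses, and a real system would model the residue separately): keep only the per-frame magnitudes and resynthesize with random phases, so each bin acts like a band of filtered noise.

import numpy as np

def noise_resynth(x, n=1024, hop=256, seed=0):
    # Magnitude-only resynthesis: original phases are discarded and replaced
    # with random ones, so every bin behaves like narrowband filtered noise.
    # Transients smear badly, which is why attacks need the "residue" path.
    rng = np.random.default_rng(seed)
    win = np.hanning(n)
    y = np.zeros(len(x))
    norm = np.zeros(len(x))
    for start in range(0, len(x) - n, hop):
        mag = np.abs(np.fft.rfft(win * x[start:start + n]))
        phase = rng.uniform(0.0, 2.0 * np.pi, mag.shape)
        frame = np.fft.irfft(mag * np.exp(1j * phase), n)
        y[start:start + n] += win * frame
        norm[start:start + n] += win ** 2
    return y / np.maximum(norm, 1e-9)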

I don't know what "voicedness" is. Do you mean things like vibrato?

Vibrato is the pitch variation that occurs when the larynx is well relaxed.

Yes, so that will generate sidebands in the frequency spectrum, like FM synthesis, right? So in order to pick up fast vibrato I assume you would also need to analyse the spectrum?
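To check my own intuition here, a throwaway test (all the rates and depths are arbitrary):

import numpy as np

fs = 44100
t = np.arange(fs) / fs                       # one second, so bins are 1 Hz apart
f0, fv, depth = 440.0, 6.0, 10.0             # carrier, vibrato rate, vibrato depth (Hz)
# Phase modulation equivalent to a +/-10 Hz frequency wobble at 6 Hz.
x = np.sin(2 * np.pi * f0 * t + (depth / fv) * np.sin(2 * np.pi * fv * t))
spec = np.abs(np.fft.rfft(x * np.hanning(len(x))))
# Energy shows up at 440, 440 +/- 6, 440 +/- 12, ... Hz -- FM-style sidebands.
for f in (f0 - 2 * fv, f0 - fv, f0, f0 + fv, f0 + 2 * fv):
    print(f, spec[int(round(f))])

So a fast, deep vibrato spreads the line into a cluster of sidebands, and you would need spectral analysis (or fast enough pitch tracking) to resolve it.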

Voicedness is the difference between sssssss (unvoiced) and zzzzzz (voiced). A phoneme is voiced when there are periodic glottal closures and openings.

Ah! In the 90s I read a paper in Computer Music Journal where they did singing synthesis by emulating the vocal tract as a "physical" filter model. I'm not sure if they used FoF for generating the sound. I think there was a vinyl flexi disc with it too. :-) I have it somewhere...

You might find it interesting.

When the sound isn't voiced, there is no period. There isn't a "pitch" there. So pitch detection tends to come with a confidence measure.
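In the autocorrelation sketch above, that confidence is simply the normalized peak height: close to 1 for a clean voiced frame, close to 0 for sssss-like noise. For example:

pitch, conf = detect_pitch(frame, fs)   # conf is the normalized autocorrelation peak
voiced = conf > 0.5                     # threshold is an arbitrary choice of mine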

So it is a problem in real time, but in non-real time you can work your way backwards and fill in the missing parts before doing resynthesis, I guess?

The devil in that is that voicedness itself is half a lie, or let's say a leaky abstraction: it breaks down for distorted vocals.

Right. You have a lot of these problems in sound analysis, like sound separation. The brain is so impressive. I still have trouble understanding how we can hear in 3D with two ears, like distinguishing above from below. I understand the basics of it, but it is still impressive when you try to figure out _how_.

I guess that's why IRCAM can sell licenses to superVP. :)

Their papers on that topic are interesting; they group spectral peaks by formants and move them together.
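I don't know the exact superVP recipe, but a common building block behind that kind of formant handling is a spectral-envelope estimate, for instance by cepstral smoothing; a shifter can then divide the envelope out, move the peaks, and multiply it back in. A minimal sketch (the lifter length is an arbitrary choice of mine):

import numpy as np

def spectral_envelope(frame, n_lifter=40):
    # Log-magnitude spectrum of one windowed frame.
    logmag = np.log(np.abs(np.fft.rfft(frame)) + 1e-9)
    # Real cepstrum; keep only the slowly varying (low-quefrency) part,
    # which captures the formant structure rather than the harmonics.
    ceps = np.fft.irfft(logmag)
    ceps[n_lifter:-n_lifter] = 0.0
    return np.exp(np.fft.rfft(ceps).real)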

I've read the Laroche and Dolson paper in detail, and more or less know it by heart now, but maybe you are thinking of some other paper? Their paper was good on the science part, but they leave the artistic engineering part open to the reader... ;-) More insight on the artistic engineering part is most welcome!!


