[music-dsp] A wavetable alternative to adjusting the frequencies of harmonics to get seamless loops.

robert bristow-johnson Fri, 17 Dec 2010 22:57:14 -0800

okay, i don't seem to get any time to deal with this except late at night.

so this is continuing that thread that was named "A theory of optimal splicing of audio in the time domain."


On Dec 15, 2010, at 11:20 AM, Stefan Stenzel wrote:

On 14.12.2010 06:15, robert bristow-johnson wrote:
this isn't a problem with piano, but what if the sample is of some acoustic instrument with vibrato in the recording of a single note. then there isn't an exact pitch for the whole sample of the note, because it varies in time.
Right, but if you consider 1/loop length the fundamental frequency, vibrato becomes simple FM.

well, you will have a sparse line spectrum for your "single cycle". the "real" first harmonic becomes something like the 50th harmonic of your 1/(loop_length) fundamental if the loop had 50 cycles of the tone between endpoints. then you will have a spike at around the 100th harmonic, 150th, 200th and so on. you can DFT the entire loop length (with no windowing), and the DFT will have the Fourier coefficients of your big, long "single cycle" (which looks like 50 cycles). if there was no vibrato, the energy would be nearly all in the X[50], X[100], X[150], X[200] ... bins. because this is a DFT of an integer number of cycles, the adjacent bins would be nearly zero, relatively (if there is no vibrato).
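
A small sketch of that sparse line spectrum, using a made-up loop of 2000 samples holding exactly 50 cycles of a fundamental plus a weaker 2nd harmonic (the sizes, amplitudes and the helper name dft_bin_mag are assumptions for illustration, not anything from the thread). Only a handful of bins are evaluated, with a direct DFT sum:

```python
import cmath
import math

def dft_bin_mag(x, k):
    """Magnitude of DFT bin k, divided by the number of samples."""
    n = len(x)
    acc = sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(x))
    return abs(acc) / n

N = 2000      # loop length in samples (made-up value)
CYCLES = 50   # cycles of the tone between the loop endpoints

# Fundamental plus a weaker 2nd harmonic, no vibrato, exactly 50 cycles in the loop.
loop = [math.sin(2 * math.pi * CYCLES * i / N)
        + 0.5 * math.sin(2 * math.pi * 2 * CYCLES * i / N)
        for i in range(N)]

# Bins 50 and 100 carry the energy; their neighbors are numerically zero.
mags = {k: dft_bin_mag(loop, k) for k in (49, 50, 51, 99, 100, 101)}
for k, m in mags.items():
    print(f"X[{k}]: {m:.6f}")
```

With an integer number of cycles and no windowing, the neighbors of bins 50 and 100 come out at rounding-noise level.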

like with a piano, the higher harmonics would start to get a little sharp and, say, the "real" 12th harmonic would lie perhaps at X[601] instead of X[600] if that harmonic was 2.88 cents sharp. but the 11th harmonic and the 13th harmonic would also be sharp and not by exactly one (or some other small integer) bin. then there *will* be neighboring bins with significant energy, because it would be like a sinc() function sampled off of the integer values. you would have to interpolate around these adjacent bins to get the "true" peak location (at a fractional in-between bin location) and peak height so you would know that there is not a precise integer number of cycles of that harmonic in your loop.
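
To illustrate the leakage into neighboring bins, here is a sketch comparing a partial with exactly 20 cycles in a frame against one with 20.4 cycles (the frame length of 1024 and both cycle counts are made-up values, chosen small so the direct DFT sum stays fast). It only shows the spreading; it does not attempt the peak interpolation.

```python
import cmath
import math

def dft_bin_mag(x, k):
    """Magnitude of DFT bin k, divided by the number of samples."""
    n = len(x)
    acc = sum(v * cmath.exp(-2j * math.pi * k * i / n) for i, v in enumerate(x))
    return abs(acc) / n

N = 1024  # frame length in samples (made-up value)

def partial(cycles):
    """A unit-amplitude sine with the given number of cycles in the frame."""
    return [math.sin(2 * math.pi * cycles * i / N) for i in range(N)]

on_bin = partial(20.0)    # integer number of cycles: lands squarely on bin 20
off_bin = partial(20.4)   # 0.4 bin sharp: energy spreads sinc-like to neighbors

for k in (19, 20, 21, 22):
    print(k, round(dft_bin_mag(on_bin, k), 4), round(dft_bin_mag(off_bin, k), 4))
```

The 20.4-cycle partial shows comparable magnitudes in bins 20 and 21, which is the cue that the true peak sits between them.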

now i presume that you would want to move those slightly detuned harmonics to squarely an integer bin location and you would compute the distance from the interpolated peak to the nearest integer bin. higher or lower, i'm not entirely sure - if they're, say, 0.4 bin width sharp, you might want to bump it up to the next integer bin rather than down to the nearest bin where it would not be slightly sharp anymore. i dunno, we want to preserve these outa-tune harmonics to keep the sample "live" sounding.

now one problem, i might guess, would be *if* there is also vibrato, those harmonic peaks will get spread out among the adjacent bins, and i am not sure that it will be symmetrical about the "true" peak, and if it is not, i am not sure how you determine exactly where the peak is before moving it. not necessarily a big issue. so then you move each peak (and adjacent bins) to an integer bin location, inverse DFT, and all of the partials should have an integer number of cycles between the loop endpoints.

This might sound stoopid, as we certainly perceive it in our own time domain, but that does not mean we cannot take advantage of frequency domain processing. The problem here lies not so much in the frequency alignment itself but the pitch detection, which ideally finds a multiple of both the fundamental and the modulation frequency.

i know, selecting the correct number of cycles for the loop so that there is an integer number of vibrato cycles would be the main criterion of choosing a loop length and endpoints. you would do that with little regard of what those sharpened harmonics are doing and fix them later with this frequency-domain method. (and there is a wavetable way to do it, that tracks the varying fundamental.)

In reality, if you choose your loop to be long enough, you can almost get away with any length, even if this is completely unrelated to the original pitch. Consider a 4 sec loop, all frequencies are multiples of 0.25 Hz. At 440 Hz, this difference is just 1 cent and hardly audible.

well if you get 1760.5 cycles in the loop (because it's not exactly 440 Hz or not exactly 4 sec) instead of 1760, then you *could* get a glitch in the splice, no matter how slow the crossfade is, because when the crossfade is at 50-50 (%), then you will get destructive interference for all odd harmonics. but, i know you would adjust it a little to get an exact integer number of cycles. but, in my opinion, you would have to track the cycle phase tightly to do it, which would be equivalent to cross-correlating (or AMDF) the two loop endpoints together to get the best loop length.
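
A sketch of the half-cycle problem at the crossfade midpoint. It assumes a made-up tone with three harmonics and models the 50-50 point of the crossfade as an equal mix of the tone with a copy of itself shifted by half a fundamental cycle; the harmonic amplitudes and helper names are illustrative only.

```python
import math

HARMONIC_AMPS = {1: 1.0, 2: 0.5, 3: 0.3}   # made-up harmonic amplitudes

def tone(phase_cycles):
    """Tone value at a given phase of the fundamental, measured in cycles."""
    return sum(a * math.sin(2 * math.pi * h * phase_cycles)
               for h, a in HARMONIC_AMPS.items())

def harmonic_amplitude(signal, h):
    """Amplitude of harmonic h in exactly one period of a signal."""
    n = len(signal)
    re = sum(s * math.cos(2 * math.pi * h * i / n) for i, s in enumerate(signal))
    im = sum(s * math.sin(2 * math.pi * h * i / n) for i, s in enumerate(signal))
    return 2 * math.hypot(re, im) / n

N = 256  # samples in one fundamental period (made-up value)
outgoing = [tone(i / N) for i in range(N)]
incoming = [tone(i / N + 0.5) for i in range(N)]         # half a cycle off
midpoint = [0.5 * a + 0.5 * b for a, b in zip(outgoing, incoming)]

amps = {h: harmonic_amplitude(midpoint, h) for h in HARMONIC_AMPS}
print(amps)  # odd harmonics vanish, the even one survives
```

Slowing the crossfade does not change what happens at its midpoint, which is why the half-cycle mismatch matters regardless of fade length.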

Works for major as well as for minor chords, as for some 10CC not-in-love vocal cluster.

works for dissonance? if you were looping that, i might expect a constant-power crossfade (that hits both envelopes at 70.7% when halfway through) would be better than a constant-voltage crossfade. there are sample editors that had options to do this and this optimal splicing theory was meant to generalize the idea.
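
For reference, a sketch of the two crossfade laws. The linear pair is the usual constant-voltage fade; the sine/cosine pair is one common way to get a constant-power fade and is an assumption here, since the thread does not name a specific curve.

```python
import math

def constant_voltage_gains(t):
    """Linear fade: the two gains always sum to 1 (t runs from 0 to 1)."""
    return 1.0 - t, t

def constant_power_gains(t):
    """Sine/cosine fade: the two squared gains always sum to 1."""
    return math.cos(0.5 * math.pi * t), math.sin(0.5 * math.pi * t)

fade_out_v, fade_in_v = constant_voltage_gains(0.5)
fade_out_p, fade_in_p = constant_power_gains(0.5)

print(fade_out_v, fade_in_v)   # both envelopes at 50% halfway through
print(fade_out_p, fade_in_p)   # both envelopes at about 70.7% halfway through
```

Constant-voltage suits material that is well correlated across the splice; constant-power suits material that is not, which is the case being suggested for dissonant clusters.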

well, for sure you want the splice to be seamless for all harmonics, or better yet "partials", of any appreciable magnitude. being that there are non-harmonic partials in a lot of acoustic instruments, most certainly piano, i know why you would want to adjust them a little so that phases of all partials are aligned and the jump in the loop is seamless.
Yes, very seamless, I think this is what a loop should be. I cannot see how any frequency *not* being a multiple of the loop frequency could be represented in that loop.
[...]
i suppose i could illustrate what i mean here with a bogus example, if i haven't made it sufficiently clear. i just think that wavetable synthesis has application that is broader than just playing single-cycle loops.
To be honest I didn't quite get that. It could help if the unnamed manufacturer could be named,
I cannot yet see why it should remain anonymous.

well, i told you separately, but i'm not saying it out loud. it's such a litigious society we Americans have (even, =ahem=, the non-Americans). this company is known to have been involved in litigation in its history.

but i'll try to explain how you would employ wavetable analysis, modification, and resynthesis, to create the same loop with some slightly detuned harmonics.

so let's say it's equivalent to above, you have a vibrato going and, in very close to one or two vibrato cycles, you get 50 of the tone cycles. but the 12th harmonic is 2.88 cents sharp (a frequency ratio of 601/600). that's not so bad with the loop, because the sharpened 12th harmonic will still have precisely 601 cycles in the loop. but the 11th harmonic will not quite have 551 cycles in the loop, but it has more than 550 cycles. let's say that the 11th harmonic is 1.88 cents sharp and has 550.6 cycles in the loop and you want to bump it up to 551 cycles. you want to sharpen that harmonic a further 1.26 cents to bump it to exactly 551 cycles in the loop so the splice is nice.
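
The cents figures in this example can be checked with the standard conversion, cents = 1200*log2(ratio). A short sketch (the variable names are illustrative):

```python
import math

def cents(ratio):
    """Interval size in cents for a frequency ratio."""
    return 1200.0 * math.log2(ratio)

twelfth_sharp = cents(601 / 600)       # 12th harmonic: 601 cycles instead of 600
eleventh_sharp = cents(550.6 / 550)    # 11th harmonic: 550.6 cycles instead of 550
eleventh_bump = cents(551 / 550.6)     # extra sharpening needed to reach 551 cycles

print(f"{twelfth_sharp:.3f} {eleventh_sharp:.3f} {eleventh_bump:.3f}")
```

These come out at roughly 2.9, 1.9 and 1.3 cents, in line with the numbers quoted above.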

so here is the wavetable way to do it: let's say you derive some number (let's say 16 wavetables, for a nice number) of wavetables, equally spaced throughout the about-to-be-looped segment of tone (which has 50 cycles in it). now, without considering the vibrato for the moment, the number of cycles between neighboring wavetables would be 50/16 (or 25/8), or 3 1/8 cycles between centers of the frames you plop down and derive a wavetable from each. this means, if no wavetable alignment is done, the phase of the fundamental would advance by 1/8 cycle or 45 degrees between one wavetable and the next. so, to align the wavetables, you rotate or spin the second wavetable back by 1/8 cycle (say, by 128 samples if we're allocating 1024 samples per wavetable) to line them up. but we do our bookkeeping and retain the fact that this wavetable was rotated 1/8 cycle when resynthesizing.
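
A sketch of that rotation bookkeeping, assuming 50 cycles in the loop, 16 wavetables and 1024 samples per wavetable as in the example. The function names rotation_samples and spin_back are made up for illustration, and the tables are plain sines so the alignment can be checked directly.

```python
import math
from fractions import Fraction

TABLE_LEN = 1024       # samples per wavetable
CYCLES_IN_LOOP = 50    # tone cycles between the loop endpoints
NUM_TABLES = 16        # equally spaced wavetables

def rotation_samples(k):
    """Samples to spin wavetable k back by so it lines up with wavetable 0."""
    cycles = Fraction(CYCLES_IN_LOOP, NUM_TABLES) * k   # 25/8 cycles per frame
    frac = cycles - (cycles.numerator // cycles.denominator)
    return int(frac * TABLE_LEN)

def spin_back(table, shift):
    """Circularly delay a wavetable by `shift` samples."""
    shift %= len(table)
    return table[-shift:] + table[:-shift] if shift else list(table)

# Wavetable 0, and wavetable 1 whose fundamental has advanced by 1/8 cycle.
first = [math.sin(2 * math.pi * n / TABLE_LEN) for n in range(TABLE_LEN)]
second = [math.sin(2 * math.pi * (n / TABLE_LEN + 1 / 8)) for n in range(TABLE_LEN)]

aligned = spin_back(second, rotation_samples(1))
worst = max(abs(a - b) for a, b in zip(aligned, first))
print(rotation_samples(1), worst)
```

The values returned by rotation_samples are what would have to be remembered per wavetable so the rotation can be undone at resynthesis.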

now that will do nicely for the fundamental and lower harmonics that are very harmonic. after doing this rotating, you can perform nice DFTs on the wavetables (if there are 1024 samples per wavetable, then N=1024 in the DFT). X[0] is the DC component and let's set it to zero just so we don't have to think about it. X[1] and X[N-1] make the Fourier series coefficient for the 1st harmonic, exactly. X[2] and X[N-2] make the Fourier coefficient for the 2nd harmonic. now, because of spinning the second wavetable (lining it up with the first), the phase of the 1st and 2nd (and other lower harmonics) in the second wavetable will be nearly identical to the corresponding phases of first wavetable.
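
A sketch of reading a harmonic's amplitude and phase out of X[h] and X[N-h]. It uses a short made-up table of N=64 samples rather than 1024 so the direct DFT stays quick, and the cosine-phase convention is an assumption of this sketch.

```python
import cmath
import math

def dft(x):
    """Direct (slow) DFT, adequate for a short table."""
    n = len(x)
    return [sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
            for k in range(n)]

def harmonic(X, h):
    """Amplitude and phase (radians, cosine convention) of harmonic h."""
    n = len(X)
    amplitude = (abs(X[h]) + abs(X[n - h])) / n   # both bins share the energy
    phase = cmath.phase(X[h])
    return amplitude, phase

N = 64  # short table for illustration; the example in the text uses 1024
table = [0.8 * math.cos(2 * math.pi * 1 * i / N + 0.3)
         + 0.2 * math.cos(2 * math.pi * 2 * i / N - 1.0)
         for i in range(N)]

X = dft(table)
X[0] = 0  # drop the DC component, as in the text

amp1, phase1 = harmonic(X, 1)
amp2, phase2 = harmonic(X, 2)
print(amp1, phase1, amp2, phase2)
```

For a real wavetable X[N-h] is the complex conjugate of X[h], so the pair carries one amplitude and one phase per harmonic.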

but the 11th harmonic is not exactly the 11th harmonic. if it *were* exactly harmonic, its phase in the second wavetable would line up with the phase of the first. it's really the 11.012th harmonic (11.012 = 11*550.6/550). so when the fundamental advanced precisely 25/8 cycle to go from the first wavetable to the second, the 11th harmonic did not advance by 11*25/8 cycles but that harmonic advanced 11.012*25/8. when the wavetable is aligned (to make the lower harmonics line up) by spinning it 1/8 cycle, the 11th harmonic gets off by 0.012*(25/8) cycle or 13.5 degrees. now, for each successive wavetable, the 11th harmonic will advance in phase by 13.5 degrees and in the time of 16 wavetables, the 11th harmonic will advance 16*13.5 = 216 degrees (or 0.6 cycle).
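
The drift arithmetic above, spelled out as a sketch (the constant and variable names are illustrative):

```python
CYCLES_IN_LOOP = 50       # fundamental cycles in the loop
NUM_TABLES = 16           # wavetables across the loop
NOMINAL_HARMONIC = 11     # what wavetable synthesis treats the partial as
ACTUAL_CYCLES = 550.6     # cycles the sharp partial really completes in the loop

cycles_per_frame = CYCLES_IN_LOOP / NUM_TABLES        # 25/8 fundamental cycles
true_ratio = ACTUAL_CYCLES / CYCLES_IN_LOOP           # the "11.012th" harmonic

# Phase the partial gains on an exact 11th harmonic, per frame and per loop.
drift_cycles_per_frame = (true_ratio - NOMINAL_HARMONIC) * cycles_per_frame
drift_deg_per_frame = 360.0 * drift_cycles_per_frame
total_drift_deg = NUM_TABLES * drift_deg_per_frame

print(round(drift_deg_per_frame, 6), round(total_drift_deg, 6))
```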

even though the 11th harmonic isn't at exactly 11 times the fundamental, wavetable synthesis treats it as exactly the 11th harmonic but with the phase advancing a little with each and every successive wavetable that is created from the data.

now we *want* the phase of the 11th harmonic to be off a little bit, because it *is* supposed to be sharp a little. but we want that harmonic to complete an entire extra cycle in the time of the whole loop, so we have to help that 11th harmonic on by adding 0.4 cycle (or 144 degrees) in the time of 16 wavetables. this means we have to advance the phase artificially, by souping-up the phase for X[11] and X[N-11]: multiply X[11] with exp(j*phi) and X[N-11] with exp(-j*phi), where phi grows by 144/16 = 9 degrees with each successive wavetable.
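
A sketch of that phase bump on a single wavetable, using a short made-up table (N=64) containing only an 11th harmonic. The helper advance_harmonic is hypothetical; it multiplies X[h] by exp(j*phi) and X[N-h] by exp(-j*phi) so the table stays real after the inverse DFT.

```python
import cmath
import math

def dft(x):
    n = len(x)
    return [sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
            for k in range(n)]

def idft(X):
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * i / n) for k in range(n)) / n
            for i in range(n)]

def advance_harmonic(X, h, phi):
    """Advance harmonic h by phi radians, keeping the spectrum conjugate-symmetric."""
    Y = list(X)
    Y[h] *= cmath.exp(1j * phi)
    Y[len(X) - h] *= cmath.exp(-1j * phi)
    return Y

N = 64   # short table for illustration; the example in the text uses 1024
H = 11
table = [math.cos(2 * math.pi * H * i / N) for i in range(N)]

# Third wavetable after the first: 3 * (144 / 16) = 27 degrees of extra phase.
phi = math.radians(3 * 144 / 16)
shifted = idft(advance_harmonic(dft(table), H, phi))

worst_imag = max(abs(v.imag) for v in shifted)
worst_err = max(abs(v.real - math.cos(2 * math.pi * H * i / N + phi))
                for i, v in enumerate(shifted))
print(worst_imag, worst_err)
```

Using opposite signs on the two bins is what keeps the result real-valued; bumping only X[11] would leave a complex wavetable.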


so here is the procedure:

1. decide on loop endpoints based on getting very nearly an integer number of vibrato cycles in there and getting exactly an integer number of tone cycles in the loop length.

2. divide that loop length into a decently large integer number of equally-spaced frames. call that number of frames, K. (my example above was K=16 frames.)

3. extract the period (as a possible non-integer number of samples) for each frame and derive a representative wavetable for that frame.

4. knowing what the period length is and knowing the time spacing from one frame to the next, you know exactly how much to spin each successive wavetable to best align it with the previous. (problem is that the harmonics that are a little non-harmonic will not align as well.)

5. FFT or DFT each wavetable. this is now the Fourier series data for that waveform "snapshot" (using Andrew Horner's language) of each frame.

6. for each harmonic observe how far out of phase the last wavetable is from the first. the last wavetable is K-1 frame displacements away from the first and the phase in the last frame should be off by M*360*(K-1)/K degrees from the phase of the first where M is some integer (M=0 for a "well-tuned" harmonic, M would be the number of complete cycle "slips" for that harmonic in the whole loop length). if that is the case, that means in K frame displacements, that this harmonic advances by M cycles or M*360 degrees.

7. if the phase differential (from first to last wavetable) is off from that M*360*(K-1)/K degrees, then that harmonic does *not* advance by exactly M cycles. in that case add (with the correct sign) to that harmonic's phase k/K times that phase differential (where 0 <= k < K is the sequential index of each of the K equally-spaced frames). what you did was hurry up the phase (or slow it down) so that this harmonic completes an entire extra cycle (or two or some bigger integer) in the time of the loop length.

8. inverse DFT each Fourier series snapshot data back to the time-domain wavetable.

9. recreate the time-varying tone using wavetable synthesis (and interpolating between adjacent wavetables). every harmonic will line up at the loop endpoints.
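
Steps 6 and 7 can be sketched as a small calculation of the per-frame phase corrections for one harmonic. This is only a sketch under stated assumptions: the harmonic's drift is taken to be uniform from frame to frame and already unwrapped, and M is chosen by rounding to the nearest whole cycle (the earlier discussion leaves open whether rounding up might be preferable). The function name is made up.

```python
def phase_corrections_deg(drift_deg_per_frame, num_frames):
    """Extra phase (degrees) to add to one harmonic in each of the K frames.

    drift_deg_per_frame: measured phase advance of the harmonic from one
    aligned wavetable to the next (assumed uniform and unwrapped).
    """
    total = drift_deg_per_frame * num_frames   # advance over the whole loop
    slips = round(total / 360.0)               # M, nearest whole cycle (assumption)
    shortfall = slips * 360.0 - total          # what is missing to reach M cycles
    return [k * shortfall / num_frames for k in range(num_frames)]

# The 11th harmonic from the example: 13.5 degrees per frame, K = 16 frames.
corrections = phase_corrections_deg(13.5, 16)
print(corrections[:4])   # the correction grows by 9 degrees per frame
```

With these corrections the harmonic advances 13.5 + 9 = 22.5 degrees per frame, so after 16 frame displacements it has completed exactly one extra cycle and meets itself at the loop point.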

does this make sense? i know this is long and wordy, but without drawings i don't know how to better put it. lemme know if there are questions i might be able to answer or to better explain.


--

r b-j                  r...@audioimagination.com

"Imagination is more important than knowledge."




--
dupswapdrop -- the music-dsp mailing list and website:
subscription info, FAQ, source code archive, list archive, book reviews, dsp 
links
http://music.columbia.edu/cmc/music-dsp
http://music.columbia.edu/mailman/listinfo/music-dsp
