Re: [Freetel-codec2] Improving synthesized speech by borrowing ideas from CELT?

Adrian Musceac Fri, 19 Apr 2024 06:22:23 -0700

Hi Tomas, 

I suspect you are right regarding the noise, I've done this myself along with 
some audio compression and I was happy with the results. Eliminating or 
smoothing high-pitch artefacts due to bit errors would also be nice.


Adrian

On 19 April 2024 11:29:46 UTC, "Tomas Härdin" <g...@haerdin.se> wrote:
>Hi
>
>Following Bruce's email I took a look at
>https://github.com/drowe67/codec2/blob/main/doc/codec2.pdf to see how
>the speech synthesis in codec2 actually works. This because I suspect
>that NN synth (LPCnet?) is doing something similar to what CELT (one
>half of Opus) does, namely filling off-peak parts of the spectrum with
>noise.
>
>Filling the spectrum with noise makes the output sound a lot less
>robot-y, and robot-y sound is known in the CELT world to be due to
>collapsing all energy into a single spectral bin per Bark band which,
>looking at equation (10), is *precisely what the codec2 synth does*!
>Or, sort of. For low F0 there may be multiple peaks in some bands.
>
>One way to make the output less robot-y could be to convolve Ŝw with a
>suitable function that smooths out the peaks. In CELT this is done via
>clever use of rotations if I remember correctly. A floor of comfort
>noise might also be a good idea.
>
>Oh and on the topic of errors in the paper, there's a spelling mistake
>on page 2: anlaysed.
>
>/Tomas
>
>
>_______________________________________________
>Freetel-codec2 mailing list
>Freetel-codec2@lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/freetel-codec2

_______________________________________________
Freetel-codec2 mailing list
Freetel-codec2@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/freetel-codec2

Re: [Freetel-codec2] Improving synthesized speech by borrowing ideas from CELT?

Reply via email to