Re: D for Speech and Signal Processing

Chris Fri, 29 Nov 2013 11:06:07 -0800

On Friday, 29 November 2013 at 16:58:47 UTC, Baz wrote:

On Thursday, 28 November 2013 at 10:30:36 UTC, Chris wrote:
There are voice analysis and speech processing toolkits likeCovarep and Voicebox (see links below) that were coded inMatlab, because they were originally only prototypes. Therehas been talk of porting them to C++. My first thought, as youmight imagine, was why not use D? However, I don't know ifthere are any performance issues, especially for real timesystems (in speech recognition), talking about GC, or in factany other issues (number grinding etc.).
A lot of the analysis tools are based on some sort of HMM(http://en.wikipedia.org/wiki/Hidden_Markov_model) and I thinkD could handle that elegantly.
https://github.com/covarep/covarep
http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html
Hi, I have a little experience in dsp programming using ooplanguages, so I'll try to give you my mind, but my mind is morerelated to entertainment dsp softwares (asio, vst, etc...).
talking about GC
In "pseudo" real time (RT) audio (one or many buffer areoverlapped) you are a in a loop (interesting example isbufferswitch in asio). It's time critical and performancecritical, so you'll never create a class neither allocate abuffer here...The idea is: what does trigger the GC: memoryallocation and dynamic class instance creation. It's like inGUI programming: you don't destroy and recreate many objects inthe "resize/realign" message handler...So the GC problem issolved: there is no GC problem because in RT dsp you won't dosomething stupid that'll trig a GC pass.
In speech recognition you'll mostly use some frequency-domaintechnics (not to name the fft), so basically if you don't wantto trigger a GC pass, don't use build-in array and make yourown array using alloc/malloc/free. For the classes it's thesame, you can still make your own class allocator/deallocator,like specified in the manual (even if they say it'sdeprecated). With user managed classes and array you'll avoidmost of the GC passes...But it doesn't mean that the mostimportant stuff is: not to allocate in the audio buffer loop.


Thanks. That's very interesting, I'll look into it.

Re: D for Speech and Signal Processing

Reply via email to