Re: D for Speech and Signal Processing

Baz Fri, 29 Nov 2013 09:01:42 -0800

On Thursday, 28 November 2013 at 10:30:36 UTC, Chris wrote:

There are voice analysis and speech processing toolkits like Covarep and Voicebox (see links below) that were coded in Matlab, because they were originally only prototypes. There has been talk of porting them to C++. My first thought, as you might imagine, was why not use D? However, I don't know if there are any performance issues, especially for real time systems (in speech recognition), talking about GC, or in fact any other issues (number grinding etc.).
A lot of the analysis tools are based on some sort of HMM (http://en.wikipedia.org/wiki/Hidden_Markov_model) and I think D could handle that elegantly.
https://github.com/covarep/covarep
http://www.ee.ic.ac.uk/hp/staff/dmb/voicebox/voicebox.html

Hi, I have a little experience in DSP programming using OOP languages, so I'll try to give you my view, though my experience is mostly with entertainment DSP software (ASIO, VST, etc.).

talking about GC

In "pseudo" real time (RT) audio (where one or more buffers are overlapped) you are in a loop (an interesting example is bufferSwitch in ASIO). It's time critical and performance critical, so you'll never create a class instance or allocate a buffer there. The idea is: what triggers the GC? Memory allocation and dynamic class instance creation. It's like GUI programming: you don't destroy and recreate many objects in the "resize/realign" message handler. So the GC problem is solved: there is no GC problem, because in RT DSP you won't do something stupid that triggers a GC pass.
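To make the point concrete, here is a minimal sketch of that pattern, assuming a compiler recent enough to support the @nogc attribute. GainProcessor, setup and process are hypothetical names, and process only stands in for a driver callback such as ASIO's bufferSwitch; it is not the real ASIO signature. Everything is allocated once before streaming starts, and the callback only reuses that storage.

```d
import std.stdio : writeln;

// Hypothetical processor: all storage is allocated once, up front.
struct GainProcessor
{
    float[] scratch;    // allocated in setup(), reused by every callback
    float gain = 0.5f;

    // Called once, outside the real-time path. A GC allocation is fine here.
    void setup(size_t maxFrames)
    {
        scratch = new float[](maxFrames);
    }

    // Stand-in for the driver callback. With @nogc the compiler rejects
    // any GC allocation inside this function.
    void process(const(float)[] input, float[] output) @nogc nothrow
    {
        assert(input.length == output.length);
        assert(input.length <= scratch.length);
        foreach (i, x; input)
            scratch[i] = x * gain;              // work in the preallocated buffer
        output[] = scratch[0 .. input.length];  // element-wise copy, no allocation
    }
}

void main()
{
    GainProcessor p;
    p.setup(4);                 // allocate before the "audio loop" starts

    float[4] inBuf = [1.0f, 2.0f, -1.0f, 0.0f];
    float[4] outBuf;
    foreach (pass; 0 .. 3)      // pretend the driver calls us repeatedly
        p.process(inBuf[], outBuf[]);
    writeln(outBuf);
}
```

If someone later adds a `new` or an array append inside process, the build fails instead of the audio glitching at run time.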

In speech recognition you'll mostly use frequency-domain techniques (not to name the FFT), so basically, if you don't want to trigger a GC pass, don't use built-in arrays; make your own array type using alloc/malloc/free. For classes it's the same: you can still write your own class allocator/deallocator, as specified in the manual (even if they say it's deprecated). With user-managed classes and arrays you'll avoid most GC passes. But that doesn't change the most important rule: don't allocate in the audio buffer loop.
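A user-managed array could look roughly like the sketch below. MallocBuffer is a hypothetical name, not something from Phobos; it only relies on malloc and free from core.stdc.stdlib, and it assumes a numeric element type such as float, with no pointers into the GC heap stored in the buffer.

```d
import core.stdc.stdlib : malloc, free;
import std.stdio : writeln;

// Hypothetical fixed-size buffer whose storage lives outside the GC heap.
// Assumes T is a numeric type (float, double, int, ...).
struct MallocBuffer(T)
{
    private T* ptr;
    private size_t len;

    @disable this(this);    // forbid copies so the memory is freed exactly once

    this(size_t n) @nogc nothrow
    {
        ptr = cast(T*) malloc(n * T.sizeof);
        if (ptr is null)
            assert(0, "malloc failed");
        len = n;
        ptr[0 .. n] = cast(T) 0;    // malloc returns uninitialized memory
    }

    ~this() @nogc nothrow
    {
        free(ptr);          // free(null) is harmless for a default-initialized struct
        ptr = null;
    }

    // Slice view over the C memory. Slicing a pointer does not allocate.
    inout(T)[] opSlice() inout @nogc nothrow
    {
        return ptr[0 .. len];
    }
}

void main()
{
    auto frame = MallocBuffer!float(8);   // e.g. one analysis frame before an FFT
    float[] view = frame[];
    view[0 .. 4] = 1.0f;
    writeln(view.length, " samples, first = ", view[0], ", last = ", view[$ - 1]);
}
```

The slice still works with ordinary array code, but appending to it (`~=`) would allocate from the GC again, so treat the view as fixed size.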
