On Fri, Sep 23, 2011 at 09:33:59AM +0800, Chris McCormick wrote:
> On Thu, Sep 22, 2011 at 07:42:54PM +0200, g...@itchybit.org wrote:
> > The task would be to identify from a live-talk the voice of the current
> > speaker amongst several. Training before is also possible .. i guess this
> > could be done for sure by utilizing a simple neural network trained on a
> > FFT decomposition of the voices.. so there must be some software out for
> > sure...
>
> You will probably need this:
> <http://en.wikipedia.org/wiki/Mel-frequency_cepstrum>
>
> The problem you are describing is incredibly difficult.

I just realised that you are probably not talking about overlapping voices,
which is orders of magnitude more difficult than sequential voices.

Cheers,

Chris.

-------------------
http://mccormick.cx