Re: [PD] speaker recognition with pd ?

Mathieu Bouchard Thu, 22 Sep 2011 12:38:59 -0700

On 2011-09-22 at 19:42:00, g...@itchybit.org wrote:

The task would be to identify, from a live talk, the voice of the current speaker among several. Training beforehand is also possible. I guess this could surely be done with a simple neural network trained on an FFT decomposition of the voices, so there must be some software out there for it...

If I recall correctly, it's better to take the log of the amplitude of the FFT, and then perhaps do an FFT again, before trying to find such timbral info.

An amplitude-wise log means that the spectra of filters add up instead of multiplying. That's supposed to make them easier to separate.

And the 2nd FFT is supposed to make it easier to separate the vowel filters from the base pitch.
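
For illustration, here is a rough sketch of that log-then-second-transform idea (usually called a cepstrum), written in plain Python rather than as a Pd patch. Everything in it is an assumption made for the example: the function names, the 512-sample frame, and the toy "voice" built from a pulse every 50 samples with a fast decay standing in for a vowel filter. It is not a speaker recognizer, only a way to see the pitch and the filter land in different parts of the result.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform; slow but dependency-free."""
    n = len(x)
    # Precompute the n-th roots of unity once and index them by (k*t) mod n.
    w = [cmath.exp(-2j * math.pi * i / n) for i in range(n)]
    return [sum(x[t] * w[(k * t) % n] for t in range(n)) for k in range(n)]


def real_cepstrum(frame, floor=1e-12):
    """Log of the spectrum amplitude, followed by a second transform."""
    n = len(frame)
    # Step 1: amplitude-wise log. A product source*filter becomes a sum.
    log_mag = [math.log(abs(c) + floor) for c in dft(frame)]
    # Step 2: second transform. log_mag is real and symmetric, so the forward
    # DFT divided by n equals the inverse DFT, and the result is real.
    return [c.real / n for c in dft(log_mag)]


# Assumed toy "voice": one pulse every PERIOD samples (the base pitch), each
# followed by a fast exponential decay (a crude stand-in for a vowel filter).
FRAME_LEN = 512
PERIOD = 50
frame = [0.6 ** (t % PERIOD) for t in range(FRAME_LEN)]

ceps = real_cepstrum(frame)

# Low indices describe the smooth filter shape; the pitch shows up
# as a peak further out, at the index equal to the pulse period.
MIN_LAG = 20  # assumed cutoff between the "filter" and "pitch" regions
pitch_lag = max(range(MIN_LAG, FRAME_LEN // 2), key=lambda q: ceps[q])
envelope_part = ceps[1:MIN_LAG]
print("pitch period estimate (samples):", pitch_lag)
```

The first few values (envelope_part here) are the sort of timbral info one might feed to a classifier, while the peak further out tracks the pitch. Where to put the cutoff between the two is a judgment call and depends on the sample rate and the voices involved.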

But I never tried any of that, or maybe I tried making a patch and then I didn't really know how I'd use it and gave up... something like that.


 _______________________________________________________________________
| Mathieu Bouchard ---- tél: +1.514.383.3801 ---- Villeray, Montréal, QC

