I have a theoretical question for the list regarding the comparison of
speech synthesis techniques and their capabilities for controlling or
modifying the voice at runtime while maintaining naturalness.
It is pretty clear to me that articulatory speech synthesis
potentially has a great deal of flexibility when it comes to
dynamically altering the voice, e.g. for natural intonation,
emotional speech, singing, changing dialect or language, or changing
the identity/gender/age of the speaker, etc.
I am interested in comparing these capabilities to those of HMM-based
synthesis. Can anyone comment on, or point me to information about,
the extent to which HMM-based synthesis (e.g. using the HTS toolkit)
supports this kind of control?
Would it be fair to say that, while HMM-based synthesis may offer more
control over the voice during the training phase than
unit-concatenative approaches, runtime control of the voice (without
losing its perceived "naturalness") is about as limited in HMM-based
synthesis as it is with unit concatenation?
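To make the question concrete, the kind of runtime control I have in
mind is something like HTS-style speaker interpolation: blending the
Gaussian mean vectors of two trained speaker models by a weight to move
smoothly between voices. The sketch below is purely illustrative (the
model structure and names are my own assumptions, not the actual HTS
API):

```python
# Toy sketch of runtime voice control via model interpolation,
# in the spirit of HTS speaker interpolation. The "model" here is
# just a list of per-state mean vectors; real HTS models are far
# richer (streams, variances, decision trees, etc.).

def interpolate_means(means_a, means_b, alpha):
    """Linearly blend two speakers' per-state mean vectors.

    alpha = 0.0 gives speaker A, alpha = 1.0 gives speaker B,
    values in between give intermediate voices.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [
        [(1.0 - alpha) * a + alpha * b for a, b in zip(va, vb)]
        for va, vb in zip(means_a, means_b)
    ]

# Two tiny hypothetical "models": 2 states x 3-dimensional means.
speaker_a = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]
speaker_b = [[3.0, 2.0, 1.0], [6.0, 5.0, 4.0]]

blended = interpolate_means(speaker_a, speaker_b, 0.5)
# midway voice: [[2.0, 2.0, 2.0], [5.0, 5.0, 5.0]]
```

My question is essentially whether this sort of parameter-space
manipulation can be pushed far (emotion, dialect, singing) without the
output starting to sound unnatural.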
_______________________________________________
gnuspeech-contact mailing list
[email protected]
http://lists.gnu.org/mailman/listinfo/gnuspeech-contact