Re: [Flashcoders] Question about approximate vowel detection in AS3

Karl DeSaulniers Fri, 04 Jun 2010 14:31:45 -0700

I would say there are about 5 - 7 mouth shapes you could distributethrough your animation that would give the impression that the avataris saying the right words.Plus if your animation is fluid (meaning it doesn't look like theavatar is straining to say the words) it probably wont be noticeableif it mouths the wrong word from time to time.

JAT


Karl


On Jun 4, 2010, at 12:25 PM, Eric E. Dolecki wrote:

I was able to match a single "a" - although even with a straight"a" therecan be some subtle variation. So I mapped variations that comeclose and Idon't need to match every value in the complete waveform overtime... everycouple together or even the first value with buffer comes prettyclose. thisis with a known, unchanging vocal waveform. So I doubt this wouldbe very
useful outside of this current system, which is a bummer.

I think it's time for me to retire this code and move on. Oh well...

Eric
On Fri, Jun 4, 2010 at 9:28 AM, Eric E. Dolecki<edole...@gmail.com> wrote:
I can get waveforms... but say "a" takes 1 second to speak. I getdifferent
waveforms over that 1 second... so I'm not matching against a single
waveform, but many waveforms in succession. This seems like atricky thing
to match against.
What might be a good approach to matching values over a certainamount oftime? Is AS3 fast enough to sync quick enough? I imagine it wouldneed tocheck for all vowels every frame matching values in waveforms overa certain
amount of time.

Eric
On Fri, Jun 4, 2010 at 8:56 AM, Eric E. Dolecki<edole...@gmail.com>wrote:
I've started implementing some code this morning in the hopes tomatch thevowel "a" this morning. Of course there are several intonationsfor thisdepending on the word it's located in, but if I can get a matchon a naked"a" I may be on to something. Like you said, I have a higherchance ofsuccess since the voice is software generated and not from randompeople's
speech patterns.
If I don't get something today I'm going to bail on the engine inthehopes of finding something useful some other time. This isn't acriticalfeature for me as I have the jaw moving with precision and theeffect comes
across. Mouth shapes would be the icing on the cake.

Eric
On Fri, Jun 4, 2010 at 8:34 AM, Karim Beyrouti<ka...@kurst.co.uk> wrote:
Yeh - not sure this will help
however - a (very talented) colleague of mine worked on a simplespeechrecognition software for mobile - it was built to recogniseabout 20
commands with 90% success rate.

His approach (in my simplistic terms) was:
1) get recordings / audio samples of the commands (in your casevowels -it should be easier as it's generated so you wont have tocompare against
too many/different intonations ) -
2) create / store a graph of the audio commands ( this used FFT(s) - toabstract and simplify, the pattern of the commands - the resultwas a square
voice print graph )
3) The stored patterns/voiceprints were then compared againstthe users
voice recording.

The trickiest part of this whole business were the Fast Fourier
Transforms - these things get very complicated, and confuse thelife out of
me. Anyway, hopefully this
will help you - seems like it might be the best approach. if youdo crackit - you will end up with a simple voice recognition system.Which would be
a brilliant and useful thing bit of code to
have...

hope this was of any use..

- karim

On 4 Jun 2010, at 01:23, Karl DeSaulniers wrote:
I would try using that to figure out a way of maping the soundsand
then translate that to your project. You are able to see thewave forms insoundbooth? Haven't used it. If so, can you run your cursor overit at anypoint to get the readings? Might be a little trivial, but mayyeild a
pattern that you can utilize.
JAT

Karl

Sent from losPhone

On Jun 3, 2010, at 6:18 PM, "Eric E. Dolecki" <edole...@gmail.com>
wrote:
SoundBooth

On Thu, Jun 3, 2010 at 6:39 PM, Karl DeSaulniers <
k...@designdrumm.com>wrote:
Do you have SoundEdit? Or the like?


Karl



On Jun 3, 2010, at 5:09 PM, Eric E. Dolecki wrote:

I think I might make waveform bitmaps and then try and compare
against the
current waveform (block EQ) - and if it's a close match,then fire
off
specific vowel events. If that works, I could do consonantstoo. If
this
works, I'll do jumping jacks and shots of Jack.

So how would I compare two bitmaps to see if a waveform (
On Thu, Jun 3, 2010 at 5:18 PM, Karl DeSaulniers <
k...@designdrumm.com
wrote:
If you need any of these files or can't find them, lmk and Ican
send off
list.

Best,

Karl



On Jun 3, 2010, at 3:37 PM, Karl DeSaulniers wrote:

Don't know if this will help, but have you looked into
WaveAnalyzer.as
or
Flash MX - Audio: Sound completion event (The source filesfor
this can
be
found in the Flash MX/Samples folder.)
They both let you control the sound. I am thinking thiswill point
you
in
a good direction. Its AS2 though.

HTH,

Karl


On Jun 3, 2010, at 2:42 PM, Eric E. Dolecki wrote:
Ya - I have the data for both things, but they extend overtime
and are
difficult to compare. It's the boiling down thesignatures into
something
simple and being able to read the playing audio lookingfor the
match
(or
near match). I thought about using bitmap data and trying to
match up
waveforms, etc. but I don't know enough about it to pullthat
off. It
seems
like a hack in a way, but if it worked, who cares I suppose.

On Thu, Jun 3, 2010 at 3:31 PM, Juan Pablo Califano <
califa010.flashcod...@gmail.com> wrote:
I'm not Henrik, but I've done some lip-synch stuff for
Disney. We
did
it pretty much the way Eric described--we just usedamplitude.
It's
not as accurate as Disney would demand on a film, butit's ok in
the
kids' game market.
I see, amplitudes could be just good enough for somestuff.
Although the "speed" and the intensitiy of the speechcould give
misleading
results, I think. I'm under the impression that you should
somehow try
to
compare the shape of the waves (somehow simplifiy yourinput to
some
value
of sets of values that are easier to compare, possibly in a
"time
window")
and compare it in some meaningful way to precalculatedsamples
to find
a
matching pattern. That's the part I have no clue about!

Cheers
Juan Pablo Califano

2010/6/3 Kerry Thompson <al...@cyberiantiger.biz>

Juan Pablo Califano wrote:
Wow. That was really uncalled for.
That was my reaction, too. I didn't see Eric as
complaining--just
asking. Maybe Henrik was just having a bad day.
For me, the hard part, which you seem to imply israther simple
here,
is
*matching+ the input audio against said profiles.Admitedly, I
don't
know
anything about digital signal processing and audioprogramming
in
general,
but "matching" sounds a bit vague. Perhaps you couldenlighten
us, I
you
feel like.
I'm not Henrik, but I've done some lip-synch stuff forDisney.
We did
it pretty much the way Eric described--we just usedamplitude.
It's
not as accurate as Disney would demand on a film, butit's ok
in the
kids' game market.
Doing something more accurate would probably involve atleast 6
mouth
positions, and if you're doing it in real time, you'dhave to
do a
reverse FFT. It can be done--there was a really goodcommerciallip-synch program that generated Action Script tocontrol mouthpositions. I don't know if it's still around--that was5 years
ago,
and it was pretty expensive (about $2,500 for one seat, I
think). It
may even have been a Director Xtra that worked with aFlash
Sprite,
but let's not talk about Director :-P

Cordially,

Kerry Thompson
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders

_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
--
http://ericd.net
Interactive design and development
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
Karl DeSaulniers
Design Drumm
http://designdrumm.com

_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
Karl DeSaulniers
Design Drumm
http://designdrumm.com

_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
--
http://ericd.net
Interactive design and development
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
Karl DeSaulniers
Design Drumm
http://designdrumm.com

_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
--
http://ericd.net
Interactive design and development
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders
--
http://ericd.net
Interactive design and development
--
http://ericd.net
Interactive design and development
--
http://ericd.net
Interactive design and development
_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders


Karl DeSaulniers
Design Drumm
http://designdrumm.com

_______________________________________________
Flashcoders mailing list
Flashcoders@chattyfig.figleaf.com
http://chattyfig.figleaf.com/mailman/listinfo/flashcoders

Re: [Flashcoders] Question about approximate vowel detection in AS3

Reply via email to