animation - Google Speech API on Windows? -


i'm looking in using voice recognition identify sounds user makes , play animations accordingly.

the recognition needs fast, , need recognize sound @ time allow quick animation , lip syncing.

i've been having trouble getting accurate word/phoneme recognition on windows. know google provides api's mobile phone , web api google chrome, wondering if there known way utilize google's high quality , fast voice recognition windows application?

yikes. sounds difficult , while have no complete solution you, have suggestion may along way.

the fmod api firelight technologies has easy use function uses algorithm called fast fourier transform , allows read sound buffer , take action based on frequencies being captured.

you can use tie graphics , sound together. i've been toying around making objects move based on sound files playing. should not difficult use microphone input instead of pre-recorded sounds.

their software free use long not charging , super easy set up.

so far programming speech recognition goes. advice there sound editor audacity , start talking microphone , try , identify sound 'looks' studying waveform patterns being displayed.

fmod's fft module can populate array of values represent waveform pattern frame in game loop.

there simple example available displays waveform using opengl , it's called fmod frequencies example http://nccastaff.bournemouth.ac.uk/jmacey/robthebloke/www/opengl_programming.html#8

you may want @ fmod-ex package , of other tools. http://www.fmod.org/download/


Comments

Popular posts from this blog

javascript - RequestAnimationFrame not working when exiting fullscreen switching space on Safari -

Python ctypes access violation with const pointer arguments -