"Many Voices, One Mozilla" 

Hi All, 

Here is the summary of high level draft plans we are beginning with for 
enabling Voice Input in Open Web / Firefox OS. One of our Firefox OS 
contributors Andre Natal (Brazil community) has done lots of preparatory work 
and is currently continuing on a GSOC (Google Summer of Code) project around 
this. The proposed 2 phases of the plan are in email below. 

Please note the releases and estimates marked below are *all tentative* (will 
change) and will be refined over next several months. We will continue adding 
updates here: 

Wiki 
https://wiki.mozilla.org/SpeechRTC_-_Speech_enabling_the_open_web 
Bugzilla 
Bug 1032964 - [B2G][SpeechRTC][User Story]: Enabling Voice input in Firefox OS 

Trello board to track status: 
https://trello.com/b/UWXblmKb/webspeech-api 

Github: 
https://github.com/andrenatal/gecko-dev 

There is lots to do here and we are just starting, so if you are interested, 
pls watch this wiki and help with the dependent bug# being added to the 
meta-bug above. This kind of project could use great community participation 
with contributing code, collecting / testing voice samples with various accents 
(Remember "many voices, One Mozilla") to improve the acoustic / language 
models, creating fun gamifications to achieve that and yes, we would need tons 
& tons of testing! 

Regards, 
Sandip 
---- 
Sandip Kamat | Product Management, Firefox OS | Mozilla Corporation 
ska...@mozilla.com, @sankam 









<blockquote>

Subject: Re: Planning SpeechRTC integration for FxOS 

Thanks for joining the kickoff meeting. Here is the current proposal and next 
steps. We will have a regular (frequency tbd) cadence of meetings on this. 

Propose to approach in 2 phases 

    * Phase 1 – Grammar based Voice Command app 


        * Similar to this Nuance app: 
https://www.youtube.com/watch?v=p0majfIEIR8&noredirect=1 
        * Needs Web Speech API implementation 
        * Current App level solutions present in marketplace (Andre N) 


            * Voicity (uses pocketSphinx AM, custom LM, Verbio decoder) ( 
https://www.youtube.com/watch?v=cjjFvyH3kdc ) 
            * Offline recognition on Peak ( 
https://www.youtube.com/watch?v=FXKXhrRDEb8 ) 


    * 
Phase 2 – Free Speech (closer to Apple Siri experience) 

        * “Vaani” App - Virtual Assistant with free speech 
        * Web Speech API implementation enhancements 
        * Need to benchmark decoder options & select solution, several 
decisions pending 
        * Current App Crab (uses decoder from Nuance Ndev mobile – dev edition) 
( https://www.youtube.com/watch?v=pnCRH-Iznrc ) 
Next Steps 


    * Andre will finish most of phase 1 as a part of Google summer of code 
project. Olli Pettay helping him. Needs support from CJ Ku's team to finish 
gecko porting. 
    * Create Test scripts to test SpeechAPI (Andre, Clint). If that works well, 
we could declare support for SpeechAPI in version 2.x. The other implementation 
in phase 1 can be targeted for later version 2.x. 
    * Finalize AM/LM/decoder strategies and rocketbar integration for phase 2. 
Create release plan for phase 2 after some assessment of efforts. 
    * Strategy on Back end server (Build vs license vs hybrid). Work with 
services/Mark Mayo's team. (had a Call with Bill Maggs) 




</blockquote>



_______________________________________________
dev-webapps mailing list
dev-webapps@lists.mozilla.org
https://lists.mozilla.org/listinfo/dev-webapps

Reply via email to