[ 
https://issues.apache.org/jira/browse/STANBOL-1007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925749#comment-13925749
 ] 

Rupert Westenthaler commented on STANBOL-1007:
----------------------------------------------

This issue is about implementing a Stanbol EnhancementEngine [1] that uses CMU 
Sphinx to transform spoken language into plain text. This means an engine that 
consumes an audio file passed in with the ContentItem [2], extracts the text 
from the audio file, and adds the extracted text as a "text/plain" Blob to the 
same ContentItem.
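
The flow described above could be sketched roughly as follows. This is only an 
illustration: SketchContentItem, SpeechToTextSketch and the recognizer function 
are hypothetical stand-ins, not the Stanbol ContentItem/EnhancementEngine API 
and not the CMU Sphinx API. A real engine would implement the Stanbol 
interfaces and delegate recognition to Sphinx.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

// Hypothetical stand-in for a content item: MIME type -> raw bytes.
// This is NOT the Stanbol ContentItem API, only a model of the idea.
final class SketchContentItem {
    final Map<String, byte[]> blobs = new LinkedHashMap<>();
}

final class SpeechToTextSketch {
    // Hypothetical recognizer hook (assumption); a real engine would call CMU Sphinx here.
    private final Function<byte[], String> recognizer;

    SpeechToTextSketch(Function<byte[], String> recognizer) {
        this.recognizer = recognizer;
    }

    // Returns true if an audio blob was found and a "text/plain" blob was added.
    boolean enhance(SketchContentItem item) {
        byte[] audio = null;
        for (Map.Entry<String, byte[]> e : item.blobs.entrySet()) {
            if (e.getKey().startsWith("audio/")) {
                audio = e.getValue();
                break;
            }
        }
        if (audio == null) {
            return false; // nothing this engine can process
        }
        String text = recognizer.apply(audio);
        // The extracted text is stored next to the audio in the same item.
        item.blobs.put("text/plain", text.getBytes(StandardCharsets.UTF_8));
        return true;
    }
}
```

Keeping the recognizer behind a plain function makes the blob handling 
testable without acoustic or language models being available.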

The engine will also need to support loading of "Acoustic and Language Models" 
via the Apache Stanbol DataFileProvider [3] infrastructure.

This issue needs to be seen in the context of STANBOL-1291, which describes a 
possible Phonetic Linking enhancement chain for Apache Stanbol.


[1] http://stanbol.staging.apache.org/docs/trunk/components/enhancer/engines/
[2] http://stanbol.staging.apache.org/docs/trunk/components/enhancer/contentitem
[3] http://stanbol.staging.apache.org/docs/trunk/utils/datafileprovider

> Speech to Text Enhancement Engine using CMU Sphinx
> --------------------------------------------------
>
>                 Key: STANBOL-1007
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1007
>             Project: Stanbol
>          Issue Type: Sub-task
>          Components: Enhancer
>            Reporter: Rupert Westenthaler
>              Labels: gsoc2014, mentoring
>
> CMUSphinx is a speaker-independent, large-vocabulary continuous speech 
> recognizer released under a BSD-style license 
> (http://cmusphinx.sourceforge.net/wiki/).
> This library could be used to add Speech to Text capabilities to Stanbol. It 
> is important that Enhancement Results keep track of the temporal position of 
> the extracted text within the processed media file.
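
One possible way to keep track of the temporal position is to record, for each 
recognized word, both its character span in the extracted text and its time 
range in the audio. The sketch below assumes the recognizer can report 
per-word start and end times in milliseconds. TimedWord, TimedSpan and 
Transcript are hypothetical names used for illustration, not Stanbol or Sphinx 
types.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical: one recognized word with its time range in the audio (milliseconds).
final class TimedWord {
    final String text;
    final long startMs;
    final long endMs;

    TimedWord(String text, long startMs, long endMs) {
        this.text = text;
        this.startMs = startMs;
        this.endMs = endMs;
    }
}

// Hypothetical: a character span [startChar, endChar) of the text mapped to a time range.
final class TimedSpan {
    final int startChar;
    final int endChar;
    final long startMs;
    final long endMs;

    TimedSpan(int startChar, int endChar, long startMs, long endMs) {
        this.startChar = startChar;
        this.endChar = endChar;
        this.startMs = startMs;
        this.endMs = endMs;
    }
}

final class Transcript {
    final String text;
    final List<TimedSpan> spans;

    private Transcript(String text, List<TimedSpan> spans) {
        this.text = text;
        this.spans = spans;
    }

    // Joins the words with single spaces and records where each word landed.
    static Transcript of(List<TimedWord> words) {
        StringBuilder sb = new StringBuilder();
        List<TimedSpan> spans = new ArrayList<>();
        for (TimedWord w : words) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            int start = sb.length();
            sb.append(w.text);
            spans.add(new TimedSpan(start, sb.length(), w.startMs, w.endMs));
        }
        return new Transcript(sb.toString(), spans);
    }

    // Returns the span covering the character offset, or null for separators
    // and offsets outside the text.
    TimedSpan spanAt(int charOffset) {
        for (TimedSpan s : spans) {
            if (charOffset >= s.startChar && charOffset < s.endChar) {
                return s;
            }
        }
        return null;
    }
}
```

With such a mapping, an annotation on a character range of the plain text can 
be translated back to a time range in the media file.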



--
This message was sent by Atlassian JIRA
(v6.2#6252)
