[ 
https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17292307#comment-17292307
 ] 

ASF GitHub Bot commented on TIKA-94:
------------------------------------

abehara2 commented on pull request #406:
URL: https://github.com/apache/tika/pull/406#issuecomment-787403113


   **Current issues we are working on**
   - Figure out how to instantiate and connect to AWS Transcribe and S3 
webservices
   - Video to audio conversion
   
   **What we have done so far**
   -  Implementations of speech-to-text through transcribe service and speech 
interface
   - Made speech interface non AWS dependent through auto key generation using 
UUID
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


> Speech recognition
> ------------------
>
>                 Key: TIKA-94
>                 URL: https://issues.apache.org/jira/browse/TIKA-94
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>              Labels: new-parser
>
> Like OCR for image files (TIKA-93), we could try using speech recognition to 
> extract text content (where available) from audio (and video!) files.
> The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and 
> comes with a friendly license.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to