[jira] [Commented] (TIKA-94) Speech recognition

Tim Allison (Jira) Mon, 01 Mar 2021 07:48:07 -0800


    [ 
https://issues.apache.org/jira/browse/TIKA-94?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17292975#comment-17292975
 ]


Tim Allison commented on TIKA-94:
---------------------------------

Not to throw a monkey wrench into your work...and I'm really grateful you're 
adding this!...

If transcription is to audio as OCR is to image, then maybe consider following 
the pattern of TesseractOCRParser and the AbstractImageParser?

In {{main}}, I just added TikaCoreProperties.CONTENT_TYPE_PARSER_OVERRIDE for 
routing parsing to the OCR parser.


> Speech recognition
> ------------------
>
>                 Key: TIKA-94
>                 URL: https://issues.apache.org/jira/browse/TIKA-94
>             Project: Tika
>          Issue Type: New Feature
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Lewis John McGibbney
>            Priority: Minor
>              Labels: new-parser
>
> Like OCR for image files (TIKA-93), we could try using speech recognition to 
> extract text content (where available) from audio (and video!) files.
> The CMU Sphinx engine (http://cmusphinx.sourceforge.net/) looks promising and 
> comes with a friendly license.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (TIKA-94) Speech recognition

Reply via email to