[ 
https://issues.apache.org/jira/browse/TIKA-991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Oliver Heger updated TIKA-991:
------------------------------

    Attachment: TIKA-991.patch

This patch implements the extraction of an MP3 file's duration. It uses 
two helper classes, {{MpegStream}} and {{TailStream}}, to iterate over all 
frames in the audio file. The functionality for interpreting frame headers has 
been moved into the {{MpegStream}} class, which seemed clearer to me.
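
For readers unfamiliar with the approach, the idea of walking over all frames 
and summing their durations can be sketched roughly as follows. This is not 
the code from the patch: the class {{Mp3DurationSketch}} and its method 
{{durationMillis}} are hypothetical names, the sketch works on a byte array 
instead of a stream, it only handles Layer III frames, and its 
resynchronization is naive (tag data that happens to look like a frame header 
would be miscounted).

```java
// Illustrative sketch only; NOT the MpegStream/TailStream code from the patch.
// All names here are hypothetical.
final class Mp3DurationSketch {

    // Bit rates in kbit/s for Layer III, indexed by the 4-bit bit rate field.
    // Index 0 ("free format") and 15 (invalid) are treated as unsupported here.
    private static final int[] BITRATES_V1_L3 =
        {0, 32, 40, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320, 0};
    private static final int[] BITRATES_V2_L3 =
        {0, 8, 16, 24, 32, 40, 48, 56, 64, 80, 96, 112, 128, 144, 160, 0};

    // Sample rates in Hz for MPEG 1; MPEG 2 halves and MPEG 2.5 quarters them.
    private static final int[] SAMPLE_RATES_V1 = {44100, 48000, 32000};

    /** Sums the durations of all Layer III frames found in data, in milliseconds. */
    static double durationMillis(byte[] data) {
        double millis = 0;
        int pos = 0;
        while (pos + 4 <= data.length) {
            int b1 = data[pos] & 0xFF;
            int b2 = data[pos + 1] & 0xFF;
            int b3 = data[pos + 2] & 0xFF;

            int version = (b2 >> 3) & 0x03;       // 3 = MPEG 1, 2 = MPEG 2, 0 = MPEG 2.5
            int layer = (b2 >> 1) & 0x03;         // 1 = Layer III
            int bitrateIdx = (b3 >> 4) & 0x0F;
            int sampleRateIdx = (b3 >> 2) & 0x03;
            int padding = (b3 >> 1) & 0x01;

            // A frame header starts with 11 set bits (the sync word).
            boolean sync = b1 == 0xFF && (b2 & 0xE0) == 0xE0;
            boolean supported = version != 1 && layer == 1
                && bitrateIdx != 0 && bitrateIdx != 15 && sampleRateIdx != 3;
            if (!sync || !supported) {
                pos++;                            // not a usable header: try the next byte
                continue;
            }

            boolean mpeg1 = version == 3;
            int bitrate = (mpeg1 ? BITRATES_V1_L3 : BITRATES_V2_L3)[bitrateIdx] * 1000;
            int sampleRate =
                SAMPLE_RATES_V1[sampleRateIdx] >> (mpeg1 ? 0 : version == 2 ? 1 : 2);
            int samplesPerFrame = mpeg1 ? 1152 : 576;

            // Each frame contributes samplesPerFrame / sampleRate seconds.
            millis += samplesPerFrame * 1000.0 / sampleRate;

            // Frame length in bytes (header included); skip to the next frame.
            pos += samplesPerFrame / 8 * bitrate / sampleRate + padding;
        }
        return millis;
    }
}
```

Because the bit rate is read from every frame header, the same loop could 
also notice when the bit rate changes between frames, which is how a file 
with a variable bit rate would show up.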

This patch also fixes the issue reported in TIKA-990.
                
> Mp3Parser cannot extract the duration of an audio file
> ------------------------------------------------------
>
>                 Key: TIKA-991
>                 URL: https://issues.apache.org/jira/browse/TIKA-991
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>    Affects Versions: 1.2
>            Reporter: Oliver Heger
>              Labels: patch
>         Attachments: TIKA-991.patch
>
>
> The duration of an MP3 file is an important piece of information. Currently, 
> {{Mp3Parser}} is not able to extract this data from a file.
> Unfortunately, it is not easy to implement this functionality with the 
> current design of the {{Mp3Parser}} class. In order to obtain the duration, 
> all audio frames have to be read, and their durations have to be summed up. 
> Currently, only the beginning (the ID3v2 tags) and the end (lyrics and ID3v1) 
> are evaluated. Processing the whole file would have additional benefits, e.g. 
> MP3s with variable bit rates could be detected.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
