[ https://issues.apache.org/jira/browse/TIKA-1113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13931680#comment-13931680 ]
Fabian Lange commented on TIKA-1113: ------------------------------------ This is still a bug in 1.5. We observed it by parsing the BigBuckBunny ogg video during tests. The TypeDetector detects it as audio/ogg instead of video/ogg followed by the above class cast exception during parsing. Can we expedite the fix somehow? I can contribute testing. > Parsing for OGV file results in java.lang.ClassCastException > ------------------------------------------------------------ > > Key: TIKA-1113 > URL: https://issues.apache.org/jira/browse/TIKA-1113 > Project: Tika > Issue Type: Bug > Components: metadata, parser > Affects Versions: 1.3 > Environment: OS X 10.8.3 > JDK 1.6.0_45 64-bit > Reporter: Alexander Chow > > When parsing any OGV file (e.g., > [gizmo.ogv|http://www.808.dk/pics/video/gizmo.ogv]), log will output > something like the following: > {code} > Warning - invalid checksum on page 2 of stream 3f1 (1009) > Warning - invalid checksum on page 3 of stream 3f1 (1009) > Warning - invalid checksum on page 4 of stream 3f1 (1009) > Warning - invalid checksum on page 5 of stream 3f1 (1009) > Warning - invalid checksum on page 6 of stream 3f1 (1009) > Warning - invalid checksum on page 7 of stream 3f1 (1009) > Warning - invalid checksum on page 22 of stream 3f1 (1009) > Warning - invalid checksum on page 33 of stream 3f1 (1009) > Warning - invalid checksum on page 34 of stream 3f1 (1009) > Warning - invalid checksum on page 35 of stream 3f1 (1009) > Warning - invalid checksum on page 36 of stream 3f1 (1009) > Warning - invalid checksum on page 37 of stream 3f1 (1009) > Warning - invalid checksum on page 38 of stream 3f1 (1009) > Warning - invalid checksum on page 52 of stream 3f1 (1009) > Warning - invalid checksum on page 65 of stream 3f1 (1009) > Warning - invalid checksum on page 69 of stream 3f1 (1009) > Warning - invalid checksum on page 75 of stream 3f1 (1009) > Warning - invalid checksum on page 76 of stream 3f1 (1009) > Warning - invalid checksum on page 77 of stream 3f1 (1009) > Warning - invalid checksum on page 78 of stream 3f1 (1009) > Warning - invalid checksum on page 79 of stream 3f1 (1009) > Warning - invalid checksum on page 80 of stream 3f1 (1009) > Exception in thread "main" org.apache.tika.exception.TikaException: > Unexpected RuntimeException from org.gagravarr.tika.VorbisParser@7c29e357 > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242) > at > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:120) > at com.test.OGVTest.main(OGVTest.java:31) > Caused by: java.lang.ClassCastException: org.gagravarr.vorbis.VorbisAudioData > cannot be cast to org.gagravarr.vorbis.VorbisInfo > at org.gagravarr.vorbis.VorbisFile.<init>(VorbisFile.java:78) > at org.gagravarr.vorbis.VorbisFile.<init>(VorbisFile.java:55) > at org.gagravarr.tika.VorbisParser.parse(VorbisParser.java:58) > at > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242) > ... 3 more > {code} > Testcode is the following: > {code:title=OGVTest.java} > void parse(String fileName) throws Exception { > InputStream inputStream = new FileInputStream(fileName); > > Metadata metadata = new Metadata(); > > Parser parser = new AutoDetectParser(); > > ParseContext parserContext = new ParseContext(); > parserContext.set(Parser.class, parser); > ContentHandler contentHandler = new WriteOutContentHandler( > new DummyWriter()); > parser.parse(inputStream, contentHandler, metadata, > parserContext); > > System.out.println(metadata); > } > {code} -- This message was sent by Atlassian JIRA (v6.2#6252)