[ https://issues.apache.org/jira/browse/TIKA-969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Richard Eccles updated TIKA-969:
--------------------------------

    Attachment: withoutISOSpeedStuffProperties.jpg
                withISOSpeedProperties.jpg
                withoutISOspeed.jpg
                withISOSpeed.jpg

When parsing the above files, the file "withoutISOSpeed.jpg" will cause the exception below. If you go to the Properties of the file, enter an 'ISO Speed', and then parse the file again, the exception is not thrown.

HTTP ERROR 500

Problem accessing /tridion/update/extract. Reason:

org.apache.tika.exception.TikaException: Can't read JPEG metadata

org.apache.solr.common.SolrException: org.apache.tika.exception.TikaException: Can't read JPEG metadata
    at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:220)
    at org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(ContentStreamHandlerBase.java:54)
    at org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandlerBase.java:131)
    at org.apache.solr.core.SolrCore.execute(SolrCore.java:1358)
    at org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter.java:341)
    at org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilter.java:244)
    at org.mortbay.jetty.servlet.ServletHandler$CachedChain.doFilter(ServletHandler.java:1212)
    at org.mortbay.jetty.servlet.ServletHandler.handle(ServletHandler.java:399)
    at org.mortbay.jetty.security.SecurityHandler.handle(SecurityHandler.java:216)
    at org.mortbay.jetty.servlet.SessionHandler.handle(SessionHandler.java:182)
    at org.mortbay.jetty.handler.ContextHandler.handle(ContextHandler.java:766)
    at org.mortbay.jetty.webapp.WebAppContext.handle(WebAppContext.java:450)
    at org.mortbay.jetty.handler.ContextHandlerCollection.handle(ContextHandlerCollection.java:230)
    at org.mortbay.jetty.handler.HandlerCollection.handle(HandlerCollection.java:114)
    at org.mortbay.jetty.handler.HandlerWrapper.handle(HandlerWrapper.java:152)
    at org.mortbay.jetty.Server.handle(Server.java:326)
    at org.mortbay.jetty.HttpConnection.handleRequest(HttpConnection.java:542)
    at org.mortbay.jetty.HttpConnection$RequestHandler.content(HttpConnection.java:945)
    at org.mortbay.jetty.HttpParser.parseNext(HttpParser.java:756)
    at org.mortbay.jetty.HttpParser.parseAvailable(HttpParser.java:212)
    at org.mortbay.jetty.HttpConnection.handle(HttpConnection.java:404)
    at org.mortbay.io.nio.SelectChannelEndPoint.run(SelectChannelEndPoint.java:410)
    at org.mortbay.thread.QueuedThreadPool$PoolThread.run(QueuedThreadPool.java:582)
Caused by: org.apache.tika.exception.TikaException: Can't read JPEG metadata
    at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:94)
    at org.apache.tika.parser.jpeg.JpegParser.parse(JpegParser.java:66)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    at org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:197)
    at org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:138)
    at org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(ExtractingDocumentLoader.java:199)
    ... 22 more
Caused by: com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I'.
    at com.drew.metadata.Directory.getInt(Unknown Source)
    at com.drew.metadata.exif.ExifDescriptor.getIsoEquivalentDescription(Unknown Source)
    at com.drew.metadata.exif.ExifDescriptor.getDescription(Unknown Source)
    at com.drew.metadata.Directory.getDescription(Unknown Source)
    at com.drew.metadata.Tag.getDescription(Unknown Source)
    at org.apache.tika.parser.image.ImageMetadataExtractor$CopyUnknownFieldsHandler.handle(ImageMetadataExtractor.java:191)
    at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:133)
    at org.apache.tika.parser.image.ImageMetadataExtractor.handle(ImageMetadataExtractor.java:120)
    at org.apache.tika.parser.image.ImageMetadataExtractor.parseJpeg(ImageMetadataExtractor.java:90)
    ... 27 more

Powered by Jetty://

> Exception "org.apache.tika.exception.TikaException: Can't read JPEG metada" / "com.drew.metadata.MetadataException: Tag '34855' cannot be cast to int. It is of type 'class [I" when indexing some items
> ------------------------------------------------------------------------
>
>                 Key: TIKA-969
>                 URL: https://issues.apache.org/jira/browse/TIKA-969
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 0.9
>            Reporter: Richard Eccles
>         Attachments: withISOSpeed.jpg, withISOSpeedProperties.jpg, withoutISOSpeedStuffProperties.jpg, withoutISOspeed.jpg

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira
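
A note on the root cause line, in case it helps triage: "class [I" is how the JVM prints the int[] type, so tag 34855 (the ISO value, judging by getIsoEquivalentDescription in the trace) appears to be stored as an array in the failing file while the descriptor asks for a single int. As far as I know, EXIF permits more than one value in that field. Below is a minimal sketch of a tolerant read, assuming the raw tag value is available as a plain Object. IsoTagSketch and firstInt are hypothetical names used for illustration only; they are not part of Tika or metadata-extractor, and this is not the fix applied by either project.

```java
import java.util.OptionalInt;

// Hypothetical helper for illustration; not Tika or metadata-extractor code.
final class IsoTagSketch {

    // Tag id taken from the exception message ("Tag '34855'").
    static final int TAG_ISO = 34855;

    /**
     * Reads an int from a raw tag value that may be a single number or an
     * int[] (the type the JVM prints as "class [I").
     * Returns an empty OptionalInt when no int can be taken from the value.
     */
    static OptionalInt firstInt(Object raw) {
        if (raw instanceof Number) {
            // Scalar case: the shape Directory.getInt expected in the trace.
            return OptionalInt.of(((Number) raw).intValue());
        }
        if (raw instanceof int[]) {
            // Array case: use the first element instead of failing the cast.
            int[] values = (int[]) raw;
            return values.length > 0 ? OptionalInt.of(values[0]) : OptionalInt.empty();
        }
        // Any other type is reported as "no value" rather than thrown.
        return OptionalInt.empty();
    }

    public static void main(String[] args) {
        System.out.println(firstInt(Integer.valueOf(400)));   // scalar value
        System.out.println(firstInt(new int[] {200, 0}));     // array value
        System.out.println(firstInt("n/a"));                  // unsupported type
    }
}
```

Taking the first element is only one possible policy. A caller could equally join all elements into a string, or skip the tag and continue with the remaining metadata so that one unreadable field does not fail the whole parse.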