Any update on the bug fix for this? There's a Build #46 on this page but it says "Failed" when you roll over the red circle: https://builds.apache.org/job/POI/46/
Thank you :) Warm regards, Alex Cougarman Bahá’í World Centre Haifa, Israel Office: +972-4-835-8683 Cell: +972-54-241-4742 [email protected] -----Original Message----- From: Alex Cougarman [mailto:[email protected]] Sent: 11 September 2012 11:42 AM To: 'POI Users List' Subject: RE: Bug 53380 Hi Sergey, Thank you for looking into this issue. It will make a huge difference for us :) Warm regards, Alex -----Original Message----- From: Sergey Vladimirov [mailto:[email protected]] Sent: 10 September 2012 2:13 PM To: POI Users List Subject: Re: Bug 53380 Hi I will take a look into it today or tomorrow. Sorry for the long waiting Best regards, Sergey On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <[email protected]> wrote: > Dear Yegor, > > Thank you for your reply. If I knew enough about Java, I'd go in and > fix it :) Just happy to have you guys providing such a great tool. > Thanks and keep up the great work. > > Warm regards, > Alex > > -----Original Message----- > From: Yegor Kozlov [mailto:[email protected]] > Sent: 10 September 2012 10:16 AM > To: POI Users List > Subject: Re: Bug 53380 > > We have all pre-requisites for fixing this bug, just need to find a > person to do it :) > > POI is a volunteer project and if this problem is important for you, > please do work on it and submit a patch. Otherwise please wait. > Unfortuntaly we don't have a active developer working on DOC/DOCX > modules, so fixing may take some time. > > Yegor > > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <[email protected]> wrote: > > Hi. I'm having the same issue from this bug with hundreds of our DOC > > files being fed through Solr/Tika: > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380 > > > > I downloaded the DOC file attached to the ticket and was able to > generate the same error we've been getting (please see below for the > exception). > > > > Anyone know of a solution/workaround? Is there a timeline for a fix? > > I > commented and voted on the ticket but not sure if it's a priority. Thanks. > > > > org.apache.tika.exception.TikaException > > : Unexpected RuntimeException from > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce > > org.apache.solr.common.SolrException: > > org.apache.tika.exception.TikaException: Unexpected RuntimeException > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce > > at > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr > > actingDocumentLoader.java:230) > > at > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co > > ntentStreamHandlerBase.java:74) > > at > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl > > erBase.java:129) > > at > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle > > Request(RequestHandlers.java:240) > > at org.apache.solr.core.SolrCore.execute(SolrCore.java:1656) > > at > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter > > .java:454) > > at > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte > > r.java:275) > > at > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet > > Handler.java:1337) > > at > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java > > :484) > > at > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j > > ava:119) > > at > > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.java > :524) > > at > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl > > er.java:233) > > at > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl > > er.java:1065) > > at > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java: > > 413) > > at > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle > > r.java:192) > > at > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle > > r.java:999) > > at > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j > > ava:117) > > at > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont > > extHandlerCollection.java:250) > > at > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl > > ection.java:149) > > at > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper > > .java:111) > > at org.eclipse.jetty.server.Server.handle(Server.java:351) > > at > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac > > tHttpConnection.java:454) > > at > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin > > gHttpConnection.java:47) > > at > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra > > ctHttpConnection.java:890) > > at > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header > > Complete(AbstractHttpConnection.java:944) > > at > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642) > > at > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230 > > ) > > > > at > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo > > nnection.java:66) > > at > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So > > cketConnector.java:254) > > at > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo > > l.java:599) > > at > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool > > .java:534) > > at java.lang.Thread.run(Unknown Source) > > Caused by: org.apache.tika.exception.TikaException: Unexpected > RuntimeException > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce > > at > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244 > > ) > > at > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242 > > ) > > at > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1 > > 20) > > at > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr > > actingDocumentLoader.java:224) > > ... 31 more > > Caused by: java.lang.ArrayIndexOutOfBoundsException: 7 > > at > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163) > > at > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81) > > at > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac > > tType.java:56) > > at > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD > > escriptor.java:38) > > at > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera > > tion(CharacterSprmUncompressor.java:582) > > at > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char > > acterSprmUncompressor.java:65) > > at > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288) > > at > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121 > > ) > > at > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346) > > at > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja > > va:77) > > at > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java > > :185) > > at > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java > > :160) > > at > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242 > > ) > > ... 34 more > > > > > > Warm regards, > > Alex > > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] For additional > commands, e-mail: [email protected] > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: [email protected] For additional > commands, e-mail: [email protected] > > -- Sergey Vladimirov --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
