Hi,

The bug is fixed with this build: https://builds.apache.org/job/POI/47/
We confirmed with our files and all the DOC files are processed beautifully :)

Thank you.

Warm regards,
Alex Cougarman

Bahá’í World Centre
Haifa, Israel
Office: +972-4-835-8683 
Cell: +972-54-241-4742
[email protected]  

-----Original Message-----
From: Sergey Vladimirov [mailto:[email protected]] 
Sent: 13 September 2012 11:55 AM
To: POI Users List
Subject: Re: Bug 53380

Hi,

Try #47, built by Yegor:

https://builds.apache.org/job/POI/47/

Best regards,
Sergey

On Thu, Sep 13, 2012 at 9:03 AM, Alex Cougarman <[email protected]> wrote:

> Any update on the bug fix for this? There's a Build #46 on this page 
> but it says "Failed" when you roll over the red circle:
> https://builds.apache.org/job/POI/46/
>
> Thank you :)
>
> Warm regards,
> Alex Cougarman
>
> Bahá’í World Centre
> Haifa, Israel
> Office: +972-4-835-8683
> Cell: +972-54-241-4742
> [email protected]
>
>
> -----Original Message-----
> From: Alex Cougarman [mailto:[email protected]]
> Sent: 11 September 2012 11:42 AM
> To: 'POI Users List'
> Subject: RE: Bug 53380
>
> Hi Sergey,
>
> Thank you for looking into this issue. It will make a huge difference 
> for us :)
>
> Warm regards,
> Alex
>
> -----Original Message-----
> From: Sergey Vladimirov [mailto:[email protected]]
> Sent: 10 September 2012 2:13 PM
> To: POI Users List
> Subject: Re: Bug 53380
>
> Hi
>
> I will take a look into it today or tomorrow.
> Sorry for the long wait.
>
> Best regards,
> Sergey
>
> On Mon, Sep 10, 2012 at 11:19 AM, Alex Cougarman <[email protected]> wrote:
>
> > Dear Yegor,
> >
> > Thank you for your reply. If I knew enough about Java, I'd go in and 
> > fix it :) Just happy to have you guys providing such a great tool.
> > Thanks and keep up the great work.
> >
> > Warm regards,
> > Alex
> >
> > -----Original Message-----
> > From: Yegor Kozlov [mailto:[email protected]]
> > Sent: 10 September 2012 10:16 AM
> > To: POI Users List
> > Subject: Re: Bug 53380
> >
> > We have all pre-requisites for fixing this bug, just need to find a 
> > person to do it :)
> >
> > POI is a volunteer project and if this problem is important for you, 
> > please do work on it and submit a patch. Otherwise please wait.
> > Unfortunately we don't have an active developer working on DOC/DOCX
> > modules, so fixing may take some time.
> >
> > Yegor
> >
> > On Mon, Sep 10, 2012 at 9:48 AM, Alex Cougarman <[email protected]> wrote:
> > > Hi. I'm having the same issue from this bug with hundreds of our 
> > > DOC files being fed through Solr/Tika:
> > > https://issues.apache.org/bugzilla/show_bug.cgi?id=53380
> > >
> > > I downloaded the DOC file attached to the ticket and was able to
> > > generate the same error we've been getting (please see below for the
> > > exception).
> > >
> > > Anyone know of a solution/workaround? Is there a timeline for a fix?
> > > I commented and voted on the ticket but not sure if it's a priority.
> > > Thanks.
> > >
> > > org.apache.tika.exception.TikaException: Unexpected RuntimeException from
> > > org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > > org.apache.solr.common.SolrException:
> > > org.apache.tika.exception.TikaException: Unexpected RuntimeException
> > > from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > >             at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > >     actingDocumentLoader.java:230)
> > >             at
> > > org.apache.solr.handler.ContentStreamHandlerBase.handleRequestBody(Co
> > >     ntentStreamHandlerBase.java:74)
> > >             at
> > > org.apache.solr.handler.RequestHandlerBase.handleRequest(RequestHandl
> > >     erBase.java:129)
> > >             at
> > > org.apache.solr.core.RequestHandlers$LazyRequestHandlerWrapper.handle
> > >     Request(RequestHandlers.java:240)
> > >             at
> > > org.apache.solr.core.SolrCore.execute(SolrCore.java:1656)
> > >             at
> > > org.apache.solr.servlet.SolrDispatchFilter.execute(SolrDispatchFilter
> > >     .java:454)
> > >             at
> > > org.apache.solr.servlet.SolrDispatchFilter.doFilter(SolrDispatchFilte
> > >     r.java:275)
> > >             at
> > > org.eclipse.jetty.servlet.ServletHandler$CachedChain.doFilter(Servlet
> > >     Handler.java:1337)
> > >             at
> > > org.eclipse.jetty.servlet.ServletHandler.doHandle(ServletHandler.java
> > >     :484)
> > >             at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > >     ava:119)
> > >             at
> > > org.eclipse.jetty.security.SecurityHandler.handle(SecurityHandler.ja
> > >     va:524)
> > >             at
> > > org.eclipse.jetty.server.session.SessionHandler.doHandle(SessionHandl
> > >     er.java:233)
> > >             at
> > > org.eclipse.jetty.server.handler.ContextHandler.doHandle(ContextHandl
> > >     er.java:1065)
> > >             at
> > > org.eclipse.jetty.servlet.ServletHandler.doScope(ServletHandler.java:
> > >     413)
> > >             at
> > > org.eclipse.jetty.server.session.SessionHandler.doScope(SessionHandle
> > >     r.java:192)
> > >             at
> > > org.eclipse.jetty.server.handler.ContextHandler.doScope(ContextHandle
> > >     r.java:999)
> > >             at
> > > org.eclipse.jetty.server.handler.ScopedHandler.handle(ScopedHandler.j
> > >     ava:117)
> > >             at
> > > org.eclipse.jetty.server.handler.ContextHandlerCollection.handle(Cont
> > >     extHandlerCollection.java:250)
> > >             at
> > > org.eclipse.jetty.server.handler.HandlerCollection.handle(HandlerColl
> > >     ection.java:149)
> > >            at
> > > org.eclipse.jetty.server.handler.HandlerWrapper.handle(HandlerWrapper
> > >     .java:111)
> > >             at org.eclipse.jetty.server.Server.handle(Server.java:351)
> > >             at
> > > org.eclipse.jetty.server.AbstractHttpConnection.handleRequest(Abstrac
> > >    tHttpConnection.java:454)
> > >             at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handleRequest(Blockin
> > >     gHttpConnection.java:47)
> > >             at
> > > org.eclipse.jetty.server.AbstractHttpConnection.headerComplete(Abstra
> > >     ctHttpConnection.java:890)
> > >             at
> > > org.eclipse.jetty.server.AbstractHttpConnection$RequestHandler.header
> > >     Complete(AbstractHttpConnection.java:944)
> > >             at
> > > org.eclipse.jetty.http.HttpParser.parseNext(HttpParser.java:642)
> > >             at
> > > org.eclipse.jetty.http.HttpParser.parseAvailable(HttpParser.java:230)
> > >             at
> > > org.eclipse.jetty.server.BlockingHttpConnection.handle(BlockingHttpCo
> > >     nnection.java:66)
> > >             at
> > > org.eclipse.jetty.server.bio.SocketConnector$ConnectorEndPoint.run(So
> > >     cketConnector.java:254)
> > >             at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool.runJob(QueuedThreadPoo
> > >     l.java:599)
> > >             at
> > > org.eclipse.jetty.util.thread.QueuedThreadPool$3.run(QueuedThreadPool
> > >     .java:534)
> > >             at java.lang.Thread.run(Unknown Source)
> > >     Caused by: org.apache.tika.exception.TikaException: Unexpected RuntimeException
> > >     from org.apache.tika.parser.microsoft.OfficeParser@328c62ce
> > >             at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:244
> > >     )
> > >             at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > >     )
> > >             at
> > > org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:1
> > >     20)
> > >             at
> > > org.apache.solr.handler.extraction.ExtractingDocumentLoader.load(Extr
> > >     actingDocumentLoader.java:224)
> > >             ... 31 more
> > >     Caused by: java.lang.ArrayIndexOutOfBoundsException: 7
> > >             at
> > > org.apache.poi.util.LittleEndian.getInt(LittleEndian.java:163)
> > >             at
> > > org.apache.poi.hwpf.model.Colorref.<init>(Colorref.java:81)
> > >             at
> > > org.apache.poi.hwpf.model.types.SHDAbstractType.fillFields(SHDAbstrac
> > >     tType.java:56)
> > >             at
> > > org.apache.poi.hwpf.usermodel.ShadingDescriptor.<init>(ShadingD
> > >     escriptor.java:38)
> > >             at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.unCompressCHPOpera
> > >     tion(CharacterSprmUncompressor.java:582)
> > >             at
> > > org.apache.poi.hwpf.sprm.CharacterSprmUncompressor.uncompressCHP(Char
> > >     acterSprmUncompressor.java:65)
> > >             at
> > > org.apache.poi.hwpf.model.StyleSheet.createChp(StyleSheet.java:288)
> > >             at
> > > org.apache.poi.hwpf.model.StyleSheet.<init>(StyleSheet.java:121
> > >     )
> > >             at
> > > org.apache.poi.hwpf.HWPFDocument.<init>(HWPFDocument.java:346)
> > >             at
> > > org.apache.tika.parser.microsoft.WordExtractor.parse(WordExtractor.ja
> > >     va:77)
> > >             at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > >     :185)
> > >             at
> > > org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java
> > >     :160)
> > >             at
> > > org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:242
> > >     )
> > >             ... 34 more
> > >
> > >
> > > Warm regards,
> > > Alex
> > >
> >
> > ---------------------------------------------------------------------
> > To unsubscribe, e-mail: [email protected]
> > For additional commands, e-mail: [email protected]
> >
> >
> >
>
>
> --
> Sergey Vladimirov
>



--
Sergey Vladimirov
