Hi Murray,

Thanks for taking a look! I was able to make an additional test, setting
jspwiki's lucene version to 7.5 (the same brought by default by tika) and
had the same outcome.

I did look at other tests retrieving attachments and PageRenamerTest grabs
them the same way. I still have to take a better look at it, though..

best regards,
juan pablo


El jue., 9 may. 2019 13:22, Murray Altheim <murra...@altheim.com> escribió:

> Hi Juan Pablo,
>
> I can duplicate the error on Ubuntu 18.10, Java version:
>
>      java version "1.8.0_211"
>      Java(TM) SE Runtime Environment (build 1.8.0_211-b12)
>      Java HotSpot(TM) 64-Bit Server VM (build 25.211-b12, mixed mode)
>
> with:
>
>      2019-05-09 23:11:26,236 [main] DEBUG
> org.apache.wiki.search.LuceneSearchProvider  - Scheduling page
> test-tika/aaa-diagram.pdf for index update
>      2019-05-09 23:11:26,236 [main] INFO
> org.apache.wiki.providers.BasicAttachmentProvider  - Uploading attachment
> favicon.png to page test-tika
>      2019-05-09 23:11:26,236 [main] INFO
> org.apache.wiki.providers.BasicAttachmentProvider  - Saving attachment
> contents to
>
> /home/altheim/workspace/jspwiki/jspwiki-tika-searchprovider/target/test-classes/testrepository1557400283189/test-tika-att/favicon.png-dir/1.png
>      2019-05-09 23:11:26,280 [main] DEBUG
> org.apache.wiki.search.LuceneSearchProvider  - Scheduling page
> test-tika/favicon.png for index update
>      [ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0, Time
> elapsed: 3.367 s <<< FAILURE! - in
> org.apache.wiki.search.tika.TikaSearchProviderTest
>      [ERROR] testGetAttachmentContent  Time elapsed: 3.358 s  <<< ERROR!
>      java.lang.NullPointerException
>         at
> org.apache.wiki.search.tika.TikaSearchProviderTest.testGetAttachmentContent(TikaSearchProviderTest.java:58)
>
>      [INFO]
>      [INFO] Results:
>      [INFO]
>      [ERROR] Errors:
>      [ERROR]   TikaSearchProviderTest.testGetAttachmentContent:58 »
> NullPointer
>      [INFO]
>      [ERROR] Tests run: 1, Failures: 0, Errors: 1, Skipped: 0
>
> It's late here so I'll hopefully get to this again tomorrow. I still
> suspect
> instantiation somewhere inside of Lucene, where I've seen strange
> behaviours
> when multiple versions are present, including NPEs.
>
> Cheers,
>
> Murray
>
> On 9/05/19 9:46 PM, Juan Pablo Santos Rodríguez wrote:
> > Hi,
> >
> > Made some updates, as I was able to grab an Ubuntu box. The build seems
> to
> > fail, consistently, on ubuntu. Regarding the NPE, the SearchManager (tsp
> on
> > the test) is *not* null, but the attachment is. Oddly this only happens
> on
> > ubuntu.
> >
> > I'll try to debug some more, but I've limited access to this box, so
> extra
> > eyes are more than welcome.
> >
> > br,
> > juan pablo
> >
> > El mar., 7 may. 2019 0:24, Juan Pablo Santos Rodríguez <
> > juanpablo.san...@gmail.com> escribió:
> >
> >> My daily update on this:
> >>
> >> definitely, seems something is weird on the ubuntu nodes; I've been
> trying
> >> several combinations of JDK 8, maven installations, increasing the -Xmx
> >> through MAVEN_OPTS, and all failed, on ubuntu nodes. I've modified the
> >> build to run on windows nodes, and suddenly there aren't any problems.
> I've
> >> also tried on my local pc (windows based) with several JDK 8 and memory
> >> settings and haven't been able to reproduce the test failure.
> >>
> >> I'll try tomorrow on an Ubuntu box at work to see if I can reproduce the
> >> test failure. Meanwhile, has anybody been able to reproduce it?
> >> What do we do with the build process? Comment out the test, move it to
> >> Windows nodes,..?
> >>
> >>
> >> best regards,
> >> juan pablo
> >>
> >> On Sun, May 5, 2019 at 11:44 PM Juan Pablo Santos Rodríguez <
> >> juanpablo.san...@gmail.com> wrote:
> >>
> >>> Hi,
> >>>
> >>> hmm, didn't realize that Tika and JSPWiki pull different versions
> (7.5.0
> >>> and 8.0.0). However, JSPWiki "forces" it's Lucene version to
> >>> Tika via dependencyManagement section in pom, so when pulling the
> >>> dependencies only Lucene 8 appears in classpath for both
> >>> JSPWiki and Tika. We could have binary incompatibilities down the line,
> >>> but I think we are not using Tika enough for them to appear.
> >>> Also, there are occasions of tests failing where Tika isn't even loaded
> >>> in memory (https://builds.apache.org/job/jspwiki-test/18/console).
> >>> Tests which have been running for ages, and with NPEs on objects that
> are
> >>> obviously not null at that point.. Tika related code runs ok,
> >>> the error comes on an NPE afterwards.
> >>>
> >>> Weirder, the offending test fails with:
> >>>
> >>> [ERROR] testGetAttachmentContent  Time elapsed: 3.892 s  <<< ERROR!
> >>> java.lang.NullPointerException
> >>>     at
> org.apache.wiki.search.tika.TikaSearchProviderTest.testGetAttachmentContent(TikaSearchProviderTest.java:58)
> >>>
> >>> being the offending line:
> >>> String pdfIndexed = tsp.getAttachmentContent( attPdf );
> >>>
> >>> and tsp being loaded as:
> >>> TikaSearchProvider tsp = ( TikaSearchProvider
> >>> )engine.getSearchManager().getSearchEngine();
> >>>
> >>> (so, only tsp could yield the NPE)
> >>>
> >>> I modified the Jenkinsfile to perform a sed on that test so the above
> >>> line gets replaced by:
> >>> TikaSearchProvider tsp = new TikaSearchProvider();tsp.initialize(
> engine,
> >>> props );
> >>>
> >>> tsp *cannot* be null beyond that point.. But it is:
> >>> https://builds.apache.org/job/jspwiki-test/19/ shows the same NPE on
> the
> >>> same
> >>> line yielding it as before.
> >>>
> >>> Perhaps is that the build is running with a very adjusted memory
> setting
> >>> (i.e.: the default maven setting is not enough for the jspwiki
> >>> build?). Will keep digging tomorrow..
> >>>
> >>>
> >>> best regards,
> >>> juan pablo
> >>>
> >>> On Sun, May 5, 2019 at 10:50 PM Murray Altheim <murra...@altheim.com>
> >>> wrote:
> >>>
> >>>> Hi Juan Pablo,
> >>>>
> >>>> My suspicion is still related to there being two Lucene engines in the
> >>>> mix.
> >>>>
> >>>> If JSPWiki and Tika use *different* versions of Lucene there will be
> two
> >>>> Lucenes the classpath, and I've personally had all sorts of issues
> with
> >>>> Lucene instantiating when two components both use it (I seem to
> remember
> >>>> the "Version" can't be resolved, or something like that). Put it this
> >>>> way:
> >>>> I've noted that two Lucene engines in the same project might not like
> >>>> each
> >>>> other too much. I think I might have even had problems if they use the
> >>>> same version number there can be contention. So when I heard
> something in
> >>>> Tika related to Lucene was popping an NPE it set off my Lucene
> Alarm...
> >>>>
> >>>> But I've got some time this week so I'll be trying out the latest
> >>>> release,
> >>>> hopefully in the next few days.
> >>>>
> >>>> Cheers,
> >>>>
> >>>> Murray
> >>>>
> >>>> On 6/05/19 8:42 AM, Juan Pablo Santos Rodríguez wrote:
> >>>>> Hi Murray,
> >>>>>
> >>>>> thanks for looking into it :-)
> >>>>>
> >>>>> My findings so far: Lucene index related test are already in the main
> >>>>> build, so that part should be covered, and regarding the Tika-related
> >>>> ones,
> >>>>> they don't interact with Lucene, they only parse a couple of files,
> and
> >>>>> check the extracted information (which is afterwards sent to Lucene,
> >>>> but
> >>>>> that's outside the scope of the test). I was suspecting of something
> >>>> weird
> >>>>> on the node in which the build kept running and made a new Jenkins
> job
> >>>> to
> >>>>> do some tests (and avoid spamming dev@j.a.o) at
> >>>>> https://builds.apache.org/job/jspwiki-test/
> >>>>> [...]
> >>>>
> >>>>
> ...........................................................................
> >>>> Murray Altheim <murray18 at altheim dot com>                       = =
> >>>> ===
> >>>> http://www.altheim.com/murray/
>  ===
> >>>> ===
> >>>>
> = =
> >>>> ===
> >>>>       In the evening
> >>>>       The rice leaves in the garden
> >>>>       Rustle in the autumn wind
> >>>>       That blows through my reed hut.
> >>>>              -- Minamoto no Tsunenobu
> >>>>
> >>>>
> >
>
> --
>
> ...........................................................................
> Murray Altheim <murray18 at altheim dot com>                       = =  ===
> http://www.altheim.com/murray/                                     ===
> ===
>                                                                     = =
> ===
>      In the evening
>      The rice leaves in the garden
>      Rustle in the autumn wind
>      That blows through my reed hut.
>             -- Minamoto no Tsunenobu
>
>

Reply via email to