Hello Chris,

The console output from Jenkins (
https://builds.apache.org/job/tika-trunk-jdk1.7/887/org.apache.tika$tika-parsers/console)
shows the models were downloaded properly.

[INFO] --- gmaven-plugin:1.0:execute (testSetup) @ tika-parsers ---
GET : http://opennlp.sourceforge.net/models-1.5/en-ner-person.bin ->
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin
20.5668906766% : 1071114 bytes of 5207953
40.5577008855% : 2112226 bytes of 5207953
60.6271600377% : 3157434 bytes of 5207953
80.5815259854% : 4196648 bytes of 5207953
Copy complete.
Download Complete..
GET : http://opennlp.sourceforge.net/models-1.5/en-ner-location.bin ->
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin
20.5075941298% : 1048073 bytes of 5110658
40.7299803665% : 2081570 bytes of 5110658
60.992889761% : 3117138 bytes of 5110658
81.335945391% : 4156802 bytes of 5110658
Copy complete.
Download Complete..
GET : http://opennlp.sourceforge.net/models-1.5/en-ner-organization.bin ->
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin
19.7784402696% : 1047698 bytes of 5297172
39.4372317908% : 2089058 bytes of 5297172
59.0984208177% : 3130545 bytes of 5297172
78.8274573678% : 4175626 bytes of 5297172
96.1307278676% : 5092210 bytes of 5297172
Copy complete.
Download Complete..
GET : http://opennlp.sourceforge.net/models-1.5/en-ner-date.bin ->
tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin
20.7462884472% : 1043602 bytes of 5030307
41.5294334918% : 2089058 bytes of 5030307
62.1686509392% : 3127274 bytes of 5030307
82.8078683866% : 4165490 bytes of 5030307
Copy complete.
Download Complete..


But the test log says the class loader cannot find those resources:

17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - TIME NER : Available
for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-location.bin using class
loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - LOCATION NER :
Available for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-organization.bin using
class loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - ORGANIZATION NER :
Available for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-person.bin using class
loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - PERSON NER : Available
for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-money.bin using class
loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - MONEY NER : Available
for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-percentage.bin using class
loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - PERCENT NER : Available
for service ? false
17 Nov 2015 17:38:35  WARN OpenNLPNameFinder - Couldn't find model
from org/apache/tika/parser/ner/opennlp/ner-date.bin using class
loader
17 Nov 2015 17:38:35  INFO OpenNLPNameFinder - DATE NER : Available
for service ? false
17 Nov 2015 17:38:35  INFO NamedEntityParser -
org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser is available ?
false


This shouldn't be happening. I suspect the problem is one of these:

1. Classpath update problem: Maybe the Maven plugin inside the Jenkins
environment didn't update the classpath with the resources newly downloaded
by the script. => If this is the case, running the Maven build a second time
should not show this error.

2. Wrong paths for the resources: I suggest listing the files in
'{jenkins_home}/{jobs}/tika/workspace/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp'
and verifying that the model files are there. => If the models are not there,
then the test setup script should be fixed.
However, I feel this is highly unlikely because the build has already been
tested on Linux and Mac OS X.
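
Both guesses can be checked with a small standalone program. The following
is only a rough sketch, not code from Tika: the class name
ModelResourceCheck and its methods are hypothetical, the resource directory
is copied from the WARN messages above, and the four file names are the ones
the download step reported. It lists which models are missing as files under
a given directory and which ones a class loader cannot resolve.

```java
import java.net.URL;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical diagnostic helper; it is NOT part of Tika.
class ModelResourceCheck {

    // Resource directory as it appears in the WARN messages of the test log.
    static final String RESOURCE_DIR = "org/apache/tika/parser/ner/opennlp/";

    // File names written by the download step, per the Jenkins console output.
    static final List<String> MODELS = Arrays.asList(
            "ner-person.bin", "ner-location.bin",
            "ner-organization.bin", "ner-date.bin");

    /** Returns the models that do not exist as regular files under baseDir. */
    static List<String> missingOnDisk(Path baseDir, List<String> models) {
        List<String> missing = new ArrayList<>();
        Path dir = baseDir.resolve(RESOURCE_DIR);
        for (String model : models) {
            if (!Files.isRegularFile(dir.resolve(model))) {
                missing.add(model);
            }
        }
        return missing;
    }

    /** Returns the models that the given class loader cannot resolve. */
    static List<String> missingOnClasspath(ClassLoader loader, List<String> models) {
        List<String> missing = new ArrayList<>();
        for (String model : models) {
            URL url = loader.getResource(RESOURCE_DIR + model);
            if (url == null) {
                missing.add(model);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Default path is an assumption; pass the real workspace path instead.
        Path base = Paths.get(args.length > 0 ? args[0]
                : "tika-parsers/src/test/resources");
        System.out.println("Missing on disk under " + base + ": "
                + missingOnDisk(base, MODELS));
        System.out.println("Missing on classpath: "
                + missingOnClasspath(ModelResourceCheck.class.getClassLoader(), MODELS));
    }
}
```

Running it once against src/test/resources and once against
target/test-classes (assuming the standard Maven layout, where test
resources are copied to target/test-classes before the tests run) should
show whether the files were downloaded but never reached the test classpath.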


Regards,
Thamme

On Tue, Nov 17, 2015 at 9:56 AM, Mattmann, Chris A (3980) <
chris.a.mattm...@jpl.nasa.gov> wrote:

> Thamme, can you have a look here:
>
> https://builds.apache.org/job/tika-trunk-jdk1.7/887/org.apache.tika$tika-pa
> rsers/testReport/junit/org.apache.tika.parser.ner/NamedEntityParserTest/tes
> tParse/
>
>
> Tests seem to be failing (worked for me locally maybe b/c I had
> already downloaded the models?)
>
> Cheers,
> Chris
>
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Chris Mattmann, Ph.D.
> Chief Architect
> Instrument Software and Science Data Systems Section (398)
> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
> Office: 168-519, Mailstop: 168-527
> Email: chris.a.mattm...@nasa.gov
> WWW:  http://sunset.usc.edu/~mattmann/
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
> Adjunct Associate Professor, Computer Science Department
> University of Southern California, Los Angeles, CA 90089 USA
> ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
>
>
>
>
>
> -----Original Message-----
> From: "Hudson (JIRA)" <j...@apache.org>
> Date: Tuesday, November 17, 2015 at 12:48 PM
> To: jpluser <chris.a.mattm...@jpl.nasa.gov>
> Subject: [jira] [Commented] (TIKA-1787) Include Stanford Name Entity
> Recognition in Tika
>
> >
> >    [
> >
> https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.pl
> >ugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15009116#comm
> >ent-15009116 ]
> >
> >Hudson commented on TIKA-1787:
> >------------------------------
> >
> >UNSTABLE: Integrated in tika-trunk-jdk1.7 #887 (See
> >[https://builds.apache.org/job/tika-trunk-jdk1.7/887/])
> >Fix for TIKA-1787: Include Stanford Name Entity Recognition in Tika
> >contributed by Thamme Gowda N and Yueheng He this closes #61 this closes
> >#62 (mattmann:
> >[http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1714835])
> >* trunk/.gitignore
> >* trunk/CHANGES.txt
> >* trunk/tika-parsers/pom.xml
> >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.j
> >ava
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityPar
> >ser.java
> >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNL
> >PNERecogniser.java
> >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL
> >PNERecogniser.java
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL
> >PNameFinder.java
> >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex
> >*
> >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNER
> >ecogniser.java
> >* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner
> >* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex
> >*
> >trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner
> >-regex.txt
> >* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner
> >*
> >trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityPar
> >serTest.java
> >* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex
> >*
> >trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNER
> >ecogniserTest.java
> >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser
> >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner
> >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp
> >*
> >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/M
> >odelGetter.groovy
> >*
> >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/g
> >et-models.sh
> >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex
> >*
> >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner
> >-regex.txt
> >*
> >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-conf
> >ig.xml
> >
> >
> >> Include Stanford Name Entity Recognition in Tika
> >> ------------------------------------------------
> >>
> >>                 Key: TIKA-1787
> >>                 URL: https://issues.apache.org/jira/browse/TIKA-1787
> >>             Project: Tika
> >>          Issue Type: Improvement
> >>          Components: mime, parser
> >>    Affects Versions: 1.12
> >>         Environment: Java 1.8, Mac OSX 10.11
> >>            Reporter: Yueheng He
> >>            Assignee: Chris A. Mattmann
> >>              Labels: features, newbie, test
> >>             Fix For: 1.12
> >>
> >>   Original Estimate: 168h
> >>  Remaining Estimate: 168h
> >>
> >> Using the Stanford Name Entity Recognition, Tika will be able to
> >>extract name entities like PERSON, ORGANIZATION, LOCATION, etc from the
> >>given text. The extracted name entities will be added to the metadata
> >
> >
> >
> >--
> >This message was sent by Atlassian JIRA
> >(v6.3.4#6332)
>
>


-- 
-
ThammeGowda N
