Hello Chris, The console output from Jenkins ( https://builds.apache.org/job/tika-trunk-jdk1.7/887/org.apache.tika$tika-parsers/console) shows the models were downloaded properly.
[INFO] *--- gmaven-plugin:1.0:execute (testSetup) @ tika-parsers --- *GET : http://opennlp.sourceforge.net/models-1.5/en-ner-person.bin -> tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-person.bin 20.5668906766% : 1071114 bytes of 5207953 40.5577008855% : 2112226 bytes of 5207953 60.6271600377% : 3157434 bytes of 5207953 80.5815259854% : 4196648 bytes of 5207953 Copy complete. Download Complete.. GET : http://opennlp.sourceforge.net/models-1.5/en-ner-location.bin -> tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-location.bin 20.5075941298% : 1048073 bytes of 5110658 40.7299803665% : 2081570 bytes of 5110658 60.992889761% : 3117138 bytes of 5110658 81.335945391% : 4156802 bytes of 5110658 Copy complete. Download Complete.. GET : http://opennlp.sourceforge.net/models-1.5/en-ner-organization.bin -> tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-organization.bin 19.7784402696% : 1047698 bytes of 5297172 39.4372317908% : 2089058 bytes of 5297172 59.0984208177% : 3130545 bytes of 5297172 78.8274573678% : 4175626 bytes of 5297172 96.1307278676% : 5092210 bytes of 5297172 Copy complete. Download Complete.. GET : http://opennlp.sourceforge.net/models-1.5/en-ner-date.bin -> tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/ner-date.bin 20.7462884472% : 1043602 bytes of 5030307 41.5294334918% : 2089058 bytes of 5030307 62.1686509392% : 3127274 bytes of 5030307 82.8078683866% : 4165490 bytes of 5030307 Copy complete. Download Complete.. But the tests are saying that the resources are not available: 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - TIME NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-location.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - LOCATION NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-organization.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - ORGANIZATION NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-person.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - PERSON NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-money.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - MONEY NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-percentage.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - PERCENT NER : Available for service ? false 17 Nov 2015 17:38:35 WARN OpenNLPNameFinder - Couldn't find model from org/apache/tika/parser/ner/opennlp/ner-date.bin using class loader 17 Nov 2015 17:38:35 INFO OpenNLPNameFinder - DATE NER : Available for service ? false 17 Nov 2015 17:38:35 INFO NamedEntityParser - org.apache.tika.parser.ner.opennlp.OpenNLPNERecogniser is available ? false Ideally this shouldn't be happening. I guess the problem could be one of these: 1. Classpath update problem problem: May be the Maven plugin inside jenkins environment didn't update classpath with newly downloaded resources in script. => If this is the case, running maven build another time should not see this error. 2. Wrong paths for the resources: I suggest to list files in '{jenkins_home}/{jobs}/tika/workspace/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp' and verify the model files. => If models are not there then the test setup script should be fixed. However, I feel this is highly unlikely because the build has been already tested in Linux And Mac OS X. Regards, Thamme On Tue, Nov 17, 2015 at 9:56 AM, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Thamme, can you have a look here: > > https://builds.apache.org/job/tika-trunk-jdk1.7/887/org.apache.tika$tika-pa > rsers/testReport/junit/org.apache.tika.parser.ner/NamedEntityParserTest/tes > tParse/ > > > Tests seem to be failing (worked for me locally maybe b/c I had > already downloaded the models?) > > Cheers, > Chris > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > > > > > > -----Original Message----- > From: "Hudson (JIRA)" <j...@apache.org> > Date: Tuesday, November 17, 2015 at 12:48 PM > To: jpluser <chris.a.mattm...@jpl.nasa.gov> > Subject: [jira] [Commented] (TIKA-1787) Include Stanford Name Entity > Recognition in Tika > > > > > [ > > > https://issues.apache.org/jira/browse/TIKA-1787?page=com.atlassian.jira.pl > >ugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15009116#comm > >ent-15009116 ] > > > >Hudson commented on TIKA-1787: > >------------------------------ > > > >UNSTABLE: Integrated in tika-trunk-jdk1.7 #887 (See > >[https://builds.apache.org/job/tika-trunk-jdk1.7/887/]) > >Fix for TIKA-1787: Include Stanford Name Entity Recognition in Tika > >contributed by Thamme Gowda N and Yueheng He this closes #61 this closes > >#62 (mattmann: > >[http://svn.apache.org/viewvc/tika/trunk/?view=rev&rev=1714835]) > >* trunk/.gitignore > >* trunk/CHANGES.txt > >* trunk/tika-parsers/pom.xml > >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NERecogniser.j > >ava > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/NamedEntityPar > >ser.java > >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/corenlp/CoreNL > >PNERecogniser.java > >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL > >PNERecogniser.java > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/opennlp/OpenNL > >PNameFinder.java > >* trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex > >* > >trunk/tika-parsers/src/main/java/org/apache/tika/parser/ner/regex/RegexNER > >ecogniser.java > >* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner > >* trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex > >* > >trunk/tika-parsers/src/main/resources/org/apache/tika/parser/ner/regex/ner > >-regex.txt > >* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner > >* > >trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/NamedEntityPar > >serTest.java > >* trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex > >* > >trunk/tika-parsers/src/test/java/org/apache/tika/parser/ner/regex/RegexNER > >ecogniserTest.java > >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser > >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner > >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp > >* > >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/M > >odelGetter.groovy > >* > >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/opennlp/g > >et-models.sh > >* trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex > >* > >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/regex/ner > >-regex.txt > >* > >trunk/tika-parsers/src/test/resources/org/apache/tika/parser/ner/tika-conf > >ig.xml > > > > > >> Include Stanford Name Entity Recognition in Tika > >> ------------------------------------------------ > >> > >> Key: TIKA-1787 > >> URL: https://issues.apache.org/jira/browse/TIKA-1787 > >> Project: Tika > >> Issue Type: Improvement > >> Components: mime, parser > >> Affects Versions: 1.12 > >> Environment: Java 1.8, Mac OSX 10.11 > >> Reporter: Yueheng He > >> Assignee: Chris A. Mattmann > >> Labels: features, newbie, test > >> Fix For: 1.12 > >> > >> Original Estimate: 168h > >> Remaining Estimate: 168h > >> > >> Using the Stanford Name Entity Recognition, Tika will be able to > >>extract name entities like PERSON, ORGANIZATION, LOCATION, etc from the > >>given text. The extracted name entities will be added to the metadata > > > > > > > >-- > >This message was sent by Atlassian JIRA > >(v6.3.4#6332) > > -- - ThammeGowda N