See <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/1214/changes>
Changes: [mattmann] - fix for NUTCH-564 External parser supports encoding attribute [mattmann] - update based on README revs from branch-1.3 to bring up to date [mattmann] - fix for NUTCH-873 Ivy configuration settings don't include Gora [mattmann] - fix for NUTCH-873 Ivy configuration settings don't include Gora [mattmann] Welcome Nutchbase, all hail our new robotic master. [mattmann] fix for NUTCH-870 Injector should add the metadata before calling injectedScore. [ab] Merge from trunk rev. 982625 - more detailed stats for benchmark. [ab] NUTCH-867 Port Nutch benchmark to Nutchbase. [ab] Exclude JMX dependencies (not needed). [jnioche] NUTCH-869 Add back parse-html [mattmann] fix for NUTCH-871 MoreIndexingFilter missing date format [jnioche] NUTCH-859 : Diff trunk and nutchbase [jnioche] NUTCH-696 Timeout for Parser [jnioche] NUTCH-868 : ParseUtil.parse() returns an empty Parse instead of null [jnioche] parse-tika : removed reference to IdentityHtmlMapper which is not part of the latest release of Tika + renamed TestDOMContentUtils to prevent it from being used during the tests. TestDOMContentUtils will be added back when Tika provides the right type of information for html documents [jnioche] Fetcher displays protocol status names in counters instead of their code [jnioche] DbUpdaterJob returns success status [jnioche] Fix WebTableReader - crashed if table was empty [jnioche] FetcherJob returns -1 if job fails [jnioche] NUTCH-840 : moved tests to parse/tika + added TestDOMContentUtil which currently fail but will help us track the progress on the Tika processing of HTML [jnioche] added mapping for sql backend ------------------------------------------ [...truncated 1018 lines...] A src/plugin/index-more/src/java/org/apache/nutch/indexer A src/plugin/index-more/src/java/org/apache/nutch/indexer/more A src/plugin/index-more/src/java/org/apache/nutch/indexer/more/MoreIndexingFilter.java A src/plugin/index-more/src/java/org/apache/nutch/indexer/more/package.html A src/plugin/index-more/plugin.xml A src/plugin/index-more/build.xml AU src/plugin/plugin.dtd A src/plugin/parse-ext A src/plugin/parse-ext/ivy.xml A src/plugin/parse-ext/src A src/plugin/parse-ext/src/test A src/plugin/parse-ext/src/test/org A src/plugin/parse-ext/src/test/org/apache A src/plugin/parse-ext/src/test/org/apache/nutch A src/plugin/parse-ext/src/test/org/apache/nutch/parse A src/plugin/parse-ext/src/test/org/apache/nutch/parse/ext A src/plugin/parse-ext/src/test/org/apache/nutch/parse/ext/TestExtParser.java A src/plugin/parse-ext/src/java A src/plugin/parse-ext/src/java/org A src/plugin/parse-ext/src/java/org/apache A src/plugin/parse-ext/src/java/org/apache/nutch A src/plugin/parse-ext/src/java/org/apache/nutch/parse A src/plugin/parse-ext/src/java/org/apache/nutch/parse/ext A src/plugin/parse-ext/src/java/org/apache/nutch/parse/ext/ExtParser.java A src/plugin/parse-ext/plugin.xml A src/plugin/parse-ext/build.xml A src/plugin/parse-ext/command A src/plugin/urlnormalizer-pass A src/plugin/urlnormalizer-pass/ivy.xml A src/plugin/urlnormalizer-pass/src A src/plugin/urlnormalizer-pass/src/test A src/plugin/urlnormalizer-pass/src/test/org A src/plugin/urlnormalizer-pass/src/test/org/apache A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer A src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer/pass AU src/plugin/urlnormalizer-pass/src/test/org/apache/nutch/net/urlnormalizer/pass/TestPassURLNormalizer.java A src/plugin/urlnormalizer-pass/src/java A src/plugin/urlnormalizer-pass/src/java/org A src/plugin/urlnormalizer-pass/src/java/org/apache A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer A src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer/pass AU src/plugin/urlnormalizer-pass/src/java/org/apache/nutch/net/urlnormalizer/pass/PassURLNormalizer.java AU src/plugin/urlnormalizer-pass/plugin.xml AU src/plugin/urlnormalizer-pass/build.xml A src/plugin/parse-html A src/plugin/parse-html/ivy.xml A src/plugin/parse-html/lib A src/plugin/parse-html/lib/tagsoup.LICENSE.txt A src/plugin/parse-html/src A src/plugin/parse-html/src/test A src/plugin/parse-html/src/test/org A src/plugin/parse-html/src/test/org/apache A src/plugin/parse-html/src/test/org/apache/nutch A src/plugin/parse-html/src/test/org/apache/nutch/parse A src/plugin/parse-html/src/test/org/apache/nutch/parse/html A src/plugin/parse-html/src/test/org/apache/nutch/parse/html/TestRobotsMetaProcessor.java A src/plugin/parse-html/src/test/org/apache/nutch/parse/html/TestDOMContentUtils.java A src/plugin/parse-html/src/java A src/plugin/parse-html/src/java/org A src/plugin/parse-html/src/java/org/apache A src/plugin/parse-html/src/java/org/apache/nutch A src/plugin/parse-html/src/java/org/apache/nutch/parse A src/plugin/parse-html/src/java/org/apache/nutch/parse/html A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/HtmlParser.java A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/XMLCharacterRecognizer.java A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMBuilder.java A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/DOMContentUtils.java A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/HTMLMetaProcessor.java A src/plugin/parse-html/src/java/org/apache/nutch/parse/html/package.html AU src/plugin/parse-html/plugin.xml AU src/plugin/parse-html/build.xml A src/plugin/urlfilter-domain A src/plugin/urlfilter-domain/ivy.xml A src/plugin/urlfilter-domain/src A src/plugin/urlfilter-domain/src/test A src/plugin/urlfilter-domain/src/test/org A src/plugin/urlfilter-domain/src/test/org/apache A src/plugin/urlfilter-domain/src/test/org/apache/nutch A src/plugin/urlfilter-domain/src/test/org/apache/nutch/urlfilter A src/plugin/urlfilter-domain/src/test/org/apache/nutch/urlfilter/domain A src/plugin/urlfilter-domain/src/test/org/apache/nutch/urlfilter/domain/TestDomainURLFilter.java A src/plugin/urlfilter-domain/src/java A src/plugin/urlfilter-domain/src/java/org A src/plugin/urlfilter-domain/src/java/org/apache A src/plugin/urlfilter-domain/src/java/org/apache/nutch A src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter A src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain A src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain/DomainURLFilter.java A src/plugin/urlfilter-domain/src/java/org/apache/nutch/urlfilter/domain/package.html A src/plugin/urlfilter-domain/data A src/plugin/urlfilter-domain/data/hosts.txt A src/plugin/urlfilter-domain/plugin.xml A src/plugin/urlfilter-domain/build.xml A src/plugin/protocol-httpclient A src/plugin/protocol-httpclient/ivy.xml A src/plugin/protocol-httpclient/src A src/plugin/protocol-httpclient/src/test A src/plugin/protocol-httpclient/src/test/conf A src/plugin/protocol-httpclient/src/test/conf/nutch-site-test.xml A src/plugin/protocol-httpclient/src/test/conf/httpclient-auth-test.xml A src/plugin/protocol-httpclient/src/test/org A src/plugin/protocol-httpclient/src/test/org/apache A src/plugin/protocol-httpclient/src/test/org/apache/nutch A src/plugin/protocol-httpclient/src/test/org/apache/nutch/protocol A src/plugin/protocol-httpclient/src/test/org/apache/nutch/protocol/httpclient A src/plugin/protocol-httpclient/src/test/org/apache/nutch/protocol/httpclient/TestProtocolHttpClient.java A src/plugin/protocol-httpclient/src/java A src/plugin/protocol-httpclient/src/java/org A src/plugin/protocol-httpclient/src/java/org/apache A src/plugin/protocol-httpclient/src/java/org/apache/nutch A src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol A src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/Http.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpAuthentication.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/DummySSLProtocolSocketFactory.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpBasicAuthentication.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpAuthenticationFactory.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/DummyX509TrustManager.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpAuthenticationException.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/HttpResponse.java AU src/plugin/protocol-httpclient/src/java/org/apache/nutch/protocol/httpclient/package.html A src/plugin/protocol-httpclient/jsp A src/plugin/protocol-httpclient/jsp/ntlm.jsp A src/plugin/protocol-httpclient/jsp/cookies.jsp A src/plugin/protocol-httpclient/jsp/noauth.jsp A src/plugin/protocol-httpclient/jsp/digest.jsp A src/plugin/protocol-httpclient/jsp/basic.jsp AU src/plugin/protocol-httpclient/plugin.xml AU src/plugin/protocol-httpclient/build.xml A src/plugin/protocol-http A src/plugin/protocol-http/ivy.xml A src/plugin/protocol-http/src A src/plugin/protocol-http/src/java A src/plugin/protocol-http/src/java/org A src/plugin/protocol-http/src/java/org/apache A src/plugin/protocol-http/src/java/org/apache/nutch A src/plugin/protocol-http/src/java/org/apache/nutch/protocol A src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http AU src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/Http.java A src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/HttpResponse.java A src/plugin/protocol-http/src/java/org/apache/nutch/protocol/http/package.html AU src/plugin/protocol-http/plugin.xml AU src/plugin/protocol-http/build.xml A KEYS A README.txt A build.xml U . At revision 983614 [Nutch-trunk] $ /bin/bash -xe /var/tmp/hudson1929746833399703321.sh + PATH=/home/hudson/tools/java/latest1.6/bin:/usr/bin:/usr/ucb:/usr/local/bin:/usr/bin:/usr/sfw/bin:/usr/sfw/sbin:/opt/sfw/bin:/opt/sfw/sbin:/opt/SUNWspro/bin:/usr/X/bin:/usr/ucb:/usr/sbin:/usr/ccs/bin + export ANT_HOME=/export/home/hudson/tools/ant/latest + ANT_HOME=/export/home/hudson/tools/ant/latest + export PATH ANT_HOME + cd trunk + /export/home/hudson/tools/ant/latest/bin/ant -Dversion=2010-08-09_12-40-32 -Dtest.junit.output.format=xml nightly Buildfile: build.xml ivy-probe-antlib: ivy-download: -ivy-download-unchecked: ivy-init-antlib: ivy-init: init: [mkdir] Created dir: <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build> [mkdir] Created dir: <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/classes> [mkdir] Created dir: <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/test> [mkdir] Created dir: <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build/test/classes> [copy] Copying 9 files to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/suffix-urlfilter.txt.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/suffix-urlfilter.txt> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/httpclient-auth.xml.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/httpclient-auth.xml> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/hbase-site.xml.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/hbase-site.xml> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/prefix-urlfilter.txt.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/prefix-urlfilter.txt> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/regex-normalize.xml.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/regex-normalize.xml> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/automaton-urlfilter.txt.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/automaton-urlfilter.txt> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/subcollections.xml.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/subcollections.xml> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/regex-urlfilter.txt.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/regex-urlfilter.txt> [copy] Copying <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/nutch-site.xml.template> to <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/conf/nutch-site.xml> clean-lib: resolve-default: [ivy:resolve] :: Ivy 2.1.0 - 20090925235825 :: http://ant.apache.org/ivy/ :: [ivy:resolve] :: loading settings :: file = <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/ivy/ivysettings.xml> [ivy:resolve] downloading http://repo1.maven.org/maven2/com/healthmarketscience/sqlbuilder/sqlbuilder/2.0.6/sqlbuilder-2.0.6.jar ... [ivy:resolve] ......................... (142kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] com.healthmarketscience.sqlbuilder#sqlbuilder;2.0.6!sqlbuilder.jar (2154ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/org/hsqldb/hsqldb/2.0.0/hsqldb-2.0.0.jar ... [ivy:resolve] ............................................................................................... [ivy:resolve] ............................................................................................. (1226kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] org.hsqldb#hsqldb;2.0.0!hsqldb.jar (4390ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/com/healthmarketscience/common/common-util/1.0.2/common-util-1.0.2.jar ... [ivy:resolve] ....... (32kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] com.healthmarketscience.common#common-util;1.0.2!common-util.jar (1586ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/org/mortbay/jetty/jetty-client/6.1.22/jetty-client-6.1.22.jar ... [ivy:resolve] ................ (67kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] org.mortbay.jetty#jetty-client;6.1.22!jetty-client.jar (2337ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/javax/mail/mail/1.4/mail-1.4.jar ... [ivy:resolve] ..................................................................... (379kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] javax.mail#mail;1.4!mail.jar (2695ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/javax/activation/activation/1.1/activation-1.1.jar ... [ivy:resolve] ............. (61kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] javax.activation#activation;1.1!activation.jar (1913ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/org/mortbay/jetty/jetty-sslengine/6.1.22/jetty-sslengine-6.1.22.jar ... [ivy:resolve] ..... (18kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] org.mortbay.jetty#jetty-sslengine;6.1.22!jetty-sslengine.jar (2957ms) [ivy:resolve] downloading http://repo1.maven.org/maven2/org/mortbay/jetty/jetty-util5/6.1.22/jetty-util5-6.1.22.jar ... [ivy:resolve] ..... (22kB) [ivy:resolve] .. (0kB) [ivy:resolve] [SUCCESSFUL ] org.mortbay.jetty#jetty-util5;6.1.22!jetty-util5.jar (1177ms) [ivy:resolve] [ivy:resolve] :: problems summary :: [ivy:resolve] :::: WARNINGS [ivy:resolve] module not found: org.gora#gora-core;0.1 [ivy:resolve] ==== local: tried [ivy:resolve] /export/home/hudson/.ivy2/local/org.gora/gora-core/0.1/ivys/ivy.xml [ivy:resolve] -- artifact org.gora#gora-core;0.1!gora-core.jar: [ivy:resolve] /export/home/hudson/.ivy2/local/org.gora/gora-core/0.1/jars/gora-core.jar [ivy:resolve] :::::::::::::::::::::::::::::::::::::::::::::: [ivy:resolve] :: UNRESOLVED DEPENDENCIES :: [ivy:resolve] :::::::::::::::::::::::::::::::::::::::::::::: [ivy:resolve] :: org.gora#gora-core;0.1: not found [ivy:resolve] :::::::::::::::::::::::::::::::::::::::::::::: [ivy:resolve] [ivy:resolve] :: USE VERBOSE OR DEBUG MESSAGE LEVEL FOR MORE DETAILS BUILD FAILED <http://hudson.zones.apache.org/hudson/job/Nutch-trunk/ws/trunk/build.xml>:311: impossible to resolve dependencies: resolve failed - see output for details Total time: 52 seconds Publishing Javadoc Archiving artifacts Recording test results