And what do the Solr logs say?
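The "Job failed!" line in hadoop.log only tells you that indexing stopped; the actual cause is usually reported on the Solr side. If that log is long, a small helper like the sketch below can pull out the suspicious lines with a bit of context. It is only a rough illustration and not part of Nutch or Solr: the function names, the keyword list and the number of context lines are my own assumptions, and where Solr writes its log depends on how you start it.

```python
from typing import Iterable, List, Tuple


def find_problems(
    lines: Iterable[str],
    keywords: Tuple[str, ...] = ("SEVERE", "ERROR", "Exception"),  # assumed markers
    context: int = 2,
) -> List[Tuple[int, List[str]]]:
    """Return (line_number, [matching line + following context lines]) for each hit."""
    cleaned = [line.rstrip("\n") for line in lines]
    hits = []
    for index, line in enumerate(cleaned):
        if any(keyword in line for keyword in keywords):
            # Line numbers are 1-based so they match what an editor shows.
            hits.append((index + 1, cleaned[index:index + 1 + context]))
    return hits


def print_problems(path: str) -> None:
    """Print every hit found in the log file at `path` (path is supplied by the caller)."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        for number, block in find_problems(handle):
            print(f"--- line {number} ---")
            print("\n".join(block))
```

Calling `print_problems()` with the path to your Solr log (or to hadoop.log) prints each matching line followed by the next two lines, which is normally enough to see the start of a stack trace.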
-----Original message-----
> From:attabi <attabi...@hotmail.fr>
> Sent: Fri 23-Nov-2012 17:21
> To: user@nutch.apache.org
> Subject: Re: java.io.IOException: Job failed!
>
> This is for my final report.
> I am a French student trying to integrate Solr (3.6.1) with
> apache-nutch-1.5-src.zip on Ubuntu 11.04 with Java 1.7.
> Everything is fine, but when I run the command
> bin/nutch solrindex http://127.0.0.1:8983/solr/ crawl/crawldb -linkdb crawl/linkdb crawl/segments/*
> I get this error: java.io.IOException: Job failed!
> I went to solr/dist and copied apache-solr-core-3.6.1.jar,
> apache-solr-solrj-3.6.1.jar and solrj-lib to nutch/lib, and removed
> solr-solrj-3.6.1.jar (inside Nutch). Then I built Nutch, but I still get
> the same error.
> This is my hadoop.log:
> 2012-11-23 12:48:37,654 INFO solr.SolrIndexer - SolrIndexer: starting at
> 2012-11-23 12:48:37
> 2012-11-23 12:48:37,761 INFO indexer.IndexerMapReduce - IndexerMapReduce:
> crawldb: crawl/crawldb
> 2012-11-23 12:48:37,761 INFO indexer.IndexerMapReduce - IndexerMapReduce:
> linkdb: crawl/linkdb
> 2012-11-23 12:48:37,761 INFO indexer.IndexerMapReduce - IndexerMapReduces:
> adding segment: crawl/segments/20121121133422
> 2012-11-23 12:48:37,975 INFO indexer.IndexerMapReduce - IndexerMapReduces:
> adding segment: crawl/segments/20121121134004
> 2012-11-23 12:48:37,979 INFO indexer.IndexerMapReduce - IndexerMapReduces:
> adding segment: crawl/segments/20121121134920
> 2012-11-23 12:48:38,129 WARN util.NativeCodeLoader - Unable to load
> native-hadoop library for your platform... using builtin-java classes where
> applicable
> 2012-11-23 12:48:39,084 INFO plugin.PluginRepository - Plugins: looking in:
> /usr/local/test/nutch/runtime/local/plugins
> 2012-11-23 12:48:39,600 INFO plugin.PluginRepository - Plugin
> Auto-activation mode: [true]
> 2012-11-23 12:48:39,600 INFO plugin.PluginRepository - Registered Plugins:
> 2012-11-23 12:48:39,600 INFO plugin.PluginRepository - the nutch core
> extension points (nutch-extensionpoints)
> 2012-11-23 12:48:39,600 INFO plugin.PluginRepository - CyberNeko HTML
> Parser (lib-nekohtml)
> 2012-11-23 12:48:39,600 INFO plugin.PluginRepository - OPIC Scoring
> Plug-in (scoring-opic)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Basic Indexing
> Filter (index-basic)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Html Parse
> Plug-in (parse-html)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Anchor Indexing
> Filter (index-anchor)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - HTTP Framework
> (lib-http)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Regex URL Filter
> (urlfilter-regex)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Regex URL Filter
> Framework (lib-regex-filter)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Http Protocol
> Plug-in (protocol-http)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Registered
> Extension-Points:
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Nutch URL
> Normalizer (org.apache.nutch.net.URLNormalizer)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Nutch Protocol
> (org.apache.nutch.protocol.Protocol)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Nutch Segment
> Merge Filter (org.apache.nutch.segment.SegmentMergeFilter)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Nutch URL Filter
> (org.apache.nutch.net.URLFilter)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - Nutch Indexing
> Filter (org.apache.nutch.indexer.IndexingFilter)
> 2012-11-23 12:48:39,601 INFO plugin.PluginRepository - HTML Parse
> Filter (org.apache.nutch.parse.HtmlParseFilter)
> 2012-11-23 12:48:39,602 INFO plugin.PluginRepository - Nutch Content
> Parser (org.apache.nutch.parse.Parser)
> 2012-11-23 12:48:39,602 INFO plugin.PluginRepository - Nutch Scoring
> (org.apache.nutch.scoring.ScoringFilter)
> 2012-11-23 12:48:39,606 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:39,608 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:39,608 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:41,868 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:41,868 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:41,868 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:45,040 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:45,040 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:45,041 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:48,013 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:48,015 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:48,015 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:51,086 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:51,086 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:51,086 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:54,067 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:54,067 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:54,067 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:48:56,940 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:48:56,940 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:48:56,940 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:00,049 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:00,049 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:00,049 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:03,053 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:03,053 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:03,053 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:06,148 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:06,149 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:06,149 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:15,901 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:15,962 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:15,962 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:17,605 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:17,647 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:17,647 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:20,551 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:20,551 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:20,552 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:24,150 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:24,150 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:24,151 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:28,574 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.basic.BasicIndexingFilter
> 2012-11-23 12:49:28,574 INFO anchor.AnchorIndexingFilter - Anchor
> deduplication is: off
> 2012-11-23 12:49:28,574 INFO indexer.IndexingFilters - Adding
> org.apache.nutch.indexer.anchor.AnchorIndexingFilter
> 2012-11-23 12:49:30,123 INFO solr.SolrMappingReader - source: content dest:
> content
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: title dest:
> title
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: host dest:
> host
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: segment dest:
> segment
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: boost dest:
> boost
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: digest dest:
> digest
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: tstamp dest:
> tstamp
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: url dest: id
> 2012-11-23 12:49:30,124 INFO solr.SolrMappingReader - source: url dest: url
> 2012-11-23 12:49:31,107 INFO solr.SolrWriter - Indexing 52 documents
> 2012-11-23 12:49:53,007 ERROR solr.SolrIndexer - java.io.IOException: Job
> failed! .