Re: Solr Update URI is not found
On 28 Oct 2013, at 01:19 , Bayu Widyasanyata bwidyasany...@gmail.com wrote: request: http://localhost:8080/solr/update?wt=javabinversion=2 I think this url is incorrect: there should be a core name between solr and update.
Re: Solr Update URI is not found
On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote: request: http://localhost:8080/solr/update?wt=javabinversion=2 I think this url is incorrect: there should be a core name between solr and update. I changed th SolrURL on crawl script's option to: ./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/mycollection/2 And the result now is Bad Request. I will look for another misconfiguration things... = org.apache.solr.common.SolrException: Bad Request Bad Request request: http://localhost:8080/solr/mycollection/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) -- wassalam, [bayu]
Re: Solr Update URI is not found
This seems like a better question for the Nutch list. I see hadoop in there, so unless you've specifically configured solr to use the HDFS directory writer factory, this has to be coming from someplace else. And there are map/reduce tasks in here. BTW, it would be more helpful if you posted the URL that you successfully queried Solr with... What is the /2 on the end for? Do you use that when you query? Best, Erick On Mon, Oct 28, 2013 at 2:37 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote: request: http://localhost:8080/solr/update?wt=javabinversion=2 I think this url is incorrect: there should be a core name between solr and update. I changed th SolrURL on crawl script's option to: ./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/mycollection/2 And the result now is Bad Request. I will look for another misconfiguration things... = org.apache.solr.common.SolrException: Bad Request Bad Request request: http://localhost:8080/solr/mycollection/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) -- wassalam, [bayu]
Re: Solr Update URI is not found
Hi Erick and All, The problem is solved by copying schema-solr4.xml into my collection's Solr conf (renamed to schema.xml). I didn't use hadoop there, and apologize if it's better to post on this Solr list since the problem appeared first on Solr Indexer step. Regarding /2 option it's e-mail body evolution I thought :) On my first posting, that was a crawl script syntax, as on my case: # ./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2 2 = the number of rounds. See here: http://wiki.apache.org/nutch/NutchTutorial#A3.3._Using_the_crawl_script Again, thanks everyone! On Mon, Oct 28, 2013 at 5:39 PM, Erick Erickson erickerick...@gmail.comwrote: This seems like a better question for the Nutch list. I see hadoop in there, so unless you've specifically configured solr to use the HDFS directory writer factory, this has to be coming from someplace else. And there are map/reduce tasks in here. BTW, it would be more helpful if you posted the URL that you successfully queried Solr with... What is the /2 on the end for? Do you use that when you query? Best, Erick On Mon, Oct 28, 2013 at 2:37 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote: request: http://localhost:8080/solr/update?wt=javabinversion=2 I think this url is incorrect: there should be a core name between solr and update. I changed th SolrURL on crawl script's option to: ./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/mycollection/2 And the result now is Bad Request. I will look for another misconfiguration things... = org.apache.solr.common.SolrException: Bad Request Bad Request request: http://localhost:8080/solr/mycollection/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) -- wassalam, [bayu] -- wassalam, [bayu]
Solr Update URI is not found
Hi, I just installed Nutch 1.7 and latest Solr 4.5.1 successfully. But I got the error when execute the crawl script (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2) The error is occured on Solr Indexer step. Following the error on hadoop.log: 2013-10-28 06:16:59,815 WARN mapred.LocalJobRunner - job_local1930559258_0001 org.apache.solr.common.SolrException: Not Found Not Found request: http://localhost:8080/solr/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) I suspect the problem is broken URI (Not Found message) of http://localhost:8080/solr/update?wt=javabinversion=2 That URI was also report not found when I accessed from browser directly. Is there any configuration that I missed? Thanks.- -- wassalam, [bayu]
Re: Solr Update URI is not found
Additional info: - I use Tomcat 7.0.42 - Following are Tomcat/catalina's log when nutch failed on Solr index process. It replies 404 error: 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 Thanks.- On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: Hi, I just installed Nutch 1.7 and latest Solr 4.5.1 successfully. But I got the error when execute the crawl script (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2) The error is occured on Solr Indexer step. Following the error on hadoop.log: 2013-10-28 06:16:59,815 WARN mapred.LocalJobRunner - job_local1930559258_0001 org.apache.solr.common.SolrException: Not Found Not Found request: http://localhost:8080/solr/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) I suspect the problem is broken URI (Not Found message) of http://localhost:8080/solr/update?wt=javabinversion=2 That URI was also report not found when I accessed from browser directly. Is there any configuration that I missed? Thanks.- -- wassalam, [bayu] -- wassalam, [bayu]
Re: Solr Update URI is not found
Can you do queries? Maybe the default collection was somehow not setup and you need to provide collection name explicitly. What endpoints does admin interface use when you do a query? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Mon, Oct 28, 2013 at 8:54 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: Additional info: - I use Tomcat 7.0.42 - Following are Tomcat/catalina's log when nutch failed on Solr index process. It replies 404 error: 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 Thanks.- On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: Hi, I just installed Nutch 1.7 and latest Solr 4.5.1 successfully. But I got the error when execute the crawl script (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2) The error is occured on Solr Indexer step. Following the error on hadoop.log: 2013-10-28 06:16:59,815 WARN mapred.LocalJobRunner - job_local1930559258_0001 org.apache.solr.common.SolrException: Not Found Not Found request: http://localhost:8080/solr/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) I suspect the problem is broken URI (Not Found message) of http://localhost:8080/solr/update?wt=javabinversion=2 That URI was also report not found when I accessed from browser directly. Is there any configuration that I missed? Thanks.- -- wassalam, [bayu] -- wassalam, [bayu]
Re: Solr Update URI is not found
Hi Alex, I can do a common queries. Below are the json result for *:* query: { responseHeader: { status: 0, QTime: 0, params: { indent: true, q: *:*, _: 1382938341864, wt: json } }, response: { numFound: 0, start: 0, docs: [] } } On Mon, Oct 28, 2013 at 9:11 AM, Alexandre Rafalovitch arafa...@gmail.comwrote: Can you do queries? Maybe the default collection was somehow not setup and you need to provide collection name explicitly. What endpoints does admin interface use when you do a query? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch - Time is the quality of nature that keeps events from happening all at once. Lately, it doesn't seem to be working. (Anonymous - via GTD book) On Mon, Oct 28, 2013 at 8:54 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: Additional info: - I use Tomcat 7.0.42 - Following are Tomcat/catalina's log when nutch failed on Solr index process. It replies 404 error: 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST /solr/update?wt=javabinversion=2 HTTP/1.1 404 973 Thanks.- On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata bwidyasany...@gmail.comwrote: Hi, I just installed Nutch 1.7 and latest Solr 4.5.1 successfully. But I got the error when execute the crawl script (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2) The error is occured on Solr Indexer step. Following the error on hadoop.log: 2013-10-28 06:16:59,815 WARN mapred.LocalJobRunner - job_local1930559258_0001 org.apache.solr.common.SolrException: Not Found Not Found request: http://localhost:8080/solr/update?wt=javabinversion=2 at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430) at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244) at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) at org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155) at org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118) at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44) at org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467) at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398) 2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer: java.io.IOException: Job failed! at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357) at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123) at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185) at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195) I suspect the problem is broken URI (Not Found message) of http://localhost:8080/solr/update?wt=javabinversion=2 That URI was also report not found when I accessed from browser directly. Is there any configuration that I missed? Thanks.- -- wassalam, [bayu] -- wassalam, [bayu] -- wassalam, [bayu]