Re: Solr Update URI is not found

2013-10-28 Thread Raymond Wiker

On 28 Oct 2013, at 01:19 , Bayu Widyasanyata bwidyasany...@gmail.com wrote:

 request: http://localhost:8080/solr/update?wt=javabinversion=2

I think this url is incorrect: there should be a core name between solr and 
update.

Re: Solr Update URI is not found

2013-10-28 Thread Bayu Widyasanyata
On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote:

  request: http://localhost:8080/solr/update?wt=javabinversion=2

 I think this url is incorrect: there should be a core name between solr
 and update.


I changed th SolrURL on crawl script's option to:

./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/mycollection/2

And the result now is Bad Request.
I will look for another misconfiguration things...

=

org.apache.solr.common.SolrException: Bad Request

Bad Request

request: http://localhost:8080/solr/mycollection/update?wt=javabinversion=2
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
at
org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
at
org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
at
org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
at
org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer:
java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)



-- 
wassalam,
[bayu]


Re: Solr Update URI is not found

2013-10-28 Thread Erick Erickson
This seems like a better question for the Nutch list. I see hadoop
in there, so unless you've specifically configured solr to use
the HDFS directory writer factory, this has to be coming from
someplace else. And there are map/reduce tasks in here.

BTW, it would be more helpful if you posted the URL that you
successfully queried Solr with... What is the /2 on the end for?
Do you use that when you query?

Best,
Erick


On Mon, Oct 28, 2013 at 2:37 AM, Bayu Widyasanyata
bwidyasany...@gmail.comwrote:

 On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote:

   request: http://localhost:8080/solr/update?wt=javabinversion=2
 
  I think this url is incorrect: there should be a core name between solr
  and update.
 

 I changed th SolrURL on crawl script's option to:

 ./bin/crawl urls/seed.txt TestCrawl
 http://localhost:8080/solr/mycollection/2

 And the result now is Bad Request.
 I will look for another misconfiguration things...

 =

 org.apache.solr.common.SolrException: Bad Request

 Bad Request

 request:
 http://localhost:8080/solr/mycollection/update?wt=javabinversion=2
 at

 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
 at

 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
 at

 org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
 at

 org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
 at
 org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
 at

 org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
 at

 org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
 at
 org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
 at
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
 2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer:
 java.io.IOException: Job failed!
 at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
 at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
 at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
 at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
 at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)



 --
 wassalam,
 [bayu]



Re: Solr Update URI is not found

2013-10-28 Thread Bayu Widyasanyata
Hi Erick and All,

The problem is solved by copying schema-solr4.xml into my collection's Solr
conf (renamed to schema.xml).
I didn't use hadoop there, and apologize if it's better to post on this
Solr list since the problem appeared first on Solr Indexer step.

Regarding /2 option it's e-mail body evolution I thought :)
On my first posting, that was a crawl script syntax, as on my case:

# ./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2

2 = the number of rounds.

See here:
http://wiki.apache.org/nutch/NutchTutorial#A3.3._Using_the_crawl_script

Again, thanks everyone!


On Mon, Oct 28, 2013 at 5:39 PM, Erick Erickson erickerick...@gmail.comwrote:

 This seems like a better question for the Nutch list. I see hadoop
 in there, so unless you've specifically configured solr to use
 the HDFS directory writer factory, this has to be coming from
 someplace else. And there are map/reduce tasks in here.

 BTW, it would be more helpful if you posted the URL that you
 successfully queried Solr with... What is the /2 on the end for?
 Do you use that when you query?

 Best,
 Erick


 On Mon, Oct 28, 2013 at 2:37 AM, Bayu Widyasanyata
 bwidyasany...@gmail.comwrote:

  On Mon, Oct 28, 2013 at 1:26 PM, Raymond Wiker rwi...@gmail.com wrote:
 
request: http://localhost:8080/solr/update?wt=javabinversion=2
  
   I think this url is incorrect: there should be a core name between
 solr
   and update.
  
 
  I changed th SolrURL on crawl script's option to:
 
  ./bin/crawl urls/seed.txt TestCrawl
  http://localhost:8080/solr/mycollection/2
 
  And the result now is Bad Request.
  I will look for another misconfiguration things...
 
  =
 
  org.apache.solr.common.SolrException: Bad Request
 
  Bad Request
 
  request:
  http://localhost:8080/solr/mycollection/update?wt=javabinversion=2
  at
 
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
  at
 
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
  at
 
 
 org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
  at
 
 
 org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
  at
  org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
  at
 
 
 org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
  at
 
 
 org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
  at
  org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
  at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
  at
  org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
  2013-10-28 13:30:02,804 ERROR indexer.IndexingJob - Indexer:
  java.io.IOException: Job failed!
  at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
  at
 org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
  at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
  at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
  at
 org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)
 
 
 
  --
  wassalam,
  [bayu]
 




-- 
wassalam,
[bayu]


Solr Update URI is not found

2013-10-27 Thread Bayu Widyasanyata
Hi,

I just installed Nutch 1.7 and latest Solr 4.5.1 successfully.
But I got the error when execute the crawl script
(./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2)

The error is occured on Solr Indexer step.
Following the error on hadoop.log:

2013-10-28 06:16:59,815 WARN  mapred.LocalJobRunner -
job_local1930559258_0001
org.apache.solr.common.SolrException: Not Found

Not Found

request: http://localhost:8080/solr/update?wt=javabinversion=2
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
at
org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
at
org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
at
org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
at
org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
at
org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
at
org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
at
org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer:
java.io.IOException: Job failed!
at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)

I suspect the problem is broken URI (Not Found message) of
http://localhost:8080/solr/update?wt=javabinversion=2

That URI was also report not found when I accessed from browser directly.

Is there any configuration that I missed?

Thanks.-

-- 
wassalam,
[bayu]


Re: Solr Update URI is not found

2013-10-27 Thread Bayu Widyasanyata
Additional info:

- I use Tomcat 7.0.42
- Following are Tomcat/catalina's log when nutch failed on Solr index
process. It replies 404 error:

10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
/solr/update?wt=javabinversion=2 HTTP/1.1 404 973
10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
/solr/update?wt=javabinversion=2 HTTP/1.1 404 973

Thanks.-



On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata
bwidyasany...@gmail.comwrote:

 Hi,

 I just installed Nutch 1.7 and latest Solr 4.5.1 successfully.
 But I got the error when execute the crawl script
 (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2)

 The error is occured on Solr Indexer step.
 Following the error on hadoop.log:

 2013-10-28 06:16:59,815 WARN  mapred.LocalJobRunner -
 job_local1930559258_0001
 org.apache.solr.common.SolrException: Not Found

 Not Found

 request: http://localhost:8080/solr/update?wt=javabinversion=2
 at
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
 at
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
 at
 org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
 at
 org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
 at
 org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
 at
 org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
 at
 org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
 at
 org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
 at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
 at
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
 2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer:
 java.io.IOException: Job failed!
 at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
 at org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
 at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
 at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
 at org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)

 I suspect the problem is broken URI (Not Found message) of
 http://localhost:8080/solr/update?wt=javabinversion=2

 That URI was also report not found when I accessed from browser directly.

 Is there any configuration that I missed?

 Thanks.-

 --
 wassalam,
 [bayu]




-- 
wassalam,
[bayu]


Re: Solr Update URI is not found

2013-10-27 Thread Alexandre Rafalovitch
Can you do queries? Maybe the default collection was somehow not setup and
you need to provide collection name explicitly. What endpoints does admin
interface use when you do a query?

Regards,
   Alex.

Personal website: http://www.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


On Mon, Oct 28, 2013 at 8:54 AM, Bayu Widyasanyata
bwidyasany...@gmail.comwrote:

 Additional info:

 - I use Tomcat 7.0.42
 - Following are Tomcat/catalina's log when nutch failed on Solr index
 process. It replies 404 error:

 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
 /solr/update?wt=javabinversion=2 HTTP/1.1 404 973
 10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
 /solr/update?wt=javabinversion=2 HTTP/1.1 404 973

 Thanks.-



 On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata
 bwidyasany...@gmail.comwrote:

  Hi,
 
  I just installed Nutch 1.7 and latest Solr 4.5.1 successfully.
  But I got the error when execute the crawl script
  (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2)
 
  The error is occured on Solr Indexer step.
  Following the error on hadoop.log:
 
  2013-10-28 06:16:59,815 WARN  mapred.LocalJobRunner -
  job_local1930559258_0001
  org.apache.solr.common.SolrException: Not Found
 
  Not Found
 
  request: http://localhost:8080/solr/update?wt=javabinversion=2
  at
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
  at
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
  at
 
 org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
  at
 
 org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
  at
  org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
  at
 
 org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
  at
 
 org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
  at
  org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
  at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
  at
  org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
  2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer:
  java.io.IOException: Job failed!
  at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
  at
 org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
  at org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
  at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
  at
 org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)
 
  I suspect the problem is broken URI (Not Found message) of
  http://localhost:8080/solr/update?wt=javabinversion=2
 
  That URI was also report not found when I accessed from browser directly.
 
  Is there any configuration that I missed?
 
  Thanks.-
 
  --
  wassalam,
  [bayu]
 



 --
 wassalam,
 [bayu]



Re: Solr Update URI is not found

2013-10-27 Thread Bayu Widyasanyata
Hi Alex,

I can do a common queries.
Below are the json result for *:* query:

{
  responseHeader: {
status: 0,
QTime: 0,
params: {
  indent: true,
  q: *:*,
  _: 1382938341864,
  wt: json
}
  },
  response: {
numFound: 0,
start: 0,
docs: []
  }
}




On Mon, Oct 28, 2013 at 9:11 AM, Alexandre Rafalovitch
arafa...@gmail.comwrote:

 Can you do queries? Maybe the default collection was somehow not setup and
 you need to provide collection name explicitly. What endpoints does admin
 interface use when you do a query?

 Regards,
Alex.

 Personal website: http://www.outerthoughts.com/
 LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
 - Time is the quality of nature that keeps events from happening all at
 once. Lately, it doesn't seem to be working.  (Anonymous  - via GTD book)


 On Mon, Oct 28, 2013 at 8:54 AM, Bayu Widyasanyata
 bwidyasany...@gmail.comwrote:

  Additional info:
 
  - I use Tomcat 7.0.42
  - Following are Tomcat/catalina's log when nutch failed on Solr index
  process. It replies 404 error:
 
  10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
  /solr/update?wt=javabinversion=2 HTTP/1.1 404 973
  10.1.160.40 - - [28/Oct/2013:08:50:02 +0700] POST
  /solr/update?wt=javabinversion=2 HTTP/1.1 404 973
 
  Thanks.-
 
 
 
  On Mon, Oct 28, 2013 at 7:19 AM, Bayu Widyasanyata
  bwidyasany...@gmail.comwrote:
 
   Hi,
  
   I just installed Nutch 1.7 and latest Solr 4.5.1 successfully.
   But I got the error when execute the crawl script
   (./bin/crawl urls/seed.txt TestCrawl http://localhost:8080/solr/ 2)
  
   The error is occured on Solr Indexer step.
   Following the error on hadoop.log:
  
   2013-10-28 06:16:59,815 WARN  mapred.LocalJobRunner -
   job_local1930559258_0001
   org.apache.solr.common.SolrException: Not Found
  
   Not Found
  
   request: http://localhost:8080/solr/update?wt=javabinversion=2
   at
  
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:430)
   at
  
 
 org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:244)
   at
  
 
 org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105)
   at
  
 
 org.apache.nutch.indexwriter.solr.SolrIndexWriter.close(SolrIndexWriter.java:155)
   at
   org.apache.nutch.indexer.IndexWriters.close(IndexWriters.java:118)
   at
  
 
 org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:44)
   at
  
 
 org.apache.hadoop.mapred.ReduceTask$OldTrackingRecordWriter.close(ReduceTask.java:467)
   at
   org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:535)
   at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:421)
   at
  
 org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:398)
   2013-10-28 06:17:00,243 ERROR indexer.IndexingJob - Indexer:
   java.io.IOException: Job failed!
   at
 org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1357)
   at
  org.apache.nutch.indexer.IndexingJob.index(IndexingJob.java:123)
   at
 org.apache.nutch.indexer.IndexingJob.run(IndexingJob.java:185)
   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)
   at
  org.apache.nutch.indexer.IndexingJob.main(IndexingJob.java:195)
  
   I suspect the problem is broken URI (Not Found message) of
   http://localhost:8080/solr/update?wt=javabinversion=2
  
   That URI was also report not found when I accessed from browser
 directly.
  
   Is there any configuration that I missed?
  
   Thanks.-
  
   --
   wassalam,
   [bayu]
  
 
 
 
  --
  wassalam,
  [bayu]
 




-- 
wassalam,
[bayu]