Hi,

I am getting ready to index a recent copy of Wikipedia's pages-articles
dump.  I have two servers, foo and bar.  On foo.com/mediawiki I have a
Mediawiki install serving up the pages. On bar.com/solr I have my solr
install. I have the pages-articles.xml file from Wikipedia and the solr
instructions  at
http://wiki.apache.org/solr/DataImportHandler#Example:_Indexing_wikipedia.
 It looks pretty straightforward but I have a couple of preparatory
questions.

If I index the pages-articles.xml on bar.com/solr, they will then be
pointing to the relative links on solr.com/mediawiki, which don't exist,
right?  So is there a way to tell solr that the base url for a bunch of
index records is different than what it thinks they are? Or would it be
easier simply to put a solr installation on foo.com?




\

FredZ

Reply via email to