Thanks, that's helpful. But if I understand what it's saying, although it shows how to point to a different config directory, you're limited to one alternate configuration since you have to set the location in the NUTCH_CONF_DIR environment variable.
What I really would like is a way to pass in the location of the config files (e.g. nutch-default.xml, regex-urlfilter.txt, etc.) as an argument to the nutch script, so that I can have multiple config files (each for a different site I wish to crawl). I found a somewhat clumsy way to accomplish this, by modfiying the nutch script so that it prepends the current directory to the CLASSPATH and running a copy of the script from the directory that has my config files. This way the script first looks in the current directory and it picks up my site-specific config files. By the way, sorry for the extra post - haven't used a mailing list in awhile. --- Juho Mäkinen <[EMAIL PROTECTED]> wrote: > Take a look into Nutch Wiki FAQ here: > http://wiki.apache.org/nutch/FAQ > And find the Q/A for "How can I force fetcher to use > custom nutch-config?" > > - Juho Mäkinen, http://www.juhonkoti.net > > On 7/8/05, Raymond Creel <[EMAIL PROTECTED]> > wrote: > > I'm just getting started with Nutch. Does someone > > know how I may be able to get the nutch > command-line > > script to load different > > nutch-default.xml/nutch-site.xml files than what > is in > > the nutch/conf directory? I want to be able to > run > > nutch at different sites with different startup > > configurations. > > > > Thanks, > > Raymond Creel > > > > > > > > __________________________________ > > Do you Yahoo!? > > Read only the mail you want - Yahoo! Mail > SpamGuard. > > http://promotions.yahoo.com/new_mail > > > __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com