I have followed the installation instructions as best as I can understand them, the CONFIG file is like this:

##
## CONFIG
##
## This file is part of ht://Dig
##

#
# These variables are set by configure
#
# This specifies the root of the directory tree to be used by ht://Dig
prefix=                 /home/sites/www.volvoclub.org.uk/web/htdig

# This specifies the root of the directory tree to be used for programs
# installed by ht://Dig
exec_prefix=            ${prefix}

#
# Please modify the variables below to reflect your preferences.
#

#
# DEST
#
# This specifies the root of the directory tree to be used by ht://Dig
#
DEST=                  $(prefix)

#
# BIN_DIR
# Set this macro to where you want the binaries to be installed.
#
BIN_DIR=               ${exec_prefix}/bin

#
# CONFIG_DIR
# This is the directory that contains ht://Dig configuration files
#
CONFIG_DIR=            ${prefix}/conf

#
# COMMON_DIR
# This is the directory for files that can be shared between different
# databases.
#
COMMON_DIR=            ${prefix}/common

#
# DATABASE_DIR
# The default directory where the search databases will reside.
#
DATABASE_DIR=          ${prefix}/db

#
# DEFAULT_CONFIG_FILE
# This macro defines where the various programs will look for a configuration
# file.
#
DEFAULT_CONFIG_FILE=   ${CONFIG_DIR}/htdig.conf

#
# CGIBIN_DIR
# The directory where your HTTP server looks for CGI programs.  This is where
# htsearch will get installed.
#
CGIBIN_DIR=             /home/sites/www.volvoclub.org.uk/web/cgi-bin

#
# IMAGE_DIR
# Define this to be a place that can be accessed by your web server.  This is
# where a couple of images will be placed.
#
IMAGE_DIR=              /home/sites/www.volvoclub.org.uk/web/htdig

#
# IMAGE_URL_PREFIX
# This is the URL to prefix the images placed in IMAGE_DIR.
#
IMAGE_URL_PREFIX=        http://www.volvoclub.org.uk/htdig

#
# SEARCH_DIR
# Set this to the absolute path where you want the sample search form to
# be installed.
#
SEARCH_DIR=             /home/sites/www.volvoclub.org.uk/web/htdig

#
# SEARCH_FORM
# Set this to the name you want to give to the search form.  This form
# will be located in the SEARCH_DIR directory.
#
SEARCH_FORM=            search.html


and the following changes made to htdig.conf

#
# Example config file for ht://Dig.
#
# This configuration file is used by all the programs that make up ht://Dig.
# Please refer to the attribute reference manual for more details on what
# can be put into this file.  (http://www.htdig.org/confindex.html)
# Note that most attributes have very reasonable default values so you
# really only have to add attributes here if you want to change the defaults.
#
# What follows are some of the common attributes you might want to change.
#

#
# Specify where the database files need to go.  Make sure that there is
# plenty of free disk space available for the databases.  They can get
# pretty big.
#
database_dir:           /home/sites/www.volvoclub.org.uk/web/htdig/db

#
# This specifies the URL where the robot (htdig) will start.  You can specify
# multiple URLs here.  Just separate them by some whitespace.
# The example here will cause the ht://Dig homepage and related pages to be
# indexed.
# You could also index all the URLs in a file like so:
# start_url:           `${common_dir}/start.url`
#
start_url:              http://www.volvoclub.org.uk/

#
# This attribute limits the scope of the indexing process.  The default is to
# set it to the same as the start_url above.  This way only pages that are on
# the sites specified in the start_url attribute will be indexed and it will
# reject any URLs that go outside of those sites.
#
# Keep in mind that the value for this attribute is just a list of string
# patterns. As long as URLs contain at least one of the patterns it will be
# seen as part of the scope of the index.
#
limit_urls_to:          ${start_url}

#
# If there are particular pages that you definitely do NOT want to index, you
# can use the exclude_urls attribute.  The value is a list of string patterns.
# If a URL matches any of the patterns, it will NOT be indexed.  This is
# useful to exclude things like virtual web trees or database accesses.  By
# default, all CGI URLs will be excluded.  (Note that the /cgi-bin/ convention
# may not work on your web server.  Check the  path prefix used on your web
# server.)
#
exclude_urls:           /cgi-bin/ .cgi
                       /dc/ .php
                       /phpmyadmin/ .php
                                    
The first thing I noticed running rundig is that it ignored the exclude URLs. It indexed them, including a very large MySQL db it accessed through /dc/, and then deleted the entries. I then tired to use the search.html form in my browser, but when I pressed the button I got a 'The page cannot be displayed' error. See it at www.volvoclub.org.uk/htdig/search.html

I have been trying to get this to work for days, but getting nowhere, except very frustrated. I am using a Cobalt RaQ550 server with Apache 1.3 and Linux 2.4 Multitasking OS (GNU/Linux i386 (v 2.4.16C12_V))

Thanks.

Bob

VOLVO OWNERS CLUB ONLINE
Robert Isaac, Director, Volvo Owners Club Limited
All email messages are virus scanned before being sent
PLEASE INCLUDE ALL PREVIOUS MESSAGE TEXT WITH REPLY

Club web site: www.volvoclub.org.uk

Also visit: www.trisaac.com for
John Wayne Collectors Plates
Roil Products

Reply via email to