Hi -
Looking for some help with server_aliases
Ive been using htdig for a few months now, and find it to be excellent.
Im having trouble with the server_aliases however... ive tried a number of
different combinations from searching the mailling list archive and reading the
online help, but cant seem to get it right....
Our site has 16 aliases (8 really, but both WWW and no WWW refer to the same site)...
all of which point to the same site
The problem is when i search for say the world 'help' it'll retrieve 5 or 6
dupilicates the only thing different being the URL pointing to this page.
This leads me to believe alot of duplication might be going on, and the database is
larger then it needs to be.. (not to mention the duplicate results returned to the
user)
here is what the relative keywords configuration are set as:
limit_urls_to: nettrash.com netjunk.com netgarbage.com nettoilet.com
#limit_urls_to: internettrash.com (also tried just internettrash.com with no luck)
limit_normalized: http://internettrash.com
start_url: http://internettrash.com/
http://internettrash.com/userlist.html
allow_virtual_hosts: false
server_aliases: www.internettrash.com=internettrash.com \
www.internetgarbage.com=internettrash.com \
internetgarbage.com=internettrash.com \
www.netgarbage.com=internettrash.com \
netgarbage.com=internettrash.com \
www.internetjunk.com=internettrash.com \
internetjunk.com=internettrash.com \
www.netjunk.com=internettrash.com \
netjunk.com=internettrash.com \
www.internettoilet.com=internettrash.com \
internettoilet.com=internettrash.com \
www.nettoilet.com=internettrash.com \
nettoilet.com=internettrash.com \
www.nettrash.com=internettrash.com \
nettrash.com=internettrash.com
Any help would be appreciated!!!!
Thanks,
Rob
------------------------------------
To unsubscribe from the htdig mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.