Oops.

Perhaps I should have explained a little more.

I don't want to use the ROBOTS META tag, since that would also prevent
other search engines from indexing the site.

I thought about creating a custom htdig-robots.txt file specifically for
htdig.

Regards,

Rupert.

-----Original Message-----
From: Malcolm Austen [mailto:[EMAIL PROTECTED]]
Sent: 25 March 2004 15:28
To: Rupert Jones
Cc: [EMAIL PROTECTED]
Subject: Re: [htdig] Prevent indexing of HTML files?

On 25/03/2004 14:58, Rupert Jones wrote:
> Does anyone know if there is any way to prevent ht://dig from adding HTML
> files to the index, but still parse them for links?
> Basically, I want ht://dig to index .doc, .pdf and .xls files on a site,
> but not the .html (or actually, .php) files. But it still needs to parse
> these pages, as the .doc/.pdf/.xls files are linked from the .php pages.

The documentation says it obeys the standard rules for ROBOTS metadata:

      http://www.robotstxt.org/wc/meta-user.html

and I have no cause to doubt that it does just that!
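
For illustration only, the combination described on that page for "do not index this page, but do follow its links" is content="noindex, follow". The Python below is a rough sketch of how a crawler might read that tag. It is not ht://Dig's actual code, and the names RobotsMetaParser and robots_policy are made up for this example. Note that the tag is honoured by any robot that obeys the convention, not by ht://Dig alone.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect directives from <meta name="robots" content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        for token in attr.get("content", "").split(","):
            token = token.strip().lower()
            if token:
                self.directives.add(token)


def robots_policy(html):
    """Return (may_index, may_follow); both default to True."""
    parser = RobotsMetaParser()
    parser.feed(html)
    found = parser.directives
    may_index = not ({"noindex", "none"} & found)
    may_follow = not ({"nofollow", "none"} & found)
    return may_index, may_follow


# Hypothetical page: a .php listing that only links to the documents.
page = (
    '<html><head><meta name="robots" content="noindex, follow"></head>'
    '<body><a href="report.pdf">Report</a></body></html>'
)
print(robots_policy(page))  # (False, True)
```

With that tag in place the page itself is skipped, but the link to report.pdf is still eligible to be followed.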

regards, Malcolm.



_______________________________________________
ht://Dig general mailing list: <[EMAIL PROTECTED]>
ht://Dig FAQ: http://htdig.sourceforge.net/FAQ.html
List information (subscribe/unsubscribe, etc.)
https://lists.sourceforge.net/lists/listinfo/htdig-general
