On Wed, Jun 04, 2003 at 06:00:19PM +0200, [EMAIL PROTECTED] wrote:
> of them were real; a very big part of them was useless 'cause it contained  
> the link to the page to which the menu belonged and the whole menu as abstract.

If I understand your problem correctly you can solve this with the
following:

http://htdig.org/FAQ.html#q4.15
4.15. Can I use meta tags to prevent htdig from indexing certain files?

Yes, in each HTML file you want to exclude, add the following between the
<HEAD> and </HEAD> tags:
<META NAME="robots" CONTENT="noindex, follow">

Doing so will allow htdig to still follow links to other documents, but
will prevent this document from being put into the index itself. You can
also use "nofollow" to prevent following of links. See the section on
Recognized META information for more details. For documents produced
automatically by MhonArc, you can have that line inserted automatically by
putting it in the MhonArc resource file, in the sections IDXPGBEGIN and
TIDXPGBEGIN.

You can also use the noindex_start and noindex_end attributes to define
one set of tags which will mark sections to be stripped out of documents,
so they don't get indexed, or you can mark sections with the non-DTD
<noindex> and </noindex> tags. The noindex_start and noindex_end
attributes can also be used to suppress in-line JavaScript code that
wasn't properly enclosed in HTML comment tags (see question 4.26). In
3.1.6, you can also put a section between <noindex follow> and </noindex>
tags to turn off indexing of text but still allow htdig to follow links.

-- 
Emma Jane Hogbin
[[ 416 417 2868 ][ www.xtrinsic.com ]]


-------------------------------------------------------
This SF.net email is sponsored by:  Etnus, makers of TotalView, The best
thread debugger on the planet. Designed with thread debugging features
you've never dreamed of, try TotalView 6 free at www.etnus.com.
_______________________________________________
htdig-general mailing list <[EMAIL PROTECTED]>
To unsubscribe, send a message to <[EMAIL PROTECTED]> with a subject of unsubscribe
FAQ: http://htdig.sourceforge.net/FAQ.html

Reply via email to