Does anyone have a solution for excluding "section
fronts" (also called "menu pages" or "index pages")
from search results?

For the most part, ht://dig is returning great search
results. But section front pages rank very high and
really don't have the actual content users are
searching for. 

I have set backlink_factor to zero, which seems to
have helped somewhat.

There are too many section front pages to add to the
exclude_urls list. (Plus, it seems that this would
eliminate the entire section.) Excluding all files
named *index* is not a solution either.
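
To illustrate why excluding a section front seems to take
the whole section with it, here is a minimal Python sketch.
It assumes exclude patterns are matched as plain substrings
of the URL, which is my reading of how ht://Dig treats them;
the is_excluded helper and the example.com URLs are
hypothetical and are not part of ht://Dig.

```python
from typing import Iterable


def is_excluded(url: str, patterns: Iterable[str]) -> bool:
    """Return True if the URL contains any pattern as a substring.

    Assumption: this mirrors substring-style exclusion; it is not
    ht://Dig's actual implementation.
    """
    return any(pattern in url for pattern in patterns)


# Hypothetical URLs: one section front, one story inside that
# section, and one unrelated section front.
urls = [
    "http://www.example.com/pages/world/",
    "http://www.example.com/pages/world/story-123.html",
    "http://www.example.com/pages/sports/",
]

# Excluding the section front by its path also matches every
# story that lives under the same path.
patterns = ["/pages/world/"]
excluded = [u for u in urls if is_excluded(u, patterns)]

for u in excluded:
    print("excluded:", u)
```

Under that assumption, the story page is dropped along with
the section front, which is exactly the side effect I want
to avoid.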

Ideally, an htdig-noindex-follow meta tag would be
great for me, since the pages are dynamic. This
question has come up before:
http://www.htdig.org/mail/1999/10/0069.html

A simple <meta name="robots" content="noindex,follow">
is not a solution, because it would also keep external
search engines from indexing these pages, and they are
very important there. (Think of the NYTimes.com
international news page,
http://www.nytimes.com/pages/world/ : good content,
but probably not what you are looking for with an
intrasite search.)

I think what I'll end up doing is "cloaking" my pages
for ht://dig. If the user agent is htdig, then I'll
output a robots meta tag for it only:
<meta name="robots" content="noindex,follow">
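
A rough sketch of that cloaking check in Python, purely for
illustration. It assumes the crawler's User-Agent header
contains the string "htdig" and that the page template
already knows whether it is rendering a section front; the
robots_meta_for function and the sample user-agent strings
are hypothetical.

```python
NOINDEX_FOLLOW = '<meta name="robots" content="noindex,follow">'


def robots_meta_for(user_agent, is_section_front):
    """Return the robots meta tag to emit for this request.

    Only section fronts requested by ht://Dig get the
    noindex,follow tag; every other request gets nothing, so
    external search engines still index the page normally.
    Assumption: the crawler identifies itself with a User-Agent
    that contains "htdig" (matched case-insensitively).
    """
    agent = (user_agent or "").lower()
    if is_section_front and "htdig" in agent:
        return NOINDEX_FOLLOW
    return ""


# Hypothetical requests: the site crawler and a regular browser.
print(repr(robots_meta_for("htdig/3.1.6 (webmaster@example.com)", True)))
print(repr(robots_meta_for("Mozilla/5.0", True)))
```

The tag would go in the page's <head>. Story pages pass
is_section_front=False, so they stay indexed for everyone.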

Thanks much,
Josh
