Hi Folks,
I'd like to be able to limit the number of meta keywords I dig. For
instance if I set the Max_Keywords attribute in the config file to 5,
htdig would only dig the first five keywords in meta tags, disregarding
the rest. If set to 0, default, htdig would dig all keywords.
My rational for this request:
_________________________________________________________________________
I run htdig for a college; each department maintains its site. It has a
webmaster, sort of, who maintains their web pages; some departments take
advantage of enthusiasm of their students and leave it to them to set up
the site. Sometimes students become over-enthusiastic;)
Only one, to my knowledge, of the over-enthusiastic students, or
webmasters, has gotten the idea of abusing meta keywords. They have
created a set of meta tag keywords of over a hundred very common English
words. The set has been placed in two dozen web pages in their site.
When one searched the campus web site for any of those words, the first
two dozen results would be from that site. Think how frustrating it is
for a site, whose very descriptive keywords would bring a dozen unrelated
pages before their page. It is also counter productive for the whole
campus; it defeats the purpose of a search engine.
I can, of course, exclude such abusive sites from the dig, as I have
already done for the above site, but I have to find them first;) I
stumbled upon that site by sheer luck the day after they set it up; who
knows, there may be others I have not discovered;( I'd like to have their
site dug, any way, without rewarding their abuse.
By limiting the number of keywords I limit, if not eliminate, such abuse.
_________________________________________________________________________
I appreciate your consideration.
Best regards,
Joe
--
_/ _/_/_/ _/ ____________ __o
_/ _/ _/ _/ ______________ _-\<,_
_/ _/ _/_/_/ _/ _/ ......(_)/ (_)
_/_/ oe _/ _/. _/_/ ah [EMAIL PROTECTED]
------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.