Barry Warsaw wrote:
>
>On Aug 31, 2009, at 1:15 PM, C Nulk wrote:
>
>> As for using robots.txt, hmm, it is not the legitimate search engines I
>> care about, it is the search engines/crawlers that do not respect my
>> robots.txt file that I care about.  If I had an effective way to
>> consistently identify those non-legitimate crawlers, I would add what I
>> needed to drop them into my firewall as I recognized them.
>
>Agreed.


The point in the original post about robots.txt was that if you think
obfuscation is undesirable and don't do it, but you get complaints
from people who find their unobfuscated addresses on your pages via
legitimate search engines, you can use robots.txt to keep the search
engines out.
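
As a rough illustration of what "keeping the search engines out" means
for a crawler that honors robots.txt, here is a minimal sketch using
Python's standard urllib.robotparser. The rules, the /pipermail/ path
and the example.com URLs are assumptions made up for the example, not
anything taken from a real site.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that asks every compliant crawler to stay out
# of the archive pages. The /pipermail/ path is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /pipermail/
"""


def allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Archive page: disallowed for compliant crawlers.
    print(allowed("Googlebot", "http://example.com/pipermail/list/msg1.html"))  # False
    # Page outside the disallowed prefix: still allowed.
    print(allowed("Googlebot", "http://example.com/mailman/listinfo/list"))  # True
```

This only models crawlers that choose to obey the file; it does nothing
about the ones that ignore it.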

However, robots.txt is not completely effective for this. You can use it
to prevent Google from crawling your site or portions of it, but it
won't prevent Google from indexing pages of yours that it finds via
external links. To prevent that, you need a <meta name="robots"
content="noindex"> tag on the pages themselves.
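
To check whether a given page actually carries that tag, something
along these lines may help. It is only a sketch built on the standard
html.parser module; the class and function names are invented for the
example, and it looks only at the generic "robots" meta name, not at
crawler-specific names.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives found in <meta name="robots"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        # HTMLParser already lowercases tag and attribute names.
        if tag != "meta":
            return
        attr = {name: (value or "") for name, value in attrs}
        if attr.get("name", "").strip().lower() != "robots":
            return
        # The content attribute is a comma-separated list of directives.
        for item in attr.get("content", "").split(","):
            item = item.strip().lower()
            if item:
                self.directives.add(item)


def has_noindex(html):
    """Return True if the page asks robots not to index it."""
    parser = RobotsMetaParser()
    parser.feed(html)
    parser.close()
    return "noindex" in parser.directives


if __name__ == "__main__":
    page = '<html><head><meta name="robots" content="noindex"></head></html>'
    print(has_noindex(page))  # True
```

Note that a crawler can only see the tag if it is allowed to fetch the
page, so a page that is meant to be dropped from the index this way
should not also be disallowed in robots.txt.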

-- 
Mark Sapiro <m...@msapiro.net>        The highway is for gamblers,
San Francisco Bay Area, California    better use your sense - B. Dylan

_______________________________________________
Mailman-Developers mailing list
Mailman-Developers@python.org
http://mail.python.org/mailman/listinfo/mailman-developers
Mailman FAQ: http://wiki.list.org/x/AgA3
Searchable Archives: 
http://www.mail-archive.com/mailman-developers%40python.org/
Unsubscribe: 
http://mail.python.org/mailman/options/mailman-developers/archive%40jab.org

Security Policy: http://wiki.list.org/x/QIA9
