What happens to those that ignore robot.txt files?
On Thu, Sep 3, 2015 at 3:48 PM Tony Harminc <t...@harminc.net> wrote:

> On 3 September 2015 at 11:18, Andy Higgins <ahigg...@transunion.com>
> wrote:
> >> I don't remember what was there originally...
> >
> > The Internet Archive Wayback Machine does:
> >
> >
> http://web.archive.org/web/*/http://www-03.ibm.com/systems/z/os/zos/library/bkserv/index.html
>
> That's surprising - most IBM pages have a robots.txt that doesn't want
> anything looked at/crawled/remembered.
>
> Tony H.
>
> ----------------------------------------------------------------------
> For IBM-MAIN subscribe / signoff / archive access instructions,
> send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN
>

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@listserv.ua.edu with the message: INFO IBM-MAIN

Reply via email to