Le 05-08-25 à 18:51, Bob Wyman a écrit :
At PubSub we *never* "crawl" to discover feed URLs. The only feeds
we know about are:
1. Feeds that have announced their presence with a ping
2. Feeds that have been announced to us via a FeedMesh message.
3. Feeds that have been manually submitted to us via our "add-
feed"
page.
We don't crawl.
- How one who has previously submitted a feed URL remove it from the
index? (Change of opinions)
- How someone who's not mastering the ping (built-in in the service,
the software) but doesn't want his/her feed being indexed by the
service.
I do not think we qualify as a "robot" in the sense that is
relevant
to robots.txt. It would appear that Walter Underwood of Verity
would agree
with me since he says in his recent post that: "I would call
desktop clients
"clients" not "robots". The distinction is how they add feeds to
the polling
list. Clients add them because of human decisions. Robots discover
them
mechanically and add them." If Walter is correct, then he must
agree with me
that robots.txt does not apply to PubSub! (and, we should not be on
his
"bad" list.... Walter? Please take us off the list...)
It does apply, except if you give a possibility to each subscribers
to ban a specific URIs.
I think it's one of the main issue of the Web, too many implicit
contracts. That's good because it helps to make the things easier,
but at the same time, it means creating the infrastructure to deny
explicitly. I guess the Feed industry is not eager too much to do
that, because implicit data mining is one big part of the business.
Basically imagine this scenario, when you go out every morning from
your house, we take a photo of you and the way you are dress. It
helps us to know how the people of this area are dressed then to
create shops around. That will help us to send you appropriate
catalogs of clothes you like, and to park a car with an ads with the
products you usually like.
Do you have the right to say "No, I don't want to take a photo
of me every morning for that purpose"?
--
Karl Dubost - http://www.w3.org/People/karl/
W3C Conformance Manager
*** Be Strict To Be Cool ***