So how to deal with legitimate uses that require many requests? Is it better to not serve them at all? On Jan 16, 2016 19:50, "John" <phoenixoverr...@gmail.com> wrote:
> In cases of excessive resource usage we have several options. Contact the > source, throttle them, or flat out disable access depending on what each > case calls for. > > I have seen the dev team to this liberally in the past when needed. If any > one person or group is exploiting us by using unproportionate amounts of > resources thats one thing, if we are just trying to make money by selling > access to what we already have thats another. Limiting abusive sources > shouldnt be an issue, but as soon as we start selling access we loose sight > of our mission. > > On Sat, Jan 16, 2016 at 9:11 PM, Denny Vrandecic <dvrande...@wikimedia.org > > > wrote: > > > I find it rather surprising, but I very much find myself in agreement > with > > most what Andreas Kolbe said on this thread. > > > > To give a bit more thoughts: I am not terribly worried about current > > crawlers. But currently, and more in the future, I expect us to provide > > more complex and this expensive APIs: a SPARQL endpoint, parsing APIs, > etc. > > These will be simply expensive to operate. Not for infrequent users - > say, > > to the benefit of us 70,000 editors - but for use cases that involve tens > > or millions of requests per day. These have the potential of burning a > lot > > of funds to basically support the operations of commercial companies > whose > > mission might or might not be aligned with our. > > > > Is monetizing such use cases really entirely unthinkable? Even under > > restrictions like the ones suggested by Andreas, or other such > restrictions > > we should discuss? > > On Jan 16, 2016 3:49 PM, "Risker" <risker...@gmail.com> wrote: > > > > > Hmm. The majority of those crawlers are from search engines - the very > > > search engines that keep us in the top 10 of their results (and often > in > > > the top 3), thus leading to the usage and donations that we need to > > > survive. If they have to pay, then they might prefer to change their > > > algorithm, or reduce the frequency of scraping (thus also failing to > > catch > > > updates to articles including removal of vandalism in the lead > > paragraphs, > > > which is historically one of the key reasons for frequently crawling > the > > > same articles). Those crawlers are what attracts people to our sites, > to > > > read, to make donations, to possibly edit. Of course there are lesser > > > crawlers, but they're not really big players. > > > > > > I'm at a loss to understand why the Wikimedia Foundation should take on > > the > > > costs and indemnities associated with hiring staff to create a for-pay > > API > > > that would have to meet the expectations of a customer (or more than > one > > > customer) that hasn't even agreed to pay for access. If they want a > > > specialized API (and we've been given no evidence that they do), let > THEM > > > hire the staff, pay them, write the code in an appropriately > open-source > > > way, and donate it to the WMF with the understanding that it could be > > > modified as required, and that it will be accessible to everyone. > > > > > > It is good that the WMF has studied the usage patterns. Could a link > be > > > given to the report, please? It's public, correct? This is exactly > the > > > point of transparency. If only the WMF has the information, then it > > gives > > > an excuse for the community's comments to be ignored "because they > don't > > > know the facts". So let's lay out all the facts on the table, please. > > > > > > Risker/Anne > > > > > > > > > > > > On 16 January 2016 at 15:06, Vituzzu <vituzzu.w...@gmail.com> wrote: > > > > > > > Thank you for sharing this but, above all, to focus on digging real > > data. > > > > > > > > IMHO we shouldn't forget our mission, so licenses must be as free as > > > > possible. Turning into something "more closed" would definitely > deplete > > > one > > > > of the most valuable source (the open source world) of volunteering > we > > > have. > > > > > > > > Crawlers' owner should definitely share our increasing expenses but > any > > > > kind of agreement with them should include ways to improve our > > userbase. > > > > I'm wondering about an agreement with Google (or any other player) to > > add > > > > an "edit" button to knowledge graph. Sort of a "knowledge vs. users" > > > > agreement. > > > > > > > > So, we definitely need a long term strategy which the Foundation will > > > > pursue in *negotiating* with anyone who wants a big scale access to > > *our > > > > resources* (while access to our knowledge will have no limits, as > > usual). > > > > > > > > Vito > > > > > > > > > > > > Il 16/01/2016 19:21, Lila Tretikov ha scritto: > > > > > > > >> To share some context of the discussion the board had around this > -- I > > > >> don't think the minutes give enough detail. APIs -- as we are freely > > and > > > >> rapidly creating them today are important, but are not quite at the > > core > > > >> of > > > >> the issue we are facing. > > > >> > > > >> Today Wikimedia is the largest internet channel for open free > > knowledge > > > in > > > >> the world. But the trends are against us. We have to face them > > together. > > > >> We > > > >> have to have the answers on how. The strategic discussion next week > > will > > > >> help guide us. > > > >> > > > >> Over the last year we looked at the trends in Wikimedia traffic, > > > internet > > > >> as a whole and user behaviors. It took a lot of research. When we > > > started > > > >> the process we have not had solid internal data about unique > visitors > > or > > > >> human vs. crawler usage on the site. For a top 10 website this is a > > big > > > >> issue; it hurts our ability to make smart decisions. We've learned a > > > lot. > > > >> > > > >> We found data that supports Leigh's point -- our permissive license > > > >> supports our core value, we are (I know I am) here for free > knowledge. > > > Yet > > > >> it allows others to use the content in ways that truncates, > simplifies > > > and > > > >> reduces it. More importantly this type of reuse separates our > readers > > > from > > > >> our site, disconnecting readers from our contributors (no edit > > buttons) > > > >> and > > > >> ultimately reduces traffic. Is this a problem? I'd like to hear if > > > people > > > >> on this list see it as such. And how we sustain contributions over > > time. > > > >> > > > >> Meanwhile estimated half of our hosting is used to support crawlers > > that > > > >> scan our content. This has an associated cost in infrastructure, > > power, > > > >> servers, employees to support some well-funded organizations. The > > > content > > > >> is used for a variety of commercial purposes, sometimes having > nothing > > > to > > > >> do with putting our contributor's work in front of more readers. > > Still, > > > we > > > >> can say this is tangentially supportive of our mission. > > > >> > > > >> As these two trends increase without our intervention, our traffic > > > decline > > > >> will accelerate, our ability to grow editors, content and cover > costs > > > will > > > >> decline as well. > > > >> > > > >> The first question on the upcoming consultation next week will be > > > squarely > > > >> on this. Please help us. API conversation is a consequence of this > > > >> challenge. If we were to build more for reuse: APIs are a good way > to > > do > > > >> so. If we are to somehow incentivize users of SIri to come back to > > > >> Wikipedia, what would we need to do? Should we improve our site so > > more > > > >> people come to us directly as the first stop? How do we bring people > > > into > > > >> our world vs. the world of commercial knowledge out there? How do we > > > fund > > > >> this if the people moved to access our content through other > > interfaces > > > (a > > > >> trend that has been accelerating)? > > > >> > > > >> Those are the core questions we need to face. We will have to have > > some > > > >> uncomfortable, honest discussions as we test our hypothesis this > year. > > > The > > > >> conversation next week is a good start to prioritize those. Please > > join > > > >> it. > > > >> > > > >> Lila > > > >> > > > >> > > > >> > > > >> On Sat, Jan 16, 2016 at 6:11 AM, Leigh Thelmadatter < > > > osama...@hotmail.com > > > >> > > > > >> wrote: > > > >> > > > >> If we are concerned about Google taking unfair advantage of > Wikipedia, > > > one > > > >>> simple solution is to allow content donations with a non-commercial > > > >>> restriction. Right now, the concept of "free" include commercial > use. > > > An > > > >>> added bonus to this is that we would get a lot more institutional > > > >>> donations > > > >>> of content if we allowed an non-commercial option. > > > >>> My problem with allowing for paying for "premium access" is that we > > are > > > >>> allowing Google to have a priviledged position. There is no way > > around > > > >>> that. > > > >>> What is the impetus behind this proposal? Its not like we are > lacking > > > >>> money. And limiting growth of the Foundation is not a bad thing... > > at > > > >>> least not to the community. > > > >>> > > > >>> > > > >>> To: wikimedia-l@lists.wikimedia.org > > > >>>> From: ricordisa...@openmailbox.org > > > >>>> Date: Sat, 16 Jan 2016 14:13:06 +0100 > > > >>>> Subject: Re: [Wikimedia-l] Monetizing Wikimedia APIs > > > >>>> > > > >>>> "Imagine a world in which every single human being can freemiumly > > > share > > > >>>> in the sum of all knowledge." XD > > > >>>> > > > >>>> Il 16/01/2016 10:23, Pete Forsyth ha scritto: > > > >>>> > > > >>>>> I'm interested to hear some perspectives on the following line of > > > >>>>> > > > >>>> thinking: > > > >>> > > > >>>> Lisa presented some alternative strategies for revenue needs for > the > > > >>>>> Foundation, including the possibility of charging for premium > > access > > > >>>>> > > > >>>> to the > > > >>> > > > >>>> services and APIs, expanding major donor and foundation > fundraising, > > > >>>>> providing specific services for a fee, or limiting the Wikimedia > > > >>>>> Foundation's growth. The Board emphasized the importance of > keeping > > > >>>>> > > > >>>> free > > > >>> > > > >>>> access to the existing APIs and services, keeping operational > growth > > > in > > > >>>>> line with the organization's effectiveness, providing room for > > > >>>>> > > > >>>> innovation > > > >>> > > > >>>> in the Foundation's activities, and other potential fundraising > > > >>>>> > > > >>>> strategies. > > > >>> > > > >>>> The Board asked Lila to analyze and develop some of these > potential > > > >>>>> strategies for further discussion at a Board meeting in 2016. > > > >>>>> Source: https://wikimediafoundation.org/wiki/Minutes/2015-11-07 > > > >>>>> -Pete[[User:Peteforsyth]] > > > >>>>> _______________________________________________ > > > >>>>> Wikimedia-l mailing list, guidelines at: > > > >>>>> > > > >>>> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > > >>> > > > >>>> New messages to: Wikimedia-l@lists.wikimedia.org > > > >>>>> Unsubscribe: > > > https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > > > >>>>> > > > >>>> <mailto:wikimedia-l-requ...@lists.wikimedia.org > > ?subject=unsubscribe> > > > >>> > > > >>>> > > > >>>> _______________________________________________ > > > >>>> Wikimedia-l mailing list, guidelines at: > > > >>>> > > > >>> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > > >>> > > > >>>> New messages to: Wikimedia-l@lists.wikimedia.org > > > >>>> Unsubscribe: > > https://lists.wikimedia.org/mailman/listinfo/wikimedia-l > > > , > > > >>>> > > > >>> <mailto:wikimedia-l-requ...@lists.wikimedia.org > ?subject=unsubscribe> > > > >>> > > > >>> _______________________________________________ > > > >>> Wikimedia-l mailing list, guidelines at: > > > >>> https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > > >>> New messages to: Wikimedia-l@lists.wikimedia.org > > > >>> Unsubscribe: > > https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > > > >>> <mailto:wikimedia-l-requ...@lists.wikimedia.org > ?subject=unsubscribe> > > > >>> > > > >>> > > > >> > > > >> > > > > > > > > _______________________________________________ > > > > Wikimedia-l mailing list, guidelines at: > > > > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > > > New messages to: Wikimedia-l@lists.wikimedia.org > > > > Unsubscribe: > https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > > > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe> > > > > > > > _______________________________________________ > > > Wikimedia-l mailing list, guidelines at: > > > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > > New messages to: Wikimedia-l@lists.wikimedia.org > > > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe> > > _______________________________________________ > > Wikimedia-l mailing list, guidelines at: > > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > > New messages to: Wikimedia-l@lists.wikimedia.org > > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe> > > > _______________________________________________ > Wikimedia-l mailing list, guidelines at: > https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines > New messages to: Wikimedia-l@lists.wikimedia.org > Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, > <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe> _______________________________________________ Wikimedia-l mailing list, guidelines at: https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines New messages to: Wikimedia-l@lists.wikimedia.org Unsubscribe: https://lists.wikimedia.org/mailman/listinfo/wikimedia-l, <mailto:wikimedia-l-requ...@lists.wikimedia.org?subject=unsubscribe>