So how to deal with legitimate uses that require many requests?
Is it better to not serve them at all?
On Jan 16, 2016 19:50, "John" <phoenixoverride(a)gmail.com> wrote:
In cases of excessive resource usage we have several
options. Contact the
source, throttle them, or flat out disable access depending on what each
case calls for.
I have seen the dev team to this liberally in the past when needed. If any
one person or group is exploiting us by using unproportionate amounts of
resources thats one thing, if we are just trying to make money by selling
access to what we already have thats another. Limiting abusive sources
shouldnt be an issue, but as soon as we start selling access we loose sight
of our mission.
On Sat, Jan 16, 2016 at 9:11 PM, Denny Vrandecic <dvrandecic(a)wikimedia.org
wrote:
I find it rather surprising, but I very much find
myself in agreement
with
> most what Andreas Kolbe said on this thread.
> To give a bit more thoughts: I am not
terribly worried about current
> crawlers. But currently, and more in the future, I expect us to provide
> more complex and this expensive APIs: a SPARQL endpoint, parsing APIs,
etc.
These will be simply expensive to operate. Not
for infrequent users -
say,
to the benefit of us 70,000 editors - but for use
cases that involve tens
or millions of requests per day. These have the potential of burning a
lot
of funds to basically support the operations of
commercial companies
whose
> mission might or might not be aligned with our.
> Is monetizing such use cases really
entirely unthinkable? Even under
> restrictions like the ones suggested by Andreas, or other such
restrictions
> we should discuss?
> On Jan 16, 2016 3:49 PM, "Risker" <risker.wp(a)gmail.com> wrote:
> > Hmm. The majority of those
crawlers are from search engines - the very
> > search engines that keep us in the top 10 of their results (and often
in
the top
3), thus leading to the usage and donations that we need to
survive. If they have to pay, then they might prefer to change their
algorithm, or reduce the frequency of scraping (thus also failing to
catch
updates to articles including removal of
vandalism in the lead
paragraphs,
> which is historically one of the key reasons for frequently crawling
the
> same articles). Those crawlers are what
attracts people to our sites,
to
> > read, to make donations, to possibly edit. Of course there are lesser
> > crawlers, but they're not really big players.
>
> > I'm at a loss to
understand why the Wikimedia Foundation should take on
> the
> > costs and indemnities associated with hiring staff to create a for-pay
> API
> > that would have to meet the expectations of a customer (or more than
one
> customer) that hasn't even agreed to pay
for access. If they want a
> specialized API (and we've been given no evidence that they do), let
THEM
> hire the staff, pay them, write the code in
an appropriately
open-source
> > way, and donate it to the WMF with the understanding that it could be
> > modified as required, and that it will be accessible to everyone.
>
> > It is good that the WMF has
studied the usage patterns. Could a link
be
> given to the report, please? It's
public, correct? This is exactly
the
point of
transparency. If only the WMF has the information, then it
gives
> an excuse for the community's comments to be ignored "because they
don't
> > know the facts". So let's lay out all the facts on the table,
please.
>
> > Risker/Anne
>
>
>
> > On
16 January 2016 at 15:06, Vituzzu <vituzzu.wiki(a)gmail.com> wrote:
>
> > > Thank you for sharing
this but, above all, to focus on digging real
> data.
> >
> > > IMHO we
shouldn't forget our mission, so licenses must be as free as
> > > possible. Turning into something "more closed" would definitely
deplete
> one
> > of the most valuable source (the open source world) of volunteering
we
> > have.
> >
> > > Crawlers'
owner should definitely share our increasing expenses but
any
> > > kind of agreement with them should include ways to improve our
> userbase.
> > > I'm wondering about an agreement with Google (or any other player) to
> add
> > > an "edit" button to knowledge graph. Sort of a "knowledge
vs. users"
> > > agreement.
> >
> > > So, we definitely
need a long term strategy which the Foundation will
> > > pursue in *negotiating* with anyone who wants a big scale access to
> *our
> > > resources* (while access to our knowledge will have no limits, as
> usual).
> >
> > > Vito
> >
> >
> > > Il 16/01/2016 19:21, Lila Tretikov ha scritto:
> >
> > >> To share some
context of the discussion the board had around this
-- I
>>
don't think the minutes give enough detail. APIs -- as we are freely
and
>> rapidly creating them today are
important, but are not quite at the
core
>> of
>> the issue we are facing.
>>
>> Today Wikimedia is the largest internet channel for open free
knowledge
in
>> the world. But the trends are against us. We have to face them
together.
>> We
>> have to have the answers on how. The strategic discussion next week
will
> >> help guide us.
> >>
> >> Over the last year we looked at the trends in Wikimedia traffic,
> internet
> >> as a whole and user behaviors. It took a lot of research. When we
> started
> >> the process we have not had solid internal data about unique
visitors
or
>> human vs. crawler usage on the site. For
a top 10 website this is a
big
> >> issue; it hurts our ability to make smart decisions. We've learned a
> lot.
> >>
> >> We found data that supports Leigh's point -- our permissive license
> >> supports our core value, we are (I know I am) here for free
knowledge.
> Yet
> >> it allows others to use the content in ways that truncates,
simplifies
> and
> >> reduces it. More importantly this type of reuse separates our
readers
from
>> our site, disconnecting readers from our contributors (no edit
buttons)
> and
> ultimately reduces traffic. Is this a problem? I'd like to hear if
people
>> on this list see it as such. And how we sustain contributions over
time.
>>
>> Meanwhile estimated half of our hosting is used to support crawlers
that
>> scan our content. This has an associated
cost in infrastructure,
power,
> >> servers, employees to support some well-funded organizations. The
> content
> >> is used for a variety of commercial purposes, sometimes having
nothing
to
>> do with putting our contributor's work in front of more readers.
Still,
> we
> >> can say this is tangentially supportive of our mission.
> >>
> >> As these two trends increase without our intervention, our traffic
> decline
> >> will accelerate, our ability to grow editors, content and cover
costs
> will
> >> decline as well.
> >>
> >> The first question on the upcoming consultation next week will be
> squarely
> >> on this. Please help us. API conversation is a consequence of this
> >> challenge. If we were to build more for reuse: APIs are a good way
to
do
>> so. If we are to somehow incentivize
users of SIri to come back to
>> Wikipedia, what would we need to do? Should we improve our site so
more
>
people come to us directly as the first stop? How do we bring people
into
> our world vs. the world of commercial
knowledge out there? How do we
fund
>> this if the people moved to access our content through other
interfaces
(a
>> trend that has been accelerating)?
>>
>> Those are the core questions we need to face. We will have to have
some
> >> uncomfortable, honest discussions as we test our hypothesis this
year.
> > The
> > >> conversation next week is a good start to prioritize those. Please
> join
> > >> it.
> > >>
> > >> Lila
> > >>
> > >>
> > >>
> > >> On Sat, Jan 16, 2016 at 6:11 AM, Leigh Thelmadatter <
> > osamadre(a)hotmail.com
> > >>
> > >>
wrote:
> > >>
> > >> If we are concerned about Google taking unfair advantage of
Wikipedia,
> one
> >>> simple solution is to allow content donations with a non-commercial
> >>> restriction. Right now, the concept of "free" include
commercial
use.
An
>>> added bonus to this is that we would get a lot more institutional
>>> donations
>>> of content if we allowed an non-commercial option.
>>> My problem with allowing for paying for "premium access" is that
we
are
>>> allowing Google to have a
priviledged position. There is no way
around
> >>> that.
> >>> What is the impetus behind this proposal? Its not like we are
lacking
>>> money. And limiting growth of the Foundation is not a bad thing...
at
> >>> least not to the community.
> >>>
> >>>
> >>> To: wikimedia-l(a)lists.wikimedia.org
> >>>> From: ricordisamoa(a)openmailbox.org
> >>>> Date: Sat, 16 Jan 2016 14:13:06 +0100
> >>>> Subject: Re: [Wikimedia-l] Monetizing Wikimedia APIs
> >>>>
> >>>> "Imagine a world in which every single human being can
freemiumly
> share
> >>>> in the sum of all knowledge." XD
> >>>>
> >>>> Il 16/01/2016 10:23, Pete Forsyth ha scritto:
> >>>>
> >>>>> I'm interested to hear some perspectives on the following
line of
> >>>>>
> >>>> thinking:
> >>>
> >>>> Lisa presented some alternative strategies for revenue needs for
the
>>>>> Foundation, including the possibility of charging for premium
access
> >>>>>
> >>>> to the
> >>>
> >>>> services and APIs, expanding major donor and foundation
fundraising,
> >>>>> providing specific
services for a fee, or limiting the Wikimedia
> >>>>> Foundation's growth. The Board emphasized the importance of
keeping
> >>>>>
> >>>> free
> >>>
> >>>> access to the existing APIs and services, keeping operational
growth
> in
> >>>>> line with the organization's effectiveness, providing room
for
> >>>>>
> >>>> innovation
> >>>
> >>>> in the Foundation's activities, and other potential
fundraising
> >>>>>
> >>>> strategies.
> >>>
> >>>> The Board asked Lila to analyze and develop some of these
potential
>>>> strategies for further discussion at a
Board meeting in 2016.
>>>> Source:
https://wikimediafoundation.org/wiki/Minutes/2015-11-07
>>>> -Pete[[User:Peteforsyth]]
>>>> _______________________________________________
>>>> Wikimedia-l mailing list, guidelines at:
>>>>
>>>
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
>>
>>> New messages to: Wikimedia-l(a)lists.wikimedia.org
>>>> Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
>>>>>
>>>> <mailto:wikimedia-l-request@lists.wikimedia.org
?subject=unsubscribe>
>>>
>>>>
>>>> _______________________________________________
>>>> Wikimedia-l mailing list, guidelines at:
>>>>
>>>
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
>>>
>>>> New messages to: Wikimedia-l(a)lists.wikimedia.org
>>>> Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l
> ,
> >>>>
> >>> <mailto:wikimedia-l-request@lists.wikimedia.org
?subject=unsubscribe>
>>>
>>> _______________________________________________
>>> Wikimedia-l mailing list, guidelines at:
>>>
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
>>> New messages to: Wikimedia-l(a)lists.wikimedia.org
>>> Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> >>> <mailto:wikimedia-l-request@lists.wikimedia.org
?subject=unsubscribe>
> > >>>
> > >>>
> > >>
> > >>
> >
> > >
_______________________________________________
> > > Wikimedia-l mailing list, guidelines at:
> > >
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > > New messages to: Wikimedia-l(a)lists.wikimedia.org
> > > Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > >
<mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>
> >
> >
_______________________________________________
> > Wikimedia-l mailing list, guidelines at:
> >
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> > New messages to: Wikimedia-l(a)lists.wikimedia.org
> > Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> > <mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>
> _______________________________________________
> Wikimedia-l mailing list, guidelines at:
>
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
> New messages to: Wikimedia-l(a)lists.wikimedia.org
> Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
> <mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>
_______________________________________________
Wikimedia-l mailing list, guidelines at:
https://meta.wikimedia.org/wiki/Mailing_lists/Guidelines
New messages to: Wikimedia-l(a)lists.wikimedia.org
Unsubscribe:
https://lists.wikimedia.org/mailman/listinfo/wikimedia-l,
<mailto:wikimedia-l-request@lists.wikimedia.org?subject=unsubscribe>