Re: [Wikitech-l] images torrent

28 Oct 2007


      On 10/27/07, Anthony wikimail@inbox.org wrote:
...
...
[[Wikipedia:Database download#Please do not use a web crawler]]
Have Google and Yahoo been informed of this policy?
Context: "Please do not use a web crawler to download large numbers of
articles."
As in "Don't use a web crawler to get big amounts of data for your own
personal use" (i.e. for mirroring). And it's quite valid, if lots of
people downloaded the entire site one article at a time, we'd end up
with big problems - especially seeing as the load would be evenly
distributed across many articles, and hence there'd be a lot of extra
parsing happening.
Google and Yahoo have nothing to do with this, as search engines would
represent a tiny portion of our requests (whereas many users doing a
lot of requesting would not), and use the data obtained for the public
benefit.
-- 
Andrew Garrett

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] images torrent