Ken, I'm forwarding your inquiry to wikitech-l.
----- Forwarded message from Ken Dobruskin <ken@dobruskin.com> -----
From: Ken Dobruskin <ken@dobruskin.com>
Date: Fri, 19 Dec 2003 12:00:02 +0100 (CET)
To: Jimbo Wales <jwales@bomis.com>
Subject: Forbidden access to wikipedia server
Greetings!
After reading the Wikipedia GFDL and policy on robots, I thought I'd try an experiment, absolutely in line with said policies.
However, when I tried to access the site from my server, it failed with a 403.
User-agent: Python-urllib/1.15
IP address: 216.28.158.40
Sorry if I should be asking this on the mailing list, but I did see a note saying that access issues should be addressed to a site admin.
Would appreciate your advice.
Best 'net regards,
Ken
----- End forwarded message -----
----- Forwarded message from Ken Dobruskin -----
However, when I tried to access the site from my server, it failed with a 403.
User-agent: Python-urllib/1.15
Along with dozens of whole-site-downloaders and known mail-harvester bots, Wget, Python-urllib, and libwww-perl are blocked by default, sorry.
Please give your bot a real user-agent string that can be _individually_ blocked if your bot accidentally goes mad (hey, it happens).
-- brion vibber (brion @ pobox.com)
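For illustration, here is a minimal sketch of how a script using Python's standard urllib could send its own user-agent string instead of the default Python-urllib one. It is written against the current urllib.request module rather than the 2003-era urllib, and the bot name, version, URL, and contact address are hypothetical placeholders, not anything the site prescribes.

```python
import urllib.request

# Hypothetical bot name, version, and contact details; substitute your own.
BOT_USER_AGENT = "ExampleWikiBot/0.1 (+http://example.org/bot; bot-owner@example.org)"


def build_request(url, user_agent=BOT_USER_AGENT):
    """Return a Request that identifies the bot by name.

    Without an explicit header, urllib sends a generic
    "Python-urllib/x.y" string, which is the kind that gets blocked.
    """
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch(url, user_agent=BOT_USER_AGENT, timeout=30):
    """Fetch a page with the identifying User-Agent and return the raw bytes."""
    request = build_request(url, user_agent)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

A distinct name plus a contact address lets an admin block or reach that one bot without affecting every other urllib-based client.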
Ken,
You're welcome to join the pywikipediabot project on SourceForge. You may be able to reuse quite a bit of code that has already been written.
Rob
wikitech-l@lists.wikimedia.org