seth wrote:
> Hi!
> I wrote a Perl script that works on the HTML content of some
> Wikipedia pages. Some of those pages are >300 kB, and Perl's
> LWP mirror hangs.
> Two questions:
> - Is there a better/faster way to get the HTML content of e.g.
>   http://meta.wikimedia.org/wiki/Spam_blacklist/Log than
>   my $ua = LWP::UserAgent->new; $ua->mirror($url, $filename); ?
To get the content of Wikipedia pages you should use WikiProxy: http://meta.wikimedia.org/wiki/User:Duesentrieb/WikiProxy
If you still need to fetch it yourself, you can launch an external tool (wget, curl...) to download the page and then read it as a normal file.
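For example, a minimal sketch of that approach in Perl (the URL is the one from your question; the output filename is just an example):

  #!/usr/bin/perl
  use strict;
  use warnings;

  # URL from the original question; the filename is illustrative.
  my $url      = 'http://meta.wikimedia.org/wiki/Spam_blacklist/Log';
  my $filename = 'spam_blacklist_log.html';

  # Let wget do the download; -q = quiet, -O = output file.
  # system() returns 0 on success.
  system('wget', '-q', '-O', $filename, $url) == 0
      or die "wget failed: $?";

  # Then read the result as a normal file.
  open my $fh, '<', $filename or die "cannot open $filename: $!";
  my $html = do { local $/; <$fh> };   # slurp the whole file
  close $fh;

  print "fetched ", length($html), " bytes\n";

Alternatively, setting a timeout on the UserAgent (e.g. $ua->timeout(30) before calling mirror) may at least keep the script from hanging indefinitely.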
> - If I have questions about such stuff, am I right here? Otherwise,
>   sorry for bothering you. :-)
> Cheers, seth
Yes, this is a good place :)
Platonides