[Pywikipedia-l] [Wikitech-l] serious interwiki.py issues on MW 1.18 wikis
Andre Engels
andreengels at gmail.com
Fri Sep 30 09:17:10 UTC 2011
On Fri, Sep 30, 2011 at 11:12 AM, Max Semenik <maxsem.wiki at gmail.com> wrote:
> On Fri, Sep 30, 2011 at 12:56 PM, Andre Engels <andreengels at gmail.com> wrote:
>
> >
> > The interwiki links are retrieved from page content. The page content has
> > been received through a call to Special:Export.
> >
> >
> > > I.e. would receiving no content (from the bot POV) produce that
> > > behavior?
> > >
> >
> > Yes, the only reasonable explanation seems to be that the bot interprets
> > what it gets from the server as an empty page.
> >
>
> So you screen-scrape? No surprise it breaks. Why? For example, due to
> protocol-relative URLs. Or some other changes to HTML output. Why not just
> use the API?
>
Basically, because most of the core functionality predates the API. At
least, that would be my explanation.
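
To illustrate the failure mode discussed above, here is a minimal sketch of
pulling interwiki links out of Special:Export output. It is not the actual
interwiki.py code: the function names, the shortened LANG_CODES set, the
sample XML and its namespace URI are all assumptions made for the example.
The point is only that an empty <text> element looks, from the bot's side,
exactly like a page without any interwiki links.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, shortened set of language codes; a real bot knows the full list.
LANG_CODES = {"de", "en", "fr", "nl"}

# Matches [[xx:Title]] style links; the prefix is checked against LANG_CODES below.
LINK_RE = re.compile(r"\[\[\s*([a-z-]+)\s*:\s*([^\]|]+?)\s*\]\]")


def page_text(export_xml):
    """Return the wikitext of the first <text> element, or '' if it is empty."""
    root = ET.fromstring(export_xml)
    for elem in root.iter():
        # Compare only the local tag name so any export schema namespace works.
        if elem.tag.rsplit("}", 1)[-1] == "text":
            return elem.text or ""
    return ""


def interwiki_links(wikitext):
    """Return (language, title) pairs for every interwiki link in the wikitext."""
    return [(lang, title)
            for lang, title in LINK_RE.findall(wikitext)
            if lang in LANG_CODES]


# Illustrative export document; the namespace URI here is an assumption.
SAMPLE = """<mediawiki xmlns="http://www.mediawiki.org/xml/export-0.5/">
  <page>
    <title>Amsterdam</title>
    <revision>
      <text>Amsterdam is a city.

[[de:Amsterdam]]
[[fr:Amsterdam]]
[[Category:Cities]]</text>
    </revision>
  </page>
</mediawiki>"""

EMPTY = "<mediawiki><page><revision><text /></revision></page></mediawiki>"

if __name__ == "__main__":
    print(interwiki_links(page_text(SAMPLE)))
    # An empty <text/> is indistinguishable from "no interwiki links at all".
    print(interwiki_links(page_text(EMPTY)))
```

A bot built this way cannot tell "the server returned no content" from "the
page has no links" unless it checks for an empty text explicitly before
acting on the result.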
--
André Engels, andreengels at gmail.com