It might be due to the page object format changes (posting while traveling, so can't confirm my thoughts) and how it looks up the information in the pickled object
On Friday, August 7, 2015, masti mastigm@gmail.com wrote:
On 08/07/2015 11:50 PM, Merlijn van Deen wrote:
Hi masti,
On 7 August 2015 at 23:11, masti <mastigm@gmail.com mailto:mastigm@gmail.com> wrote:
weblinchecker.py working file deadlinks-wikipedia-pl.dat has a binary format in core. In Compat it was a text file.
This confuses me. As far as I can see, both compat and core use 'pickle' to write the working file (and as far as I can see, compat always has). There are, however, different versions of the pickle format, which could explain the binary/text difference. Could you post an excerpt of the old file?
(dp0 Vhttp:// www.european-athletics.org/european-athletics-awards-night/baldini-says-goodbye-at-the-giro-al-sas.html p1 (lp2 (VStefano Baldini p3 F1404150662.121833 S'404 Not Found' p4 tp5 a(V2010 w lekkoatletyce p6 F1404569856.34401 S'404 Not Found' p7 tp8 a(VStefano Baldini p9 F1404729661.197063 S'404 Not Found' p10 tp11 a(V2010 w lekkoatletyce p12 F1404761790.987255 S'404 Not Found' p13 tp14 a(VStefano Baldini p15 F1413308613.299339 S'404 Not Found' p16 tp17
the script works after copying it to the new directory but it looks it does not recognize information about previously reported links.
Is there an error message posted, is the file just being overwritten, or is there some other kind of issue?
no. no error, it does not overwrite the file. I am still testing wether it adds the records. But it tries to report deadlinks the were previously reported. So this makes me think the script does not recognize properly old records.
Best regards, Merlijn
pywikibot mailing list pywikibot@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikibot
pywikibot mailing list pywikibot@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikibot