Hi! Do you have any idea why, using replace.py on some large dumps, I get this error message:
C:\pywikipedia>replace.py -xml:enwiki-20091128-pages-articles.xml
Please enter the text that should be replaced: impossibletofindword
Please enter the new text: found
Please enter another text that should be replaced, or press Enter to start:
The summary message will default to: Robot: Automated text replacement (-impossibletofindword +found )
Press Enter to use this default message, or enter a description of the
changes your bot will make: test
Reading XML dump...
Traceback (most recent call last):
  File "C:\pywikipedia\pagegenerators.py", line 847, in __iter__
    for page in self.wrapped_gen:
  File "C:\pywikipedia\pagegenerators.py", line 779, in DuplicateFilterPageGenerator
    for page in generator:
  File "C:\pywikipedia\replace.py", line 218, in __iter__
    for entry in self.parser:
  File "C:\pywikipedia\xmlreader.py", line 295, in new_parse
    for rev in self._parse(event, elem):
  File "C:\pywikipedia\xmlreader.py", line 304, in _parse_only_latest
    yield self._create_revision(revision)
  File "C:\pywikipedia\xmlreader.py", line 341, in _create_revision
    redirect=self.isredirect
  File "C:\pywikipedia\xmlreader.py", line 64, in __init__
    self.username = username.strip()
AttributeError: 'NoneType' object has no attribute 'strip'
'NoneType' object has no attribute 'strip'
I updated pywikipedia to the latest revision, but the error persists. As you can see, it does not seem to be related to user-fixes.py or to regular expressions.
Thanks in advance! Davide Bolsi
Does the username fail too? A few minutes ago I sent an e-mail about page titles and page IDs with empty or None values. I hope we can solve this.
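For illustration only, here is a minimal sketch of how such a None value can arise and how it could be guarded against. It is not the actual xmlreader.py code: the helper name safe_strip and the sample revision are hypothetical, and it assumes (this is a guess, not something confirmed in this thread) that the dump contains revisions whose contributor element carries no username. It uses only the standard-library xml.etree.ElementTree.

```python
import xml.etree.ElementTree as ET


def safe_strip(value, default=""):
    """Return value.strip(), or `default` when value is None.

    Hypothetical helper, not part of pywikipedia.
    """
    return default if value is None else value.strip()


# Hypothetical revision fragment: the contributor has no <username> child.
# Whether the real dump looks like this is an assumption.
SAMPLE = """<revision>
  <contributor deleted="deleted" />
  <text>some text</text>
</revision>"""

revision = ET.fromstring(SAMPLE)

# findtext() returns None when the requested element does not exist,
# so calling .strip() on the result directly would raise AttributeError.
username = revision.findtext("contributor/username")

print(repr(username))
print(repr(safe_strip(username)))
```

A guard of this kind applied where the username (and possibly the title and id) is read would avoid the AttributeError, at the cost of silently treating a missing value as an empty string.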
2010/1/3 Davide Bolsi dave28@iol.it
Pywikipedia-l mailing list Pywikipedia-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/pywikipedia-l