jenkins-bot has submitted this change and it was merged.
Change subject: proofreadpage.py: purge IndexPage when Index has no required class set
......................................................................
proofreadpage.py: purge IndexPage when Index has no required class set
Purge index if class="qualityN prp-pagequality-N" is not found in HTML
parsing of IndexPage.
Raise ValueError if page.purge() has no effect.
Bug: T114318
Change-Id: I342df4e5b5e2743dbb5bdb8f55296e07242b6b58
---
M pywikibot/proofreadpage.py
1 file changed, 10 insertions(+), 0 deletions(-)
Approvals:
John Vandenberg: Looks good to me, approved
jenkins-bot: Verified
diff --git a/pywikibot/proofreadpage.py b/pywikibot/proofreadpage.py
index a14e2f2..1bc70cd 100644
--- a/pywikibot/proofreadpage.py
+++ b/pywikibot/proofreadpage.py
@@ -385,6 +385,16 @@
# href="/wiki/Page:xxx.djvu/n"
# title="Page:xxx.djvu/n">m
# </a>
+ # Try to purge or raise ValueError.
+ if not self._soup.find_all('a', attrs=attrs):
+ self.purge()
+ del self._parsed_text
+ self._parsed_text = self._get_parsed_page()
+ self._soup = BeautifulSoup(self._parsed_text, 'html.parser')
+ if not self._soup.find_all('a', attrs=attrs):
+ raise ValueError(
+ 'Missing class="qualityN prp-pagequality-N" in:
%s.'
+ % self)
page_cnt = 0
for a_tag in self._soup.find_all('a', attrs=attrs):
--
To view, visit
https://gerrit.wikimedia.org/r/243032
To unsubscribe, visit
https://gerrit.wikimedia.org/r/settings
Gerrit-MessageType: merged
Gerrit-Change-Id: I342df4e5b5e2743dbb5bdb8f55296e07242b6b58
Gerrit-PatchSet: 3
Gerrit-Project: pywikibot/core
Gerrit-Branch: master
Gerrit-Owner: Mpaa <mpaa.wiki(a)gmail.com>
Gerrit-Reviewer: John Vandenberg <jayvdb(a)gmail.com>
Gerrit-Reviewer: Ladsgroup <ladsgroup(a)gmail.com>
Gerrit-Reviewer: Mpaa <mpaa.wiki(a)gmail.com>
Gerrit-Reviewer: jenkins-bot <>