https://bugzilla.wikimedia.org/show_bug.cgi?id=54563
Web browser: ---
Bug ID: 54563
Summary: Make RegexFilterPageGenerator work on page bodies.
Product: Pywikibot
Version: unspecified
Hardware: All
OS: All
Status: NEW
Severity: normal
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs@lists.wikimedia.org
Reporter: legoktm.wikipedia@gmail.com
Classification: Unclassified
Mobile Platform: ---
Originally from: http://sourceforge.net/p/pywikipediabot/patches/552/
Reported by: loxley
Created on: 2012-05-12 16:09:39
Subject: Make RegexFilterPageGenerator work on page bodies.
Original description:
Make RegexFilterPageGenerator work on page bodies. As suggested by valhallasw. The re.S flag is backwards compatible and allows matching newlines.
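A minimal sketch of why the re.S flag matters once a regex is applied to page bodies rather than titles. The function name filter_by_body and the (title, text) tuples are hypothetical stand-ins for illustration only; this is not the patch itself and not Pywikibot's API.

```python
import re


def filter_by_body(pages, pattern, flags=re.IGNORECASE | re.DOTALL):
    """Yield (title, text) pairs whose text matches the pattern.

    Hypothetical helper for illustration; not part of Pywikibot.
    """
    regex = re.compile(pattern, flags)
    for title, text in pages:
        if regex.search(text):
            yield title, text


# Made-up sample data: a page title paired with its wikitext body.
pages = [
    ("Alpha", "{{Infobox\n| name = Alpha\n}}\nBody text."),
    ("Beta", "Plain text without a template."),
]

pattern = r"\{\{infobox.*\}\}"

# With re.S (re.DOTALL), '.' also matches '\n', so the pattern can span lines.
matched = [title for title, _ in filter_by_body(pages, pattern)]

# Without it, '.*' stops at the first newline and the template is not found.
unmatched = [
    title for title, _ in filter_by_body(pages, pattern, flags=re.IGNORECASE)
]
```

Page titles normally contain no newlines, which is presumably why adding the flag does not change existing title-based matching.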
--- Comment #1 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Version 2
--- Comment #2 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Commandline parameter to filter articles based on their bodies.
--- Comment #3 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Looks good to me. If you have time, could you a) add a command line parameter for this option in the GeneratorFactory, and b) change the re.I and re.S to the full names (I think it's IGNORECASE and MULTILINE?)
Thanks!
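For reference, in Python's standard re module the long name of re.S is re.DOTALL; re.MULTILINE is the long name of re.M, which is a different flag. A short check, using a made-up two-line sample string:

```python
import re

# Short and long flag names refer to the same values.
same_ignorecase = re.I == re.IGNORECASE
same_dotall = re.S == re.DOTALL
same_multiline = re.M == re.MULTILINE

text = "first line\nsecond line"

# DOTALL changes what '.' matches: it now includes the newline.
dotall_spans_lines = re.search(r"first.*second", text, re.DOTALL) is not None

# MULTILINE only changes '^' and '$'; '.' still stops at the newline.
multiline_spans_lines = re.search(r"first.*second", text, re.MULTILINE) is not None
multiline_anchors_line = re.search(r"^second", text, re.MULTILINE) is not None
```

So the verbose spelling of re.I | re.S would be re.IGNORECASE | re.DOTALL.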
--- Comment #4 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Hi again!
I added a new parameter -articlefilterregex which works on all subsequently given generators.
Regards,
Niki
--- Comment #5 from Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com ---
Hi Valhallasw!
Thanks for the feedback. I changed the flags to their verbose representation and will add the command-line switch within the next two days.
Regards
Niki
Kunal Mehta (Legoktm) legoktm.wikipedia@gmail.com changed:
What    |Removed |Added
----------------------------------------------------------------------------
See Also|        |https://sourceforge.net/p/pywikipediabot/patches/552
xqt info@gno.de changed:
What     |Removed |Added
----------------------------------------------------------------------------
CC       |        |info@gno.de
Component|General |pagegenerators
pywikipedia-bugs@lists.wikimedia.org