The problem gets worse
pwb.py listpages -ns:0 -start -grep:"REDIRECT" -format:"{page.title}"
won't spot pages with #REDIRECT, and
pwb.py listpages -ns:0 -start -format:"{page.title}"
won't list pages with just a #REDIRECT line, though it will if they have
extra text
pwb.py replace -page:"David" "#REDIRECT" "@redirect" works
pwb.py replace -start "REDIRECT" "@redirect" will ignore the page
David if it just contains the # line
So whatever pywikibot does for page selection, a #REDIRECT line upsets it
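
One possible explanation, not verified against the pywikibot source here, is that the -start generator drops redirect pages before -grep ever sees them. If the goal is only to list the redirects, a workaround is to skip -grep and ask the API for redirect pages directly. The sketch below assumes `site` is a pywikibot APISite and relies on its `allpages()` method with `filterredir=True`; the helper name `redirect_titles` is made up for illustration.

```python
def redirect_titles(site, namespace=0):
    """Yield the titles of redirect pages in one namespace.

    `site` is assumed to be a pywikibot APISite, e.g. pywikibot.Site().
    """
    # filterredir=True asks the API to return redirect pages only
    for page in site.allpages(namespace=namespace, filterredir=True):
        yield page.title()
```

With a configured pywikibot install, something like `for t in redirect_titles(pywikibot.Site()): print(t)` should print one redirect title per line.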
On Sat, 29 Aug 2020 at 10:31, John Bray <johnbray822(a)gmail.com> wrote:
Escaping the # makes no difference
pwb.py listpages -ns:0 -start -grep:"\#[Rr]" -format:"{page.title}"
only returns pages with # in the body of URLs
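
That escaping changes nothing is consistent with how Python's re module treats '#': it is an ordinary character unless the pattern is compiled with re.VERBOSE. The sketch below assumes -grep behaves like a plain `re.search` over the page text (an assumption about pywikibot, not something confirmed in this thread); `grep_matches` is a hypothetical helper.

```python
import re

REDIRECT_LINE = "#REDIRECT [[United Kingdom]]"


def grep_matches(pattern, text):
    """Return True if `pattern` is found anywhere in `text`."""
    return re.search(pattern, text) is not None


# In default mode '#' is a literal, escaped or not
plain = grep_matches(r"#REDIRECT", REDIRECT_LINE)
escaped = grep_matches(r"\#[Rr]", REDIRECT_LINE)

# Only under re.VERBOSE does an unescaped '#' start a comment,
# which leaves an empty pattern that matches the empty string
verbose = re.search(r"#REDIRECT", REDIRECT_LINE, re.VERBOSE)

print(plain, escaped, repr(verbose.group()))
```

Both the plain and the escaped pattern match the redirect line here, so the regex itself is unlikely to be what hides the #REDIRECT pages.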
On Wed, 19 Aug 2020 at 10:48, Strainu <strainu10(a)gmail.com> wrote:
>
> '#' is special in Python, maybe the regex gets post-processed into becoming a comment? Try escaping it.
>
> Strainu
>
> On Tuesday, 18 August 2020, John Bray <johnbray822(a)gmail.com> wrote:
>>
>> I'd like to get all the redirect pages out of a wiki, but
>>
>> pwb.py listpages -start:UK -grep:'#REDIRECT' -format:"{page.title}" -get
>>
>> produces nothing, but
>>
>> pwb.py listpages -start:UK -grep:'United Kingdom' -format:"{page.title}" -get
>>
>> produces, as expected
>>
>> UK
>> #REDIRECT [[United Kingdom]]
>>
>> pwb.py listpages -start:UK -grep:'#' -format:"{page.title}" -get
>>
>> picks up pages with a # in them, but not any of the #REDIRECTs
>>
>> why isn't -grep just parsing the page fully?
>>
>> John
>>
>> _______________________________________________
>> pywikibot mailing list
>> pywikibot(a)lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/pywikibot