[Toolserver-l] Looking for utility to perform text search in dump

Simon Walker stwalkerster at googlemail.com
Sat Jul 25 12:46:09 UTC 2009


Does AWB not do something along those lines?

2009/7/25 Danny B. <Wikipedia.Danny.B at email.cz>

> Hello,
>
> I'm looking for any kind of tool which would take the XML dump (most
> probably the pages-meta-current.xml.bz2, at least the
> pages-articles.xml.bz2) and would return the list of page titles (or
> alternatively/configurably page ids) of pages containing given string.
>
> Does anybody have such (kind of) tool and is willing to share? Both command
> line or webpage interface are OK.
>
> Thank you.
>
>
> Danny B.
>
> _______________________________________________
> Toolserver-l mailing list
> Toolserver-l at lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/toolserver-l
>



-- 
Regards,

Simon Walker
User:Stwalkerster on all public Wikimedia Foundation wikis
Administrator on the English Wikipedia
Developer of Helpmebot, the ACC tool, and Nubio 2 FAQ repository
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.wikimedia.org/pipermail/toolserver-l/attachments/20090725/4734e49f/attachment.htm 


More information about the Toolserver-l mailing list