Does AWB not do something along those lines?

2009/7/25 Danny B. <Wikipedia.Danny.B@email.cz>
Hello,

I'm looking for any kind of tool which would take the XML dump (most probably the pages-meta-current.xml.bz2, at least the pages-articles.xml.bz2) and would return the list of page titles (or alternatively/configurably page ids) of pages containing given string.

Does anybody have such (kind of) tool and is willing to share? Both command line or webpage interface are OK.

Thank you.


Danny B.

_______________________________________________
Toolserver-l mailing list
Toolserver-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/toolserver-l



--
Regards,

Simon Walker
User:Stwalkerster on all public Wikimedia Foundation wikis
Administrator on the English Wikipedia
Developer of Helpmebot, the ACC tool, and Nubio 2 FAQ repository