2009/10/23 Robert Ullmann rlullmann@gmail.com:
Hi Hippietrail!
What do you mean by "intractably slow"? Just how fast must it be?
If I do http://en.wikipedia.org/w/api.php?action=query&list=embeddedin&eitit... it says (on one given try) that it was served in 0,047 seconds. How long can it take to read them all? A few minutes?
Yes I found how to get it through the API now. It was actually just the Toolserver database that was intractably slow.
Seems to me that time would be swamped by the time it takes to pull the wikitext for the pages?
And methinks you might be trying too hard to parse the text, some fairly simple regex or such can extract the template invocation and the parameters; people use it in a pretty regular way.
I've been spending hours on the parsing now and don't find it simple at all due to the fact that templates can be nested. Just extracting the Infobox as one big lump is hard due to the need to match nested {{ and }}
Andrew Dunbar (hippietrail)
Oh, and do remember to look for "Template:Infobox language" as well, depending on which way you find them.
Robert
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l