If I use
http://en.wikipedia.org/w/api.php?action=query&prop=extracts&titles=...
or
http://en.wikipedia.org/w/api.php?action=query&prop=extracts&titles=...
it gets a plain text extract of a page's main content nicely - but ignores anything inside tables. Is there a way round that?
Thanks; Andrew
On 20.04.2012, 19:38 Andrew wrote:
If I use
http://en.wikipedia.org/w/api.php?action=query&prop=extracts&titles=...
or
http://en.wikipedia.org/w/api.php?action=query&prop=extracts&titles=...
it gets a plain text extract of a page's main content nicely - but ignores anything inside tables. Is there a way round that?
Tables are removed because they're often used for stuff unrelated to article text such as infoboxes and maintenance templates. Indeed, some tables could be converted to text, but how do we differentiate between useful and not useful tables?
mediawiki-api@lists.wikimedia.org