https://bugzilla.wikimedia.org/show_bug.cgi?id=70682
Bug ID: 70682
Summary: Implement extracts in Pywikibot
Product: Pywikibot
Version: core (2.0)
Hardware: All
OS: All
Status: NEW
Severity: enhancement
Priority: Unprioritized
Component: General
Assignee: Pywikipedia-bugs@lists.wikimedia.org
Reporter: maarten@mdammers.nl
Web browser: ---
Mobile Platform: ---
MediaWiki has the extracts API module. It should be implemented in Pywikibot too.
* prop=extracts (ex) *
  Returns plain-text or limited HTML extracts of the given page(s)
  https://www.mediawiki.org/wiki/Extension:TextExtracts#API

This module requires read rights
Parameters:
  exchars         - How many characters to return, actual text returned might be slightly longer.
                    The value must be no less than 1
  exsentences     - How many sentences to return
                    The value must be between 1 and 10
  exlimit         - How many extracts to return
                    No more than 20 (20 for bots) allowed
                    Default: 1
  exintro         - Return only content before the first section
  explaintext     - Return extracts as plaintext instead of limited HTML
  exsectionformat - How to format sections in plaintext mode:
                      plain - No formatting
                      wiki  - Wikitext-style formatting == like this ==
                      raw   - This module's internal representation (section titles prefixed with <ASCII 1><ASCII 2><section level><ASCII 2><ASCII 1>)
                    One value: plain, wiki, raw
                    Default: wiki
  excontinue      - When more results are available, use this to continue
  exvariant       - Convert content into this language variant
Example:
  Get a 175-character extract:
    api.php?action=query&prop=extracts&exchars=175&titles=Therion
https://nl.wikipedia.org/w/api.php?action=query&prop=extracts&exchar...
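For illustration only, here is a minimal Python sketch of how the request above could be built and its result read without Pywikibot. It is not Pywikibot code: the helper names (build_extract_url, parse_extracts, fetch_extract) and the User-Agent string are hypothetical, and the response layout (query -> pages -> <pageid> -> extract) is assumed from the standard JSON output of action=query.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen


def build_extract_url(api_url, title, chars=175, plaintext=True):
    """Build a prop=extracts query URL for a single page title."""
    params = {
        "action": "query",
        "prop": "extracts",
        "exchars": chars,
        "titles": title,
        "format": "json",
    }
    if plaintext:
        # explaintext is a flag: its presence requests plain text.
        params["explaintext"] = ""
    return api_url + "?" + urlencode(params)


def parse_extracts(response):
    """Map page title -> extract text from a decoded API response."""
    pages = response.get("query", {}).get("pages", {})
    return {
        page["title"]: page["extract"]
        for page in pages.values()
        if "extract" in page  # missing pages carry no extract
    }


def fetch_extract(api_url, title, chars=175):
    """Fetch and parse extracts (performs a network request)."""
    request = Request(
        build_extract_url(api_url, title, chars),
        headers={"User-Agent": "extracts-sketch/0.1"},  # hypothetical UA
    )
    with urlopen(request) as reply:
        return parse_extracts(json.load(reply))
```

Something like fetch_extract("https://nl.wikipedia.org/w/api.php", "Therion") would then return a dict keyed by page title. A Pywikibot implementation would presumably go through its own API request layer instead of urllib.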
John Mark Vandenberg jayvdb@gmail.com changed:
What     |Removed |Added
----------------------------------------------------------------------------
See Also |        |https://bugzilla.wikimedia.org/show_bug.cgi?id=54569
John Mark Vandenberg jayvdb@gmail.com changed:
What |Removed |Added
----------------------------------------------------------------------------
CC   |        |jayvdb@gmail.com
--- Comment #1 from John Mark Vandenberg jayvdb@gmail.com ---
How do you intend to use this?
--- Comment #2 from Maarten Dammers maarten@mdammers.nl ---
I'm already using it to extract dates of birth and death. Extracts already strips the infobox template and the image, so I don't have to do that myself.
--- Comment #3 from John Mark Vandenberg jayvdb@gmail.com ---
Why not extract those dates from the infobox?
--- Comment #4 from Maarten Dammers maarten@mdammers.nl ---
A lot of articles don't have an infobox with this information.
pywikipedia-bugs@lists.wikimedia.org