Yep, that did it, thanks! (I never tried including both export and exportnowrap, just one or the other and it was defining things like revisionids as element attributes instead of children.)
Andrea
On Thu, Jul 14, 2011 at 2:35 PM, Roan Kattouw roan.kattouw@gmail.com wrote:
On Thu, Jul 14, 2011 at 6:58 PM, Andrea Forte andrea.forte@gmail.com wrote:
I'm trying to download the complete revision history for just a few pages. The options, as I see it, are using the API or special:export. The API returns XML that is formatted differently than special:export and I already have a set of parsers that work with special:export data so I'm inclined to go with that.
You can use api.php?action=query&export&exportnowrap&titles=Foo|Bar|Baz , that should give you the same format.
Roan Kattouw (Catrope)
Ok, sorry, I spoke too soon. Yes, it formatted properly but it seems that the "export" parameter only exports the most recent revision and if I add the revision property to ask for more than one, it defaults back to the other format. :-\ So I'm still not able to get the whole edit history of one page in the special:export format. leSigh.
Andrea
On Thu, Jul 14, 2011 at 3:18 PM, Andrea Forte andrea.forte@gmail.com wrote:
Yep, that did it, thanks! (I never tried including both export and exportnowrap, just one or the other and it was defining things like revisionids as element attributes instead of children.)
Andrea
On Thu, Jul 14, 2011 at 2:35 PM, Roan Kattouw roan.kattouw@gmail.com wrote:
On Thu, Jul 14, 2011 at 6:58 PM, Andrea Forte andrea.forte@gmail.com wrote:
I'm trying to download the complete revision history for just a few pages. The options, as I see it, are using the API or special:export. The API returns XML that is formatted differently than special:export and I already have a set of parsers that work with special:export data so I'm inclined to go with that.
You can use api.php?action=query&export&exportnowrap&titles=Foo|Bar|Baz , that should give you the same format.
Roan Kattouw (Catrope)
-- :: Andrea Forte :: Assistant Professor :: College of Information Science and Technology, Drexel University :: http://www.andreaforte.net
I'm not familiar with Special:Export (I use the api.php) but looking at page http://www.mediawiki.org/wiki/Manual:Parameters_to_Special:Export , it seems there is a parameter "history". I added it to your query and it seems to work, i.e. I get a bunch of "revisions" tags in the xml.
http://en.wikipedia.org/w/index.php?title=Special:Export&pages=WSQ&o...
Take into account that the previous query is for page "WSQ", a page I choose randomly because it has a very short history, 7 edits overall, see http://en.wikipedia.org/w/index.php?title=WSQ&action=history
I tried the same Special:Export on the page "Wikipedia" in the browser and it does not finish on my computer, maybe with a curl POST it will but I doubt it. Hope it helps.
P.
On Thu, Jul 14, 2011 at 10:53 PM, Andrea Forte andrea.forte@gmail.com wrote:
Ok, sorry, I spoke too soon. Yes, it formatted properly but it seems that the "export" parameter only exports the most recent revision and if I add the revision property to ask for more than one, it defaults back to the other format. :-\ So I'm still not able to get the whole edit history of one page in the special:export format. leSigh.
Andrea
On Thu, Jul 14, 2011 at 3:18 PM, Andrea Forte andrea.forte@gmail.com wrote:
Yep, that did it, thanks! (I never tried including both export and exportnowrap, just one or the other and it was defining things like revisionids as element attributes instead of children.)
Andrea
On Thu, Jul 14, 2011 at 2:35 PM, Roan Kattouw roan.kattouw@gmail.com wrote:
On Thu, Jul 14, 2011 at 6:58 PM, Andrea Forte andrea.forte@gmail.com wrote:
I'm trying to download the complete revision history for just a few pages. The options, as I see it, are using the API or special:export. The API returns XML that is formatted differently than special:export and I already have a set of parsers that work with special:export data so I'm inclined to go with that.
You can use api.php?action=query&export&exportnowrap&titles=Foo|Bar|Baz , that should give you the same format.
Roan Kattouw (Catrope)
-- :: Andrea Forte :: Assistant Professor :: College of Information Science and Technology, Drexel University :: http://www.andreaforte.net
-- :: Andrea Forte :: Assistant Professor :: College of Information Science and Technology, Drexel University :: http://www.andreaforte.net
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
wiki-research-l@lists.wikimedia.org