Dear Diane, I cannot give you an answer on your original question, but maybe I can still help. For what exactly do you need the data?
For the JWPL DataMachine, you won't need the pages-meta-history files - only meta-current, which is available as a single file. For the RevisionMachine, you can define multiple input files. Consequently, there is no problem using the archives without recombining them.
Only in the case you want to recreate an old/historic dump (or a series of old dumps) from the current history dump using the TimeMachine, will you need the pages-meta-history files recombined. Is this the case?
Best, Oliver
-- ------------------------------------------------------------------- Oliver Ferschke, M.A. Doctoral Researcher Ubiquitous Knowledge Processing Lab FB 20 Computer Science Department Technische Universität Darmstadt Hochschulstr. 10, D-64289 Darmstadt, Germany phone [+49] (0)6151 16-6227, fax -5455, room S2/02/B111 ferschke@tk.informatik.tu-darmstadt.de www.ukp.tu-darmstadt.de Web Research at TU Darmstadt (WeRC) www.werc.tu-darmstadt.de -------------------------------------------------------------------
-----Ursprüngliche Nachricht----- Von: xmldatadumps-l-bounces@lists.wikimedia.org [mailto:xmldatadumps-l-bounces@lists.wikimedia.org] Im Auftrag von Napolitano, Diane Gesendet: Mittwoch, 3. August 2011 17:36 An: xmldatadumps-l@lists.wikimedia.org Betreff: [Xmldatadumps-l] 7/22 enwiki dump pages-meta-history
Hello, are there any plans to combine all of the pages-meta-history XML dumps from the 7/22 dump into one file? This is useful for importing into JWPL.
Thanks,
Diane M. Napolitano Associate Research Engineer Educational Testing Service Turnbull Hall R-239 Princeton, New Jersey 08540
_______________________________________________ Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l